Prototype: unescapse path #269

troydai · 2015-10-20T23:58:28Z

Issue #124

Unescape the path while reading the first line of the request.

Note:

The decoding is implemented in HttpAbstractions and is currently under review: Url Path Decoder HttpAbstractions#448.
A new method is added to MemoryPoolIterator2 that returns the raw char[] instead of a string, so the decoding can operate in place and avoid an additional allocation.
As a result, the change does not add any memory allocation.

Follow up:

Add test cases.
Performance review afterwards.

/cc @halter73 @Tratcher @davidfowl @lodejard @muratg

lodejard · 2015-10-21T18:05:44Z

src/Microsoft.AspNet.Server.Kestrel/Http/Frame.cs

@@ -623,6 +626,11 @@ private bool TakeStartLine(SocketInput input)
                    return false;
                }

+                char[] requestUriChars;
+                var rawPathLength = beginPath.GetCharArray(endPath, out requestUriChars);
+                var decodedLength = UrlPathDecoder.DecodeInPlace(requestUriChars, rawPathLength);


DecodeInPlace should take beginPath/endPath iterators, modify the bytes in-place to unescape % sequences, and return the new endPath position rather than copy to an intermediate char[]. This also means the DecodeInPlace method does not need to understand utf-8 sequences, because the subsequent conversion from byte[] to string via UTF8 encoder will take care of that concern.
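To make the suggestion above concrete, here is a minimal sketch of byte-level in-place unescaping. It is not the UrlPathDecoder from HttpAbstractions#448 and it does not use the MemoryPoolIterator2 API: the class name `PercentDecodeSketch`, the `DecodeInPlace(byte[], int, int)` signature, and the use of a flat byte[] with start/end offsets in place of the beginPath/endPath iterators are all assumptions made for illustration. It also decodes every valid %XX sequence and copies malformed ones through unchanged; which sequences a real server should leave escaped is a policy decision the sketch does not address.

```csharp
using System;
using System.Text;

// Hypothetical helper for illustration only; not part of Kestrel or HttpAbstractions.
public static class PercentDecodeSketch
{
    // Unescapes %XX sequences in buffer[start..end) in place.
    // Returns the new exclusive end offset of the decoded bytes.
    public static int DecodeInPlace(byte[] buffer, int start, int end)
    {
        int read = start;
        int write = start;

        while (read < end)
        {
            byte current = buffer[read];

            // A valid escape needs '%' followed by two hex digits inside the range.
            if (current == (byte)'%' && read + 2 < end)
            {
                int high = HexValue(buffer[read + 1]);
                int low = HexValue(buffer[read + 2]);
                if (high >= 0 && low >= 0)
                {
                    buffer[write++] = (byte)((high << 4) | low);
                    read += 3;
                    continue;
                }
            }

            // Ordinary byte or malformed escape: copy it through unchanged.
            buffer[write++] = current;
            read++;
        }

        return write;
    }

    // Returns the numeric value of an ASCII hex digit, or -1 if it is not one.
    private static int HexValue(byte b)
    {
        if (b >= '0' && b <= '9') return b - '0';
        if (b >= 'a' && b <= 'f') return b - 'a' + 10;
        if (b >= 'A' && b <= 'F') return b - 'A' + 10;
        return -1;
    }
}
```

Because the write position never passes the read position, no second buffer is needed. The decoder itself knows nothing about UTF-8: a caller would pass the bytes up to the returned offset to `Encoding.UTF8.GetString(buffer, start, newEnd - start)`, and multi-byte sequences such as `%C3%A9` would be interpreted at that step.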

To update the memory pool block in place, I need to change UrlPathDecoder significantly so that it accepts two iterator-like objects to operate on. The existing logic depends heavily on random access to verify the UTF-8 encoding.

I think it's doable, but we won't be able to share the decoder, since it will be highly specialized for Kestrel. I'm also worried about the schedule if we redo the decoder at this point. I'm more than willing to make it happen after rc1.

All right. I'm working on a prototype of this idea now, and I'll send a separate PR for it. I'd like to keep this one open as a fallback, though.

This is the prototype: #274

troydai · 2015-10-22T07:40:40Z

Replaced by #274

Prototype: unescapse path

c5e800a

dnfclas added the cla-already-signed label Oct 20, 2015

lodejard reviewed Oct 21, 2015

troydai closed this Oct 22, 2015

troydai deleted the trdai/kestrel124 branch October 23, 2015 22:42
