Add missing Optional types in urllib.parse #3263

robertschweizer · 2019-09-25T15:40:41Z

None values are accepted, and interpreted as empty (byte) strings by
some urllib.parse functions.

This is probably not the desired behavior, but can be useful especially
when passing through values that default to None in outside functions.

Signed-off-by: Schweizer, Robert [email protected]

None values are accepted, and interpreted as empty (byte) strings by some urllib.parse functions. This is probably not the desired behavior, but can be useful especially when passing through values that default to None in outside functions. Signed-off-by: Schweizer, Robert <[email protected]>

gvanrossum · 2019-09-25T15:44:29Z

I see a problem though. When I call e.g. parse_qs(None) what's the return type?

robertschweizer · 2019-09-25T15:52:48Z

It returns an empty dictionary, which fits the return type annotation and is the same behavior as for empty strings.

None is interpreted as an empty byte string though. E.g. urlparse(None) returns a ParseResultBytes instance.

gvanrossum · 2019-09-25T18:18:05Z

Hm, that's unfortunate (choosing bytes is arbitrary). However this should occur rarely -- usually the static type of the argument will be Optional[bytes] or Optional[Text], not None.

I'll leave it to others to approve this PR.

srittau · 2019-10-08T17:44:58Z

I am not sure whether we should change the types. Allowing None seems not to be a conscious design decision, but just a side-effect of using if x to check for an empty string in _decode_args(). Relying on this to work is one of the things that a type checker is supposed to prevent, in my opinion.

gvanrossum

Looking at this line in urllib/parse.py:

    return tuple(x.decode(encoding, errors) if x else '' for x in args)

that looks pretty intentional to skip None -- if x were b'', then x.decode(...) would return '' so there would be no need for the if x else '' part.

This is in _decode_args() which is called from _coerce_args() which is called from parse_qs() and from at least most of other functions touched in this PR (I didn't bother to check all).

So I think it's intentional. @robertschweizer is this how you found this? (By looking for callers of _coerce_args().) For anything that calls _coerce_args() I think this is by design (or at least it's not going to be changed out of fear of breaking existing usage).

gvanrossum approved these changes Oct 9, 2019

View reviewed changes

srittau merged commit 3ee8fc2 into python:master Oct 9, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Add missing Optional types in urllib.parse #3263

Add missing Optional types in urllib.parse #3263

Uh oh!

robertschweizer commented Sep 25, 2019

Uh oh!

gvanrossum commented Sep 25, 2019

Uh oh!

robertschweizer commented Sep 25, 2019

Uh oh!

gvanrossum commented Sep 25, 2019

Uh oh!

srittau commented Oct 8, 2019

Uh oh!

gvanrossum left a comment

Uh oh!

Uh oh!

Uh oh!

Add missing Optional types in urllib.parse #3263

Add missing Optional types in urllib.parse #3263

Uh oh!

Conversation

robertschweizer commented Sep 25, 2019

Uh oh!

gvanrossum commented Sep 25, 2019

Uh oh!

robertschweizer commented Sep 25, 2019

Uh oh!

gvanrossum commented Sep 25, 2019

Uh oh!

srittau commented Oct 8, 2019

Uh oh!

gvanrossum left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!