Differentiate between 'url.host' and 'url.raw_host' #1590

tomchristie · 2021-04-22T12:18:21Z

Throughout our URL model we're differentiating neatly between byte-wise cases and str cases.
We're always using bytes when escaping is not applied, and str when escaping is applied.

Eg...

url = httpx.URL("https://jo%40email.com:a%20secret@example.com:1234/pa th")
assert url.username == "jo@email.com"
assert url.password == "a secret"
assert url.userinfo == b"jo%40email.com:a%20secret"
assert url.path == "/pa th"
assert url.raw_path == b"/pa%20th"

This pull request is a proposal for treating IDNA domain names similarly, so...

url = httpx.URL("https://müller.de:80")
assert url.host == "müller.de"
assert url.raw_host == b"xn--mller-kva.de"

For API consistency this also necessarily results in url.netloc becoming a byte interface, which actually makes sense for the contexts in which it is used.

url = httpx.URL("https://müller.de:80")
assert url.netloc == b"xn--mller-kva.de:80"

Finally we also introduce .raw_scheme for a byte-wise representation of the scheme, for a nice consistency so that:

url = httpx.URL("https://müller.de:80/pa th")
assert url.raw == (url.raw_scheme, url.raw_host, url.port, url.raw_path)
assert url.raw == (b"https", b"xn--mller-kva.de", 80, b"/pa%20th")

StephenBrown2 · 2021-04-22T17:48:34Z

Wouldn't raw indicate unencoded?

StephenBrown2

Just a couple typos, and the previous question about what raw means.

httpx/_models.py

Co-authored-by: Stephen Brown II <Stephen.Brown2@gmail.com>

tomchristie · 2021-04-23T08:11:12Z

Raw, as in the "the raw bytes on the wire", or "the raw ingredients that make up the cake".
The raw representation of the host is the actual unaltered bytewise representation that's used to make the connection.

Or, in baking...

The raw ingredients: \xf0\x9f\x8e\x82
The cake: 🎂

Similar usage of "Raw" in other technical docs.

tomchristie · 2021-04-23T10:00:45Z

Thanks so much for the review @StephenBrown2.
(Geez, me & my typos. 😬)

StephenBrown2 · 2021-04-23T15:40:38Z

(Geez, me & my typos. 😬)

I think it was mainly just a copy-paste issue that got propagated, but no more normlized! :-p

Differentiate between 'url.host' and 'url.raw_host'

3b8ab07

tomchristie added the user-experience Ensuring that users have a good experience using the library label Apr 22, 2021

Marginally neater .netloc implementation

b885169

StephenBrown2 approved these changes Apr 22, 2021

View reviewed changes

httpx/_models.py Outdated Show resolved Hide resolved

httpx/_models.py Outdated Show resolved Hide resolved

httpx/_models.py Outdated Show resolved Hide resolved

httpx/_models.py Outdated Show resolved Hide resolved

httpx/_models.py Outdated Show resolved Hide resolved

tomchristie and others added 5 commits April 23, 2021 09:03

Update httpx/_models.py

7166a1e

Co-authored-by: Stephen Brown II <Stephen.Brown2@gmail.com>

Update httpx/_models.py

6a144f4

Co-authored-by: Stephen Brown II <Stephen.Brown2@gmail.com>

Update httpx/_models.py

d56c82e

Co-authored-by: Stephen Brown II <Stephen.Brown2@gmail.com>

Update httpx/_models.py

e53aeaf

Co-authored-by: Stephen Brown II <Stephen.Brown2@gmail.com>

Update httpx/_models.py

c946720

Co-authored-by: Stephen Brown II <Stephen.Brown2@gmail.com>

tomchristie merged commit 39d8ee6 into master Apr 23, 2021

tomchristie deleted the raw-host branch April 23, 2021 10:00

tomchristie mentioned this pull request Apr 27, 2021

Version 0.18.0 #1576

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Differentiate between 'url.host' and 'url.raw_host' #1590

Differentiate between 'url.host' and 'url.raw_host' #1590

tomchristie commented Apr 22, 2021 •

edited

Loading

StephenBrown2 commented Apr 22, 2021

StephenBrown2 left a comment

tomchristie commented Apr 23, 2021 •

edited

Loading

tomchristie commented Apr 23, 2021

StephenBrown2 commented Apr 23, 2021

Differentiate between 'url.host' and 'url.raw_host' #1590

Differentiate between 'url.host' and 'url.raw_host' #1590

Conversation

tomchristie commented Apr 22, 2021 • edited Loading

StephenBrown2 commented Apr 22, 2021

StephenBrown2 left a comment

Choose a reason for hiding this comment

tomchristie commented Apr 23, 2021 • edited Loading

tomchristie commented Apr 23, 2021

StephenBrown2 commented Apr 23, 2021

tomchristie commented Apr 22, 2021 •

edited

Loading

tomchristie commented Apr 23, 2021 •

edited

Loading