httpx
httpx copied to clipboard
HTTP/3 support.
It would be nice to start planning support for HTTP/3. aioquic provides a sans-I/O API for HTTP/3 similar to h2 which would make such an integration possible.
The main hurdle is that the connection model is very different to HTTP/1.1 and HTTP/2 : we cannot simply establish a (reader, writer) pair over a TCP connection and pass this to the protocol layer. Between the HTTP/3 layer and the UDP transport, we have a QUIC layer which returns (datagram, addr) tuples indicating where datagrams need to be sent:
https://aioquic.readthedocs.io/en/latest/quic.html#aioquic.quic.connection.QuicConnection.datagrams_to_send
Note that the destination address may change during the connection's lifetime, for example in response to receiving a PreferredAddress from the server during the TLS handshake.
In order to provide a first demonstration, @tomchristie suggested I write a dispatcher which makes use of aioquic's HTTP/3 support. The code is still a bit rough, but it works:
https://github.com/aiortc/aioquic/blob/master/examples/httpx_client.py
This can be run as:
python examples/httpx_client.py https://cloudflare-quic.com/
See also https://github.com/encode/httpcore/issues/173
I'm not too sure how to move this forward. Ideally I'd like to have a WIP pull request which we can rebase / amend until we're happy with the result. However I haven't really got a feel for how the code should integrate with httpx's connection management. Any hints on how to approach this problem?
We'll need to add UDP support to our backend interface, that's definitely step 1.
After that we need to see how we want to implement client storage so we can remember things like Alt-Svc headers between requests.
Next would be landing an HTTP/3
specifier along with the H3Connection
dispatcher. I'm not sure how compatible aioquic is with an SSLContext object (probably not at all?) so that will take some tweaking. And because HTTP/3 is known to not work in some routes we'll have to implement "fallback to TCP and forget the H3 alt svc" on the HTTPConnection object somehow.
Anything I'm missing @encode/httpx-maintainers?
I don't know enough about HTTP/3 to tell (for example, I'm not sure what the differences between QUIC and HTTP/3 and the associated aioquic APIs are?), but the approach highlighted by @sethmlarson sounds sensible to me.
One thing I'm sure of is that we'll need to chunk this into multiple PRs, otherwise we risk ending up with a 1000 LOC+ PR that's a pain for everyone to look at. That way we can more easily track progress too.
We'll need to add UDP support to our backend interface, that's definitely step 1.
Yup, and UDP support is definitely going to be a big chunk.
HTTPX is currently very much coupled to TCP. In particular, right now a given HTTPConnection
doesn't know about other connections, but from what you've said it seems like a QUIC "connection" needs to know about the surrounding connections to do packet routing (though it might be a defining property of UDP?), correct? Anyway, there'll be some refactoring needed before supporting UDP to clarify what is TCP-specific and what is not.
After that we need to see how we want to implement client storage so we can remember things like Alt-Svc headers between requests.
I think this should be an improvement over a basic design, right? (Under the hypothesis that remembering Alt-Svc headers will only allow us to skip the protocol switching phase of establishing an HTTP/3 connection.)
So, in terms of planning, I tried to think about the various steps in a bit more detail. I could see something like:
- Rename
BaseStream
/Stream
/ConcurrencyBackend.connect()
toBaseTCPStream
/TCPStream
/ConcurrencyBackend.connect_tcp()
. - Add UDP support. This will most likely include building an UDP version of
TCPStream
and the associated methods onConcurrencyBackend
. At this point, the resulting updatedConcurrencyBackend
(with an asyncio implementation) can be unit-tested against a very basic asyncio UDP server. - Add
httpx.dispatch.http3.HTTP3Connection
, an HTTP/3 implementation via aioquic. To reduce the scope of this step, we can set some constraints:-
HTTPConnection.connect()
should try to connect via HTTP/3 and create aUDPStream
/H3Connection
if and only ifHTTP/3
is one of the givenhttp_versions
(e.g.Client(http_versions=["HTTP/3"])
. This way, we don't need to think about Alt-Svc and the switch between TCP and UDP yet. - All QUIC connections can be stored on
H3Connection
directly (similar to how we have oneh11.H1Connection
on ourH1Connection
class), without thinking about sharing those connections yet (provided we need to share those at all?).
-
- Add Alt-Svc support: always connect via TCP first, and only if
Alt-Svc
is received then establish a newH3Connection
. (Will this require to modifyHTTP11Connection
andHTTP2Connection
so that they are able to processAlt-Svc
header in the response? Is raising an exception inHTTP11Connection.send()
andHTTP2Connection.send()
and catching it inConnection.send()
a sensible enough way of signalling we should reconnect via UDP-HTTP/3?) - Optimize connection making by remembering Alt-Svc headers.
- Implement sharing of QUIC connections between
HTTPConnection
s — if relevant?
Some resources I've come across:
- HTTP/3 Explained: https://http3-explained.haxx.se/en/
- asyncio UDP client example: UDP Echo Client
- Trio UDP discussion w/ usage hints: https://github.com/python-trio/trio/issues/537
- Hypercorn initial implementation: https://gitlab.com/pgjones/hypercorn/commit/7c1f519fe117f3fbcc6a62fcccdae2e8dfe0b61a
Lots of great info @florimondmanca!! 🎉
I'm going to chime in here again at the idea of using AnyIO and adding our stream interface to that?
I'm going to chime in here again at the idea of using AnyIO and adding our stream interface to that?
As in, using AnyIO (see #296) to provide an ConcurrencyBackend.connect_udp()
implementation for all backends on top of their UDP sockets API? I'm not against the idea but it might be a bit too much for this particular issue. It could be a preliminary step but we haven't decided yet on whether we should switch over to AnyIO.
Edit: also, since AnyIO mandates the same "strict context management everywhere" approach than Trio, it's highly probable we'd need to refactor some internals to comply with that requirement. One item in particular is the Stream
interface, which trio/AnyIO primarily expose as a context manager, although it's possible to .aclose()
manually.
HTTPX is currently very much coupled to TCP. In particular, right now a given
HTTPConnection
doesn't know about other connections, but from what you've said it seems like a QUIC "connection" needs to know about the surrounding connections to do packet routing (though it might be a defining property of UDP?), correct? Anyway, there'll be some refactoring needed before supporting UDP to clarify what is TCP-specific and what is not.
I don't see why the different connections should have any kind of coupling. The most straightforward approach is going to be opening a distinct UDP socket for every QUIC/HTTP3 connection so there will be a one-to-one mapping between socket and QUIC/HTTP3 connection.
At a high level, an HTTP3 connection is going to be very similar to an HTTP2 connection: you can run any number of requests on top of your connection.
I think this should be an improvement over a basic design, right? (Under the hypothesis that remembering Alt-Svc headers will only allow us to skip the protocol switching phase of establishing an HTTP/3 connection.)
HTTP3 support has landed in cURL and more recently in Chrome/canary so I'd suggest looking into how they handle Alt-Svc.
Some resources I've come across:
You might want to add aioquic's demo HTTP client:
https://github.com/aiortc/aioquic/blob/master/examples/http3_client.py
I tried to follow the guide written by @tomchristie above.
The first step was
Rename BaseStream / Stream / ConcurrencyBackend.connect() to BaseTCPStream / TCPStream / ConcurrencyBackend.connect_tcp().
It has been implemented in https://github.com/encode/httpx/pull/339
Having found the PR and the changes, I wanted to look around it and found that neither BaseTCPStream
nor TCPStream
nor ConcurrencyBackend
are presented in httpx
/httpcore
repos. Moreover the file structure of he repo has significantly changed from the PR.
@florimondmanca , @tomchristie
Do you have a time to provide us with advice, where the process should start? Should it be some sort of new connection to httpcore._async
/httpcore._sync
with the aioquic
backend?
Update Probably the first step might be to add "open_quic_connection" to existing backends?
@cdeler Well, it's about as complex as you could get for a contribution, but if you're up for it then I can certainly put in the guidance for a sensible way to approach it.
The keyword here, as with anything like this, will be incremental. 😀
- I'd start off with a pull request that just adds an
http3
flag to the ConnectionPool configuration, in the same way as the existinghttp2
flag. It should also be passed through to theHTTPConnection
instance. To begin with it shouldn't actually do anything, it should simply be there. - ~The next step could be to "notice" if an HTTP/3 service is advertised. If an HTTP/1.1 or HTTP/2 connection gets a response with an
Alt-Svc: h3
header, and ifhttp3=True
has been set, then aNotImplementedError
exception should be raised.~
At that point we've got a stub behaviour for detecting HTTP/3 support, which we can start to iterate on.
The keyword here, as with anything like this, will be incremental. 😀
it's smart behaviour to make incremental changes, I'm happy to try doing that
Slight update - let's scratch the second part of that, actually I think we'll want to use the HTTP/2
frame-type ALTSVC
to detect if we should upgrade or not.
We'll use that because we don't want to wait for response headers before we decide on if we should upgrade. The ALTSVC
frame can be sent during the opening handshake, before the request itself is made. We'll probably need to end up doing some investigation into exactly when implementations choose to send this frame, before we're able to proceed from there.
It's looking to me like httpx
should never end up making an HTTP/3 request on an initial outgoing request, because either:
- We see the upgrade in an
Alt-Svc
response headers, in which case we've already sent the request, and started receiving the response, not much point in tearing the connection down. - We might potentially see an
ALTSVC
HTTP/2 frame, but we don't want to block on waiting for that before starting to send a request (since it may not exist).
So I think the best we'll be able to do is storing altsvc
information whenever it comes through, and potentially making subsequent requests over HTTP/3 using that information.
@cdeler — Sure, it was posted by Jeremy (OP) in the issue description. :-) Obviously looks outdated by now since some bits of HTTPX API have changed, but I'm sure it's been discussed earlier in this thread?
Any way to make this work again? Currently getting NotImplementedError
I've submitted https://github.com/aiortc/aioquic/pull/203 to update the httpx demo for recent httpcore versions.
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
Bump still important.
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
Bump, still relevant.
Any updates?
bump
+
So, I'm absolutely in favour of someday having support for HTTP/3, yup.
But also... "Any updates?", "bump", "+" aren't valuable in any way.
More interesting would be illustrations of genuine motivations for wanting HTTP/3 support. What context are you using it in & why do you want a client that supports it? I'm interested in cases that go beyond "I want it because it exists".
More interesting would be illustrations of genuine motivations for wanting HTTP/3 support. What context are you using it in & why do you want a client that supports it? I'm interested in cases that go beyond "I want it because it exists".
Well at SearXNG we would want it in order to decrease the loading time when fetching some data from a website that support HTTP3. This would make the experience much snappier for the users of SearXNG.
So, I'm absolutely in favour of someday having support for HTTP/3, yup.
But also... "Any updates?", "bump", "+" aren't valuable in any way.
More interesting would be illustrations of genuine motivations for wanting HTTP/3 support. What context are you using it in & why do you want a client that supports it? I'm interested in cases that go beyond "I want it because it exists".
I am a solana dev. they just announced that they are using quic as an option.
I am a contributor to the solana python library. I also build stuff on top that.
I tried using aioquic. it works, but the latency is worse than using a regular http/1.1 using httpx.
~1.4s on aioquic, and ~0.7s on httpx. (for two concurrent request from initial request time, until closing the connection.)
I don't know if the reason is because aioquic has to bind to the raw ip socket first.
I tried using aioquic. it works, but the latency is worse than using a regular http/1.1 using httpx.
This doesn't surprise me. My expectation is that in Python, using HTTP/3 will generally be slower, because of the increased complexity.
I can get that it might be useful for us to include for general debugging/tooling purposes, tho.
I tried using aioquic. it works, but the latency is worse than using a regular http/1.1 using httpx.
This doesn't surprise me. My expectation is that in Python, using HTTP/3 will generally be slower, because of the increased complexity.
I can get that it might be useful for us to include for general debugging/tooling purposes, tho.
once a connection has been established though, it works amazingly. bulk requests is excellent using quic.
I tried using aioquic. it works, but the latency is worse than using a regular http/1.1 using httpx.
This doesn't surprise me. My expectation is that in Python, using HTTP/3 will generally be slower, because of the increased complexity.
I can get that it might be useful for us to include for general debugging/tooling purposes, tho.
This does surprise me, measurements I made show good connection setup time and throughput against most servers.
I tried using aioquic. it works, but the latency is worse than using a regular http/1.1 using httpx.
This doesn't surprise me. My expectation is that in Python, using HTTP/3 will generally be slower, because of the increased complexity.
I can get that it might be useful for us to include for general debugging/tooling purposes, tho.
This does surprise me, measurements I made show good connection setup time and throughput against most servers.
were your servers local or on the cloud handling hundreds, possibly thousands of requests per second sir? the post requests made is to a solana rpc.
perhaps I am doing something wrong.
were your servers local or on the cloud handling hundreds, possibly thousands of requests per second sir? the post requests made is to a solana rpc.
perhaps I am doing something wrong.
The servers were distant, the results I mention are runs of aioquic's interop suite against a variety of servers, and the acceptance criterion is that the time to download 5MB and 10MB files over HTTP/3 must no more than 10% over HTTP/1.1 or HTTP/2 (using httpx). I'm not sure the server's load is relevant here as we are talking about client performance.
I don't think you're doing anything wrong, there are definitely some reports of initial connection time outliers using aioquic, which can kill the measured performance. I haven't managed to debug these cases yet but I do believe they are fixable.