Limit concurrent HTTP requests #922

Closed

wants to merge 2 commits

Conversation

Ralith
Contributor

@Ralith Ralith commented Apr 27, 2025

When building a non-trivial project with third-party dependencies fetched via http_archive or similar, e.g. from reindeer in non-vendored mode, buck2 can generate extremely large numbers of outgoing HTTP requests. Hyper does not enforce any limits itself, happily expanding its connection pool with every additional concurrent request. This can cause the remote HTTP server to reject requests, and even lead to buck2 itself failing with "too many open files" errors.

Larger numbers of concurrent requests have rapidly diminishing returns, so while there's no obviously correct limit, any smallish number should improve behavior in most cases. It may even improve total throughput by reducing contention for network bandwidth.

See also discussion in facebookincubator/reindeer#46.
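
For illustration, a minimal sketch of the approach using a tokio Semaphore; the constant, function names, and the placeholder download are illustrative, not the actual buck2 client code:

use std::sync::Arc;

use tokio::sync::Semaphore;

// Hypothetical cap; the PR doesn't prescribe a specific value.
const MAX_CONCURRENT_REQUESTS: usize = 8;

async fn fetch_all(urls: Vec<String>) {
    let semaphore = Arc::new(Semaphore::new(MAX_CONCURRENT_REQUESTS));
    let mut tasks = Vec::new();
    for url in urls {
        let semaphore = Arc::clone(&semaphore);
        tasks.push(tokio::spawn(async move {
            // Wait here until one of the permits is free; this bounds the
            // number of in-flight requests and open sockets.
            let _permit = semaphore.acquire_owned().await.expect("semaphore closed");
            download(&url).await;
            // The permit drops here, releasing the slot only after the
            // download has completed.
        }));
    }
    for task in tasks {
        let _ = task.await;
    }
}

// Placeholder for the actual hyper-based request.
async fn download(url: &str) {
    let _ = url;
}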

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 27, 2025
@facebook-github-bot
Contributor

@facebook-github-bot has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. (Because this pull request was imported automatically, there will not be any future comments.)

@Ralith
Contributor Author

Ralith commented Apr 27, 2025

cc @cormacrelf

@Ralith Ralith force-pushed the push-vrzyztmsxkup branch 5 times, most recently from d9909c9 to 57a8fac on April 27, 2025 21:07
@Ralith
Contributor Author

Ralith commented Apr 27, 2025

Reworked this to reduce the boilerplate and ensure the semaphore permit is held until the response is fully consumed. Hopefully that doesn't confuse the import?

@Ralith Ralith force-pushed the push-vrzyztmsxkup branch 3 times, most recently from 03cf79a to fcba6c4 on April 27, 2025 22:03
@Ralith Ralith force-pushed the push-vrzyztmsxkup branch from fcba6c4 to 9786a5c on April 27, 2025 22:35
Comment on lines +144 to +147
.inspect(move |_| {
// Ensure we keep a concurrent request permit alive until the stream is consumed
let _guard = &semaphore_guard;
})
Contributor Author

This is a little cheeky and might be clearer as a stream transformer struct, but that would probably take about 4x as much code. Let me know if that'd be preferred.

Contributor

Why do we need this in the stream itself? Could we have `let _guard = &semaphore_guard;` at line 142 instead? As far as I can tell, the request has already occurred by then.

Contributor Author

These requests are typically downloading potentially large files. We don't want to release the semaphore permit until the response has been fully processed, which happens only after the stream has been consumed in full and dropped. If the stream did not own the guard, the permit would be released as soon as the response begins, and we could still end up with arbitrarily large numbers of concurrent [attempted] requests, which is what this PR seeks to avoid.
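
To illustrate the ownership argument, a minimal sketch assuming a futures stream and a tokio OwnedSemaphorePermit (the function name is hypothetical): moving the permit into the stream ties its release to the point where the stream is dropped.

use futures::{Stream, StreamExt};
use tokio::sync::OwnedSemaphorePermit;

fn hold_permit_until_consumed<S>(
    body: S,
    permit: OwnedSemaphorePermit,
) -> impl Stream<Item = S::Item>
where
    S: Stream,
{
    body.inspect(move |_| {
        // The closure takes ownership of the permit, so the permit lives as
        // long as the stream and is released only when the stream is dropped,
        // i.e. after the response body has been fully consumed.
        let _guard = &permit;
    })
}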

@Ralith
Contributor Author

Ralith commented May 23, 2025

Another thought: the motivating case for this PR is downloading from crates.io. In theory, all such requests could be pipelined through a single HTTP/2 connection, which would avoid the resource exhaustion hazard and probably be more efficient for everyone involved. Getting that to happen seems like a significantly more subtle task, however, so maybe best left for follow-up work.

@Ralith
Contributor Author

Ralith commented Jun 3, 2025

As described in #316 (comment) I think there's a more direct solution available by tweaking hyper's HTTP client to reduce the initial default quantity of multiplexing. I'll pursue that in a separate PR. Leaving this open for now in case there's independent interest in limiting the number of connections, though as currently written this limits the number of requests, which isn't quite right.

@cormacrelf
Contributor

cormacrelf commented Jun 3, 2025

A few issues:

  1. Doesn't hyper just open a second HTTP/2 connection if you hit the limit of # streams per conn? That limit is just an http2 optimization to pack more requests into the same connection and amortise handshakes; isn't there a separate, global connection pool behaviour that is still unlimited?
  2. In any case, OS-level resource limits on file descriptors don't care about distinct hostnames or http2 features. This all falls down if your web server does http1 only and cannot multiplex any streams, or if you make requests to too many different hostnames. Hyper needs to be configured to enforce global limits, and I can't actually find anything in hyper that limits the total size of the connection pool.
  3. Even if hyper can be configured to limit concurrency globally, you still need to ensure that the w+ files created for each download are not opened while hyper is queuing a request. These lines in this order will still hit file descriptor limits if we rely on hyper to limit concurrency, because the file is created before we start waiting for hyper to (theoretically?) finish waiting for a slot internally.

All things considered, we should have that http2 max concurrent streams change, but for file descriptor limits, the semaphore here is the only thing that will work.

@Ralith
Contributor Author

Ralith commented Jun 3, 2025

Thanks for the feedback!

Doesn't hyper just open a second HTTP/2 connection if you hit the limit of # streams per conn?

I'd be surprised, but I'll verify.

This all falls down if your web server does http1 only and cannot multiplex any streams, or if you make requests to too many different hostnames.

Yes, this PR is still necessary for that case.

These lines in this order will still hit file descriptor limits if relying on hyper to limit concurrency, because the file is created before we start waiting for hyper to (theoretically?) finish waiting for a slot internally.

This seems like a pretty easy fix, though? I'll draft a separate PR.

@Ralith
Contributor Author

Ralith commented Jun 4, 2025

Opened #991 to delay file creation. I think we need that to get the most benefit from this PR anyway, since otherwise the semaphore is only acquired when the HTTP request is attempted, after the file has already been opened.
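
For illustration, a minimal sketch of the intended ordering, assuming a tokio Semaphore; download_to, send_request, and the simplified error handling are placeholders, not the actual buck2 code:

use std::sync::Arc;

use tokio::fs::File;
use tokio::io::AsyncWriteExt;
use tokio::sync::Semaphore;

async fn download_to(
    semaphore: Arc<Semaphore>,
    url: &str,
    path: &std::path::Path,
) -> std::io::Result<()> {
    // 1. Wait for a concurrency slot before doing anything that consumes a
    //    file descriptor.
    let _permit = semaphore.acquire().await.expect("semaphore closed");
    // 2. Issue the request while holding the permit.
    let body = send_request(url).await;
    // 3. Only now create the output file, so queued downloads don't pin open
    //    file descriptors while they wait.
    let mut file = File::create(path).await?;
    file.write_all(&body).await?;
    Ok(())
}

// Placeholder for the actual HTTP client call.
async fn send_request(url: &str) -> Vec<u8> {
    let _ = url;
    Vec::new()
}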

@Ralith
Contributor Author

Ralith commented Jun 4, 2025

Doesn't hyper just open a second HTTP/2 connection if you hit the limit of # streams per conn?

hyper maintainers confirm that this is not the case.

facebook-github-bot pushed a commit that referenced this pull request Jun 5, 2025
Summary:
Helps reduce the number of file descriptors required for a build, especially in combination with concurrent efforts to limit the number of concurrent HTTP responses being processed.

See also #922.

Pull Request resolved: #991

Reviewed By: IanChilds

Differential Revision: D75920121

fbshipit-source-id: ca17981c2305054c8fc596b165201479d731ee15
@facebook-github-bot
Contributor

@iguridi merged this pull request in ef396e8.
