
util+server: Fix bug around chunked request handling. #6906

Merged

Conversation

philipaconrad
Contributor

What are the changes in this PR?

This PR fixes a request handling bug introduced in #6868, which caused OPA to treat all incoming chunked requests as if they had zero-length request bodies.

The fix detects cases where the request body size is unknown in the DecodingLimits handler, and propagates a request context key down to the util.ReadMaybeCompressedBody function, allowing it to correctly select between using the original io.ReadAll style for chunked requests, or the newer preallocated buffers approach (for requests of known size).

This change has a small, barely visible performance impact for large requests (<5% increase in GC pauses for a 1GB request JSON blob), and minimal, if any, effect on RPS under load.

Fixes: #6904

Notes to assist PR review:

  • A new context key is passed down from the DecodingLimits handler wrapper in server/server.go to the util.ReadMaybeCompressedBody function in util/read_gzip_body.go. This is used to signal cases where the request body is of indeterminate length, and the body reader should enforce the size limit itself.
  • Added back the original io.ReadAll code path in util.ReadMaybeCompressedBody, but it only triggers now on chunked requests, and is guarded both at the top-level in the DecodingLimits handler with a MaxBytesReader, and at the io.ReadAll callsite with a LimitReader. This prevents accidentally reading more of the request body than intended.
  • Gzip handling did not need any changes for this fix, surprisingly! It takes whatever content we pulled from the incoming request/request-stream, and processes it exactly as before.
  • Test cases were extended to throw errors on unexpected warnings, which allows us to catch the issue in #6904 (Chunked HTTP requests broken in 0.67.0) that the tests didn't surface back in #6868 (server+util: Limit max request sizes, prealloc request buffers).

Further notes:

  • Performance impact is minimal: RPS under load seems to be unaffected, and GC overheads increased by <5% for 1GB requests in my local testing.

Contributor Author

@philipaconrad philipaconrad left a comment


Some notes for reviewers:

// For chunked requests (where full size is not known in advance),
// pass server.decoding.max_length down, using the request context.
if r.ContentLength < 0 || (r.ContentLength == 0 && r.Body != nil) {
ctx := util_decoding.AddServerDecodingMaxLen(r.Context(), maxLength)
Contributor Author


This is where the new context key is inserted. Note that we only add it when we have an "indeterminate body size" case, such as what happens during chunked requests.

Member


Is this only the case for chunked requests, or are there some other use cases? If it's only true for chunked requests, can we just check the header (i.e. 'Transfer-Encoding: chunked')?

Also, it would be helpful to add a comment explaining the two conditions in the if statement.

@@ -1713,8 +1713,10 @@ func generateJSONBenchmarkData(k, v int) map[string]interface{} {
}

return map[string]interface{}{
"keys": keys,
"values": values,
"input": map[string]interface{}{
Contributor Author


This is a minor refactoring that silences "'input' key missing" warnings from the test cases.

// incremental read of the body. In this case, we can't be too clever, we
// just do the best we can with whatever is streamed over to us.
// Fetch gzip payload size limit from request context.
if maxLength, ok := decoding.GetServerDecodingMaxLen(r.Context()); ok {
Contributor Author


This is where we add back the original io.ReadAll code path. It's only used when we have an indeterminate request body length case, otherwise we fall back to the preallocated buffer logic from #6868.

Member

@ashutosh-narkar ashutosh-narkar left a comment


Thanks for the fix @philipaconrad!


@philipaconrad
Contributor Author

Thanks for the review comments @ashutosh-narkar!

Is this only the case for chunked requests or are there some other use cases?

Searching the sources of net/http's request.go indicates there are at most two cases where we'd see ContentLength == -1:

  • Chunked transfer-encoding
  • An upgrade-to-HTTP2 edge case

If it's only true for chunked requests, can we just check the header (i.e. 'Transfer-Encoding: chunked')?

net/http actually adds and removes the chunked encoding header transparently in most request/response cases, so we will never see that header explicitly set under normal conditions (I ran some local tests to verify this). There is a request field called TransferEncoding that seems to be populated when chunked encoding is present, but ContentLength will always be -1 in those cases, so I've kept just the ContentLength check.

Also, it would be helpful to add a comment explaining the two conditions in the if statement.

I've expanded the original comment to explain the why behind the "magic" check, and have simplified down to just one check, because the other condition applies only to client requests, where Go is sending a request, not receiving it.

There's a bit of history documented in the sources around why that condition existed, and it dates back to backwards compatibility stuff in Go 1.8.

ashutosh-narkar
ashutosh-narkar previously approved these changes Aug 1, 2024
This commit fixes a request handling bug introduced in open-policy-agent#6868, which
caused OPA to treat all incoming chunked requests as if they had
zero-length request bodies.

The fix detects cases where the request body size is unknown in the
DecodingLimits handler, and propagates a request context key down to
the `util.ReadMaybeCompressedBody` function, allowing it to correctly
select between using the original `io.ReadAll` style for chunked
requests, or the newer preallocated buffers approach (for requests of
known size).

This change has a small, but barely visible performance impact for large
requests (<5% increase in GC pauses for a 1GB request JSON blob), and
minimal, if any, effect on RPS under load.

Fixes: open-policy-agent#6904

Signed-off-by: Philip Conrad <philip@chariot-chaser.net>

netlify bot commented Aug 1, 2024

Deploy Preview for openpolicyagent ready!

Name Link
🔨 Latest commit 16dee19
🔍 Latest deploy log https://app.netlify.com/sites/openpolicyagent/deploys/66abdd0073551f0008d2e215
😎 Deploy Preview https://deploy-preview-6906--openpolicyagent.netlify.app

@philipaconrad philipaconrad merged commit ee9ab0b into open-policy-agent:main Aug 1, 2024
28 checks passed
ashutosh-narkar pushed a commit to ashutosh-narkar/opa that referenced this pull request Aug 1, 2024
…ent#6906)

(cherry picked from commit ee9ab0b)
ashutosh-narkar pushed a commit that referenced this pull request Aug 5, 2024
(cherry picked from commit ee9ab0b)
Successfully merging this pull request may close these issues.

Chunked HTTP requests broken in 0.67.0