Made client id parsing vcluster aware #18464

mmaslankaprv · 2024-05-14T09:01:21Z

Previously Redpanda virtualized Kafka connections based on the full
client_id string and the string was used as a client id in all
downstream processing.

Change the parsing logic to add context to the parsed client id.
The client id format expected by Redpanda has the following structure:

[vcluster_id][connection_id][actual client id]

where:

vcluster_id - string encoded XID representing virtual cluster
(20 characters)

connection_id - hex encoded 32 bit integer representing virtual
connection id (8 characters)

client_id - standard protocol defined client id

If Redpanda fails to parse the client id while working with virtualized
connections the whole connection is closed.

Backports Required

Release Notes

none

Signed-off-by: Michał Maślanka <michal@redpanda.com>

When using mpx protocol extension actual client id is only a part of the whole client id buffer sent by MPX to Redpanda. Added a method allowing overriding client id. Signed-off-by: Michał Maślanka <michal@redpanda.com>

michael-redpanda

Looks good, have some questions about input validation

michael-redpanda · 2024-05-14T17:21:10Z

src/v/kafka/server/connection_context.cc

+vcluster_connection_id parse_vcluster_connection_id(std::string_view str) {
+    vcluster_connection_id cid;
+    std::stringstream sstream(str.data());
+    sstream >> std::hex >> cid;
+    return cid;
+}


I think there should be some validation to ensure that the vcluster_connection_id contains only valid hex characters. The streaming operation won't throw any exception if an invalid character is encountered and will simply stop processing.

src/v/kafka/server/connection_context.cc

tests/rptest/tests/connection_virtualizing_test.py

oleiman

couple questions but generally lgtm. I agree with @michael-redpanda - some DT tests for junk input might be nice.

src/v/kafka/server/connection_context.cc

src/v/kafka/server/connection_context.h

Previously Redpanda virtualized Kafka connections based on the full client_id string and the string was used as a client id in all downstream processing. Change the parsing logic to add context to the parsed client id. The client id format expected by Redpanda has the following structure: ``` [vcluster_id][connection_id][actual client id] ``` where: `vcluster_id` - string encoded XID representing virtual cluster (20 characters) `connection_id` - hex encoded 32 bit integer representing virtual connection id (8 characters) `client_id` - standard protocol defined client id If Redpanda fails to parse the client id while working with virtualized connections the whole connection is closed. Signed-off-by: Michał Maślanka <michal@redpanda.com>

Signed-off-by: Michał Maślanka <michal@redpanda.com>

Replaced `node_hash_map` keeping state of virtualized connections with `chunked_hash_map`. The change will allow us to avoid large allocations when dealing with large virtual connection number. Signed-off-by: Michał Maślanka <michal@redpanda.com>

vbotbuildovich · 2024-05-15T19:03:22Z

ducktape was retried in https://buildkite.com/redpanda/redpanda/builds/49168#018f7d6c-bd71-4c32-8bfe-1f25bc06addd

oleiman

lgtm

vbotbuildovich · 2024-05-16T06:27:26Z

/backport v24.1.x

mmaslankaprv added 2 commits May 14, 2024 06:43

u/xid: use string_view in xid constructor

5aae639

Signed-off-by: Michał Maślanka <michal@redpanda.com>

k/request_context: introduced method to override client id

934c09c

When using mpx protocol extension actual client id is only a part of the whole client id buffer sent by MPX to Redpanda. Added a method allowing overriding client id. Signed-off-by: Michał Maślanka <michal@redpanda.com>

github-actions bot added the area/redpanda label May 14, 2024

mmaslankaprv requested review from oleiman, graphcareful and michael-redpanda May 14, 2024 14:18

michael-redpanda reviewed May 14, 2024

View reviewed changes

oleiman reviewed May 14, 2024

View reviewed changes

src/v/kafka/server/connection_context.cc Outdated Show resolved Hide resolved

src/v/kafka/server/connection_context.h Outdated Show resolved Hide resolved

mmaslankaprv added 3 commits May 15, 2024 07:40

tests: extracted xid generation to utils

5a11716

Signed-off-by: Michał Maślanka <michal@redpanda.com>

tests: updated test validating handling of virtualized connections

953135b

Signed-off-by: Michał Maślanka <michal@redpanda.com>

mmaslankaprv force-pushed the mpx-client-parsing branch from b28ebec to 953135b Compare May 15, 2024 07:41

michael-redpanda requested review from michael-redpanda and oleiman May 15, 2024 14:46

mmaslankaprv force-pushed the mpx-client-parsing branch from cb96979 to 187dcbf Compare May 15, 2024 16:50

oleiman approved these changes May 15, 2024

View reviewed changes

mmaslankaprv merged commit 076ddb8 into redpanda-data:dev May 16, 2024
18 checks passed

mmaslankaprv deleted the mpx-client-parsing branch May 16, 2024 06:27

vbotbuildovich mentioned this pull request May 16, 2024

[v24.1.x] Made client id parsing vcluster aware #18520

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Made client id parsing vcluster aware #18464

Made client id parsing vcluster aware #18464

mmaslankaprv commented May 14, 2024 •

edited

Loading

michael-redpanda left a comment

michael-redpanda May 14, 2024

oleiman left a comment

vbotbuildovich commented May 15, 2024

oleiman left a comment

vbotbuildovich commented May 16, 2024

Made client id parsing vcluster aware #18464

Made client id parsing vcluster aware #18464

Conversation

mmaslankaprv commented May 14, 2024 • edited Loading

Backports Required

Release Notes

michael-redpanda left a comment

Choose a reason for hiding this comment

michael-redpanda May 14, 2024

Choose a reason for hiding this comment

oleiman left a comment

Choose a reason for hiding this comment

vbotbuildovich commented May 15, 2024

oleiman left a comment

Choose a reason for hiding this comment

vbotbuildovich commented May 16, 2024

mmaslankaprv commented May 14, 2024 •

edited

Loading