This repository has been archived by the owner on Jun 23, 2022. It is now read-only.

fix: message body size unset after parsing, which leads to large IO throughput #1008

Merged · 4 commits merged into XiaoMi:master on Jan 10, 2022

Conversation

@padmejin (Contributor) commented Jan 6, 2022

Issue

See apache/incubator-pegasus#866.

Description

We noticed that Pegasus 2.0 uses much more network/disk bandwidth than Pegasus 1.12.3. After weeks of debugging, we finally found that this is a bug introduced in #255, where a new Thrift message parser, compatible with both the old and the new message format, was refactored to replace the old one. This PR fixes it.

The main cause is that when parsing the message body, the parser returns the whole buffer holding all received messages instead of exactly the size of the body. Thus, when many write requests are piled up, every message carries extra trailing bytes with it, from the moment the write request arrives to the moment it is written to the mutation log and sent to other nodes, so the throughput is significantly amplified.

To fix this, we merely add one line that sets the message body size before calling create_message_from_request_blob().

We also noticed that when a message header is found to be invalid, the bad message is not consumed and discarded; instead, it stays in the buffer forever. We think this is a bug too. To fix it, we consume the buffer before calling create_message_from_request_blob().
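A minimal sketch of both fixes, using stand-in types (only create_message_from_request_blob() is a name taken from this PR; the blob/message_reader types and the surrounding structure here are assumed for illustration, not the actual rDSN code):

```cpp
// Simplified sketch, NOT the actual rDSN parser: blob and message_reader
// are stand-in types; only create_message_from_request_blob is a real name.
#include <cstdint>
#include <vector>

struct blob {
    const uint8_t *data = nullptr;
    size_t len = 0;
    blob prefix(size_t n) const { return blob{data, n}; }  // first n bytes only
};

struct message_reader {
    std::vector<uint8_t> buffer;  // everything received so far
    size_t consumed = 0;

    blob readable() const {
        return blob{buffer.data() + consumed, buffer.size() - consumed};
    }
    void consume(size_t n) { consumed += n; }  // drop parsed or bad bytes
};

struct message_ex { blob body; };

// Stand-in for the function named in the PR description.
message_ex *create_message_from_request_blob(const blob &body) {
    return new message_ex{body};
}

message_ex *parse_body(message_reader &reader, uint32_t body_length) {
    blob whole = reader.readable();
    if (whole.len < body_length)
        return nullptr;  // incomplete message, wait for more bytes

    // Fix 1: pass exactly body_length bytes, not the whole receive
    // buffer (which may still hold many piled-up messages behind it).
    blob body = whole.prefix(body_length);

    // Fix 2: consume the bytes before constructing the message, so a
    // message rejected later cannot sit in the buffer forever.
    reader.consume(body_length);

    return create_message_from_request_blob(body);
}
```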

@acelyc111 (Member)

Better to check that there is no extra buffer space after message_ex has been constructed, i.e., at the end of parse_request_body_v0/v1, check that msg->header->body_length equals msg->buffers[1].length().
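Something like the following, sketched with minimal stand-in types (plain assert() used here; the real code would use rDSN's own assertion macros, and message_ex/message_header are simplified, not the rDSN structs):

```cpp
// Sketch of the suggested invariant check, with stand-in types.
#include <cassert>
#include <cstdint>
#include <string>
#include <vector>

struct message_header { uint32_t body_length = 0; };

struct buffer_piece {
    std::string bytes;
    size_t length() const { return bytes.size(); }
};

struct message_ex {
    message_header *header = nullptr;
    std::vector<buffer_piece> buffers;  // [0] = header, [1] = body
};

// Call at the end of parse_request_body_v0/v1: the body buffer must hold
// exactly body_length bytes, with no extra receive-buffer space behind it.
void check_no_extra_buffer(const message_ex &msg) {
    assert(msg.header->body_length == msg.buffers[1].length());
}
```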

@hycdong (Contributor) commented Jan 6, 2022

I raised a new issue to describe this bug: apache/incubator-pegasus#866. Could you please fill it in according to your pull request?

@padmejin (Contributor, Author) commented Jan 6, 2022

> I raised a new issue to describe this bug: apache/incubator-pegasus#866. Could you please fill it in according to your pull request?

Done.

@acelyc111 (Member) previously approved these changes Jan 6, 2022

LGTM

@foreverneverer (Contributor) left a comment

@levy5307 needs to re-check it

@levy5307 (Contributor) commented Jan 7, 2022

> We also noticed that when a message header is found to be invalid, the bad message is not consumed and discarded; instead, it stays in the buffer forever. We think this is a bug too.

Could you add a unit test for this bug?

@padmejin padmejin closed this Jan 7, 2022
@acelyc111 acelyc111 reopened this Jan 7, 2022
@padmejin (Contributor, Author) commented Jan 7, 2022

> > We also noticed that when a message header is found to be invalid, the bad message is not consumed and discarded; instead, it stays in the buffer forever. We think this is a bug too.
>
> Could you add a unit test for this bug?

I meant to add a new test case, but then I found an old case that already covers this scenario. For some unknown reason, the author truncated the buffer before the next message was parsed. I added a new line to check whether the buffer has been consumed.

Before this patch the case failed on that assert, and it passes after this patch, so I believe the fix works.
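Roughly, the new check amounts to asserting that nothing is left in the reader after a bad message is rejected. A toy illustration reusing the stand-in message_reader from the sketch above (not the actual test in the PR):

```cpp
// Toy check, not the real unit test: after the parser rejects a bad
// header, the reader must have consumed the bytes instead of keeping them.
#include <cassert>

int main() {
    message_reader reader;                     // stand-in type from above
    reader.buffer = {0xde, 0xad, 0xbe, 0xef};  // garbage "message"

    // ... parser inspects the header, finds it invalid, and (with this
    // patch) consumes the bad bytes instead of leaving them behind ...
    reader.consume(reader.buffer.size());

    assert(reader.readable().len == 0);        // buffer fully consumed
    return 0;
}
```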

@foreverneverer (Contributor)

> the parser returns the whole buffer holding all received messages instead of exactly the size of the body. Thus, when many write requests are piled up ...

Do you mean:

The request sizes piled up in the queue are as follows:
1KB 1KB 1KB 1KB 1KB 1KB

Then the mutation log flush sizes will be: 1KB, 1KB+1KB, 1KB+1KB+1KB, 1KB+1KB+1KB+1KB, ...

@padmejin (Contributor, Author) commented Jan 7, 2022

> > the parser returns the whole buffer holding all received messages instead of exactly the size of the body. Thus, when many write requests are piled up ...
>
> Do you mean:
>
> The request sizes piled up in the queue are as follows: 1KB 1KB 1KB 1KB 1KB 1KB
>
> Then the mutation log flush sizes will be: 1KB, 1KB+1KB, 1KB+1KB+1KB, 1KB+1KB+1KB+1KB, ...

Yes.

Actually, I printed the buffer sizes out and used awk, sort, and uniq to analyze the distribution. The buffer size went up to 10000+, and the numbers roughly conformed to an arithmetic sequence; not exactly, but they followed some kind of pattern.

I also wrote a script to analyze all the slog files, and it turned out that a certain key pair appeared multiple times in adjacent blocks; but when I dumped them with mlog_dump, I found no duplicated key pairs. I guessed those were just dirty data.
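To make the amplification concrete: if n requests of s bytes each are piled up and every parsed message drags the rest of the buffer along, the bytes flushed grow as an arithmetic series, roughly s·n(n+1)/2 in total instead of s·n. A toy calculation (illustration only, not code from the PR):

```cpp
// Toy calculation of the write amplification described above: with the
// bug, the i-th piled-up message carries i requests' worth of bytes.
#include <cstdio>

int main() {
    const long n = 6;     // piled-up requests, as in the example above
    const long s = 1024;  // 1 KB each
    long correct = 0, buggy = 0;
    for (long i = 1; i <= n; ++i) {
        correct += s;      // exact body size: always 1 KB
        buggy   += s * i;  // whole buffer: 1KB, 2KB, 3KB, ...
    }
    std::printf("correct = %ld bytes, buggy = %ld bytes (%.1fx amplification)\n",
                correct, buggy, (double)buggy / (double)correct);
    return 0;
}
```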

@padmejin (Contributor, Author) commented Jan 7, 2022

The following is part of an mlog_dump result. The log_length values of these three mutations were [19971, 19649, 19327]:

mutation [4.0.3.3873136]: gpid=4.0, ballot=3, decree=3873136, timestamp=1640322274780591, last_committed_decree=3873133, log_offset=22216353421, log_length=19971, update_count=1
  [MULTI_PUT] 1
    [PUT] "hashKeyPrefix_00000000000000000000000000000000000000000261387976" : "hashKeyPrefix_00000000000000000000000000000000000000000261387976" => 0 : "pElLpiaC0juYshL86KXI30N5mYvNlwQNkO5k44hKInyhyomgdO4Eswz2RWDrODyP"
mutation [4.0.3.3873137]: gpid=4.0, ballot=3, decree=3873137, timestamp=1640322274781203, last_committed_decree=3873135, log_offset=22216373392, log_length=19649, update_count=1
  [MULTI_PUT] 1
    [PUT] "hashKeyPrefix_00000000000000000000000000000000000000000054160672" : "hashKeyPrefix_00000000000000000000000000000000000000000054160672" => 0 : "drsrinPuO8Zty7nP0YU2RRyQKoSQCBBBcGUaFKOz4Ufk5C2pgPHf0pekCVDCuE7I"
mutation [4.0.3.3873138]: gpid=4.0, ballot=3, decree=3873138, timestamp=1640322274781608, last_committed_decree=3873135, log_offset=22216393041, log_length=19327, update_count=1
  [MULTI_PUT] 1
    [PUT] "hashKeyPrefix_00000000000000000000000000000000000000000336035963" : "hashKeyPrefix_00000000000000000000000000000000000000000336035963" => 0 : "c8kxhTIEw6c6TSAf0jDlV8R01QulIeHSIK3O364HDD7PP8CQGTNs4PldV9mz1PLr"

@acelyc111 acelyc111 merged commit 3d1c988 into XiaoMi:master Jan 10, 2022