-
Notifications
You must be signed in to change notification settings - Fork 3.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug] Bookkeeper returns mac mismatch while reading #21421
Comments
I'm rollbacking brokers (only) to 2.11. EDIT: It fixed the issue. So there is an issue with rendering on the client side on v2.11+. |
A bit more context
Where
|
We hit the same errors on Pulsar v3.0.1 |
@miton18 Do you have any way to reproduce this issue? |
sadly, no It happens when upgrading brokers to v3.x.x |
Only upgrade broker, doesn't upgrade the bookie, right? |
bookies has been upgraded to v3.1.0 without issues |
ok, trying to read a ledger from bookkeeper itself also throw the same error... 023-10-26T09:40:26,062+0000 [main] INFO org.apache.bookkeeper.tools.cli.commands.client.LedgerMetaDataCommand - LedgerMetadata{formatVersion=3, ensembleSize=5, writeQuorumSize=3, ackQuorumSize=3, state=CLOSED, length=8336, lastEntryId=0, digestType=CRC32C, password=base64:, ensembles={0=[yo-bookkeeper-c3-n12, yo-bookkeeper-c3-n8, yo-bookkeeper-c3-n14, yo-bookkeeper-c3-n6, yo-bookkeeper-c3-n3]}, customMetadata={component=base64:c2NoZW1h, application=base64:cHVsc2Fy, pulsar/schemaId=base64:b3JnYV8yMWQ3ZjZkMy04Mzk3LTQ3ODItOWE2NS05MWVjYWMxYTQ1OTUvbG9ncy9fX2NoYW5nZV9ldmVudHM=}} |
Looks fixed in 3.1.1. |
There are some reports on Pulsar Slack that this issue happens when |
@KannarFr Have you checked with 3.0.XX version. Basically Its working, if we set bookkeeperUseV2WireProtocol=true in broker configmap. |
No, I didn't but should I use v2 (legacy) wire proto? I mean this can fix the issue, but this is a jump in the past, nope? :D |
@Debashish-Mallick bookkeeperUseV2WireProtocol=true is the recommended default with Pulsar: Lines 973 to 974 in 9b643c8
However, when using bookkeeperUseV2WireProtocol=true, I believe that TLS is not supported. See apache/bookkeeper#2300 and https://github.com/apache/pulsar-helm-chart/blob/8cb3c18377e547e5de4ca19e63138ca1387aef59/charts/pulsar/templates/broker-configmap.yaml#L131-L147 for more information. I am not exactly sure about the impact since I haven't tested this configuration myself. My concern is that TLS wouldn't be used when the intention is to use TLS between brokers and bookies. However, it's possible that apache/bookkeeper#2300 change added support for TLS for v2, but it's hard to know the full context of that change. |
|
apache/bookkeeper#4140 has figured out the issue. It's a regression from 3.0. |
@hangc0276 @poorbarcode Do I understand correctly that apache/bookkeeper#4140 will be released in bookkeeper version 4.16.4 ? What is the schedule for the Bookkeeper 4.16.4 release with the fix? |
Bookkeeper 4.16.4 has been upgraded in #21983 . Closing this issue since that should contain the fix. |
Reopening until latest concerns about other checksum related bugs in BK are resolved. |
Pending fix in Bookkeeper will be included in 4.16.5 release. |
We are using 4.16.3 and encountering this error with 1-2 events occurring randomly on a daily basis, and we have a workaround to skip the specific position and resend the data to recover it. May I know if you are aware of any other workarounds available before the 4.16.5 release? |
@pqab I'm not aware of a workaround. There's also a concern that there are some other problems (#22103 with some linked BK issue) that cause similar symptoms. |
I believe that this issue is a duplicate of #22601. I have made a fix to bookkeeper which fixes the issue: apache/bookkeeper#4404 |
Search before asking
Version
2.11.3 to 3.1.0
Minimal reproduce step
N/A
What did you expect to see?
No error
What did you see instead?
After bumped I got a lot of exceptions as:
Anything else?
No response
Are you willing to submit a PR?
The text was updated successfully, but these errors were encountered: