Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MINOR: fix security_test for ZK case due to error change #10225

Closed
wants to merge 1 commit into from

Conversation

rondagostino
Copy link
Contributor

The ZooKeeper version of this system test is failing because the producer is no longer seeing LEADER_NOT_AVAILABLE. When the broker sees a METADATA request for the test topic after it restarts the auto topic creation manager is determining that the topic needs to be created due to the TLS hostname verification failure on the inter-broker security protocol. It also thinks there aren't enough brokers available to meet the default topic replication factor (it sees 0 available due to the TLS issue), so it returnsINVALID_REPLICATION_FACTOR for that topic in the Metadata response. In other words, the flow has changed and the inability to produce is not manifesting as it was before, and the test is failing. This patch updates the test to check for INVALID_REPLICATION_FACTOR instead of LEADER_NOT_AVAILABLE.

Committer Checklist (excluded from commit message)

  • Verify design and implementation
  • Verify test coverage and CI build status
  • Verify documentation (including upgrade notes)

@rondagostino
Copy link
Contributor Author

@abbccdda, @cmccabe. Ported from #10199 to discuss separately. We used to see this error message in verifiable_producer.log when security_protocol='PLAINTEXT', interbroker_security_protocol='SSL':

WARN [Producer clientId=producer-1] Error while fetching metadata with correlation id 1 : {test_topic=LEADER_NOT_AVAILABLE} (org.apache.kafka.clients.NetworkClient)

The test does a grep LEADER_NOT_AVAILABLE on the log in this case, and it used to pass.

Now we are instead seeing this in the log file:

WARN [Producer clientId=producer-1] Error while fetching metadata with correlation id 1 : {test_topic=INVALID_REPLICATION_FACTOR} (org.apache.kafka.clients.NetworkClient)

And of course now the test fails.

The INVALID_REPLICATION_FACTOR is coming from the auto topic creation manager as I described above.

It is a simple matter to make the test pass -- I have confirmed that it passes if we grep for INVALID_REPLICATION_FACTOR in the log file instead of LEADER_NOT_AVAILABLE.

I think we just need to decide if this change in behavior is acceptable or not.

@hachikuji
Copy link

@rondagostino Thanks for identifying the issue. Returning INVALID_REPLICATION_FACTOR seems like a mistake to me if we were previously returning LEADER_NOT_AVAILABLE. I'd suggest we fix the code. Would you mind filing a JIRA for this so that we can mark it as a 2.8 blocker?

@rondagostino
Copy link
Contributor Author

@rondagostino
Copy link
Contributor Author

This PR "fixes" the issue by changing the system test to check for the new error. We can close this PR in favor of another one if @abbccdda decides the behavior change is incorrect and can be corrected.

@rondagostino
Copy link
Contributor Author

Closed in favor of #10240

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants