autopilot stats_fetcher gets mTLS errors sending cross-region RPCs #15085

Closed
tgross opened this issue Oct 31, 2022 · 2 comments · Fixed by #15290

tgross commented Oct 31, 2022

@ron-savoia reported getting the following errors with federated clusters using mTLS:

2022-10-28T20:33:32.124Z [WARN] nomad.rpc: failed TLS handshake: remote_addr=192.168.1.190:55376 error="remote error: tls: bad certificate"
2022-10-28T20:33:32.755Z [WARN] nomad.stats_fetcher: error getting server health: server=server1.west error="rpc error: failed to get conn: x509: certificate is valid for server.west.nomad, localhost, not server.east.nomad"

But normal federated operations, such as submitting a job across regions, work as expected, so the mTLS configuration itself is fine. The stats_fetcher code was updated as part of the autopilot update in 1.4.0 (#14441). The RPCs involved here should be doing the same cross-region checks as other federated RPCs. But taking a quick look at the code, it's not clear to me that we should be making cross-region stats_fetcher requests at all.
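To illustrate the second log line, here is a minimal sketch in plain Go (standard library only, not Nomad's code) of how that x509 error arises: a certificate whose SANs name the west region is checked against the east region's expected server name. The helpers `selfSignedCert` and `checkServerName` are hypothetical names made up for this example, and the assumption that the caller verified the peer against `server.east.nomad` is inferred from the error text, not from the stats_fetcher source.

```go
package main

import (
	"crypto/ecdsa"
	"crypto/elliptic"
	"crypto/rand"
	"crypto/x509"
	"crypto/x509/pkix"
	"fmt"
	"math/big"
	"time"
)

// selfSignedCert (hypothetical helper) builds a throwaway self-signed
// certificate whose subject alternative names are dnsNames.
func selfSignedCert(dnsNames []string) (*x509.Certificate, error) {
	key, err := ecdsa.GenerateKey(elliptic.P256(), rand.Reader)
	if err != nil {
		return nil, err
	}
	tmpl := &x509.Certificate{
		SerialNumber: big.NewInt(1),
		Subject:      pkix.Name{CommonName: "example"},
		NotBefore:    time.Now().Add(-time.Hour),
		NotAfter:     time.Now().Add(time.Hour),
		DNSNames:     dnsNames,
	}
	der, err := x509.CreateCertificate(rand.Reader, tmpl, tmpl, &key.PublicKey, key)
	if err != nil {
		return nil, err
	}
	return x509.ParseCertificate(der)
}

// checkServerName (hypothetical helper) reports whether a certificate
// carrying dnsNames is acceptable when the dialer expects the name expected.
func checkServerName(dnsNames []string, expected string) error {
	cert, err := selfSignedCert(dnsNames)
	if err != nil {
		return err
	}
	return cert.VerifyHostname(expected)
}

func main() {
	// SANs taken from the error message in the report above.
	westSANs := []string{"server.west.nomad", "localhost"}

	// Verifying with the certificate's own region name succeeds.
	fmt.Println("west:", checkServerName(westSANs, "server.west.nomad"))

	// Verifying the same certificate against the east name fails with a
	// hostname mismatch, the same class of error as in the log.
	fmt.Println("east:", checkServerName(westSANs, "server.east.nomad"))
}
```

If this is what is happening, the certificates themselves are fine and the mismatch comes from which region name the caller expects, which would be consistent with other federated RPCs working.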

tgross commented Nov 17, 2022

Fixed in #15290 and will ship in Nomad 1.4.3.

@github-actions

I'm going to lock this issue because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active issues.
If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Mar 18, 2023