Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add more logs #68

Closed
srgothi92 opened this issue Jul 26, 2021 · 1 comment
Closed

Add more logs #68

srgothi92 opened this issue Jul 26, 2021 · 1 comment

Comments

@srgothi92
Copy link
Contributor

Description
Currently controller logs are very limited even with the debug flag on. Due to this it's hard to understand what is happening in between the labels received and labels posted. Additional logs needs to be added before starting each new step like update, cordon, drain, verify update etc. to better understand update progression and debug issues.

Current logs:

time="2021-07-26T15:55:30Z" level=debug msg="resource update event" component=controller worker=informer
time="2021-07-26T15:55:30Z" level=debug msg="handling event" component=controller node=ip-1-1-1-1.us-west-2.compute.internal worker=manager
time="2021-07-26T15:55:30Z" level=debug msg="not queuing duplicate intent" component=controller intent="reboot-update,reboot-update,ready update:true" node=ip-1-1-1-1.us-west-2.compute.internal worker=manager
time="2021-07-26T16:00:06Z" level=debug msg="resource update event" component=controller worker=informer
time="2021-07-26T16:00:06Z" level=debug msg="handling event" component=controller node=ip-2-2-2-2.us-west-2.compute.internal worker=manager
time="2021-07-26T16:00:06Z" level=debug msg="no action needed" component=controller intent="stabilize,stabilize,ready update:false" node=ip-2-2-2-2.us-west-2.compute.internal worker=manager
time="2021-07-26T16:00:26Z" level=debug msg="resource update event" component=controller worker=informer
time="2021-07-26T16:00:26Z" level=debug msg="handling event" component=controller node=ip-1-1-1-1.us-west-2.compute.internal worker=manager
time="2021-07-26T16:00:26Z" level=debug msg="queue intent" component=controller intent="reboot-update,reboot-update,ready update:true" node=ip-1-1-1-1.us-west-2.compute.internal worker=manager
time="2021-07-26T16:00:26Z" level=debug msg="checking with policy" component=controller intent="reboot-update,reboot-update,ready update:true" node=ip-1-1-1-1.us-west-2.compute.internal worker=manager
time="2021-07-26T16:00:26Z" level=debug msg="handling permitted intent" component=controller intent="reboot-update,reboot-update,ready update:true" node=ip-1-1-1-1.us-west-2.compute.internal worker=manager
time="2021-07-26T16:00:26Z" level=debug msg="handling successful update" component=controller intent="reboot-update,reboot-update,ready update:true" node=ip-1-1-1-1.us-west-2.compute.internal worker=manager
time="2021-07-26T16:00:26Z" level=debug msg="posted intent" component=controller intent="stabilize,unknown,unknown update:unknown" node=ip-1-1-1-1.us-west-2.compute.internal worker=manager
time="2021-07-26T16:00:26Z" level=debug msg="resource update event" component=controller worker=informer
time="2021-07-26T16:00:26Z" level=debug msg="handling event" component=controller node=ip-1-1-1-1.us-west-2.compute.internal worker=manager
time="2021-07-26T16:00:26Z" level=debug msg="intent is not yet realized" component=controller intent="stabilize,unknown,unknown update:unknown" node=ip-1-1-1-1.us-west-2.compute.internal worker=manager
time="2021-07-26T16:00:26Z" level=debug msg="resource update event" component=controller worker=informer
time="2021-07-26T16:00:26Z" level=debug msg="handling event" component=controller node=ip-1-1-1-1.us-west-2.compute.internal worker=manager
time="2021-07-26T16:00:26Z" level=debug msg="intent is not yet realized" component=controller intent="stabilize,stabilize,busy update:unknown" node=ip-1-1-1-1.us-west-2.compute.internal worker=manager
time="2021-07-26T16:00:36Z" level=debug msg="resource update event" component=controller worker=informer
time="2021-07-26T16:00:36Z" level=debug msg="handling event" component=controller node=ip-1-1-1-1.us-west-2.compute.internal worker=manager
time="2021-07-26T16:00:36Z" level=debug msg="no action needed" component=controller intent="stabilize,stabilize,ready update:false" node=ip-1-1-1-1.us-west-2.compute.internal worker=manager

Image I'm using:
0.1.4

Issue or Feature Request:
Issue

@cbgbt
Copy link
Contributor

cbgbt commented Feb 15, 2022

This should be fixed in the latest Update Operator release, 0.2.0. We've surfaced considerably more logs, as well as Prometheus metrics. These should make it easier to debug issues.

@cbgbt cbgbt closed this as completed Feb 15, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants