Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Identify raisons of non-running pods such as the "FailedSync" #1

Open
sroze opened this issue Mar 22, 2017 · 0 comments
Open

Identify raisons of non-running pods such as the "FailedSync" #1

sroze opened this issue Mar 22, 2017 · 0 comments

Comments

@sroze
Copy link
Member

sroze commented Mar 22, 2017

One of the main issues we have with volumes is the "FailedSync" and the mount timeouts.

Events:
  FirstSeen	LastSeen	Count	From								SubobjectPath	Type		Reason		Message
  ---------	--------	-----	----								-------------	--------	------		-------
  4m		4m		1	{default-scheduler }								Normal		Scheduled	Successfully assigned postgres-1015904327-0cq7c to [...]-pool-4cpu-15gb-878b5c33-dfm5
  3m		2m		8	{controller-manager }								Warning		FailedMount	Failed to attach volume "pvc-3ed3f856-0e1c-11e7-824a-42010af0012c" on node "[..]-pool-4cpu-15gb-878b5c33-dfm5" with: googleapi: Error 400: The disk resource '[...]-a-pvc-3ed3f856-0e1c-11e7-824a-42010af0012c' is already being used by '[...]-pool-4cpu-15gb-878b5c33-dm7r'
  2m		2m		1	{kubelet [...]-pool-4cpu-15gb-878b5c33-dfm5}			Warning		FailedMount	Unable to mount volumes for pod "postgres-1015904327-0cq7c_[...](6785daae-0ef2-11e7-824a-42010af0012c)": timeout expired waiting for volumes to attach/mount for pod "[...]"/"postgres-1015904327-0cq7c". list of unattached/unmounted volumes=[database-volume]
  2m		2m		1	{kubelet [...]-pool-4cpu-15gb-878b5c33-dfm5}			Warning		FailedSync	Error syncing pod, skipping: timeout expired waiting for volumes to attach/mount for pod "[...]"/"postgres-1015904327-0cq7c". list of unattached/unmounted volumes=[database-volume]

It looks like it's a bug and is often related to a given node. It'd be nice to identify such "non-running" pods and their hosts, so we can probably quickly identify that one given node is not going well.

@sroze sroze changed the title Identify the "FailedSync" Identify raisons of non-running pods such as the "FailedSync" Mar 22, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant