Configure nodes [localhost] nickhammond.logrotate : nickhammond.logrotate | Setup logrotate.d scripts #18527

Closed · aveshagarwal opened this issue Feb 8, 2018 · 11 comments

Labels: kind/test-flake (Categorizes issue or PR as related to test flakes.), priority/P1, sig/master, sig/pod
@aveshagarwal (Contributor):

https://openshift-gce-devel.appspot.com/build/origin-ci-test/pr-logs/pull/18475/test_pull_request_origin_extended_conformance_install/7120/

 [WARNING]: Could not create retry file '/usr/share/ansible/openshift-
ansible/playbooks/deploy_cluster.retry'.         [Errno 13] Permission denied:
u'/usr/share/ansible/openshift-ansible/playbooks/deploy_cluster.retry'

PLAY RECAP *********************************************************************
localhost                  : ok=399  changed=149  unreachable=0    failed=1   


INSTALLER STATUS ***************************************************************
Initialization             : Complete (0:00:16)
Health Check               : Complete (0:00:22)
etcd Install               : Complete (0:00:39)
Master Install             : Complete (0:02:00)
Master Additional Install  : Complete (0:00:27)
Node Install               : In Progress (0:03:22)
	This phase can be restarted by running: playbooks/openshift-node/config.yml



Failure summary:


  1. Hosts:    localhost
     Play:     Configure nodes
     Task:     restart node
     Message:  Unable to restart service origin-node: Job for origin-node.service failed because the control process exited with error code. See "systemctl status origin-node.service" and "journalctl -xe" for details.
               
++ export status=FAILURE
++ status=FAILURE
+ set +o xtrace
########## FINISHED STAGE: FAILURE: INSTALL ORIGIN [00h 07m 15s] ##########

Node logs:

Feb 08 17:01:41 ip-172-18-6-38.ec2.internal origin-node[28890]: F0208 17:01:41.141377   28890 network.go:46] SDN node startup failed: failed to validate network configuration: master has not created a default cluster network, network plugin "redhat/openshift-ovs-subnet" can not start
Feb 08 17:01:41 ip-172-18-6-38.ec2.internal systemd[1]: origin-node.service: main process exited, code=exited, status=255/n/a
Feb 08 17:01:41 ip-172-18-6-38.ec2.internal systemd[1]: Failed to start OpenShift Node.
Feb 08 17:01:41 ip-172-18-6-38.ec2.internal systemd[1]: Unit origin-node.service entered failed state.
Feb 08 17:01:41 ip-172-18-6-38.ec2.internal systemd[1]: origin-node.service failed.
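
The fatal line above is the SDN node's startup validation refusing to proceed because the master has not yet created the default cluster network. A minimal Go sketch of that ordering check (the map and function names are hypothetical stand-ins for the API lookup the real SDN node performs, not the actual origin code):

```go
package main

import "fmt"

// clusterNetworks stands in for the ClusterNetwork objects the master
// creates during startup; an empty map models a master that has not
// finished network setup. (Hypothetical stand-in for the API lookup
// the real SDN node performs.)
var clusterNetworks = map[string]string{}

// validateNetworkConfig mirrors the check behind the fatal log line:
// the node plugin refuses to start until the master has created the
// "default" cluster network.
func validateNetworkConfig(plugin string) error {
	if _, ok := clusterNetworks["default"]; !ok {
		return fmt.Errorf("master has not created a default cluster network, network plugin %q can not start", plugin)
	}
	return nil
}

func main() {
	if err := validateNetworkConfig("redhat/openshift-ovs-subnet"); err != nil {
		fmt.Println("SDN node startup failed: failed to validate network configuration:", err)
	}
}
```

Until the master-side setup writes the default network, every node restart hits the same fatal path, which is why systemd keeps reporting the failure above.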

@sdodson (Member) commented Feb 8, 2018:

Need @openshift/networking to check why the sdn registration failed.

@jwforres (Member) commented Feb 8, 2018:

@openshift/sig-networking

@jwforres added the kind/test-flake label Feb 8, 2018
@aveshagarwal (Contributor, Author):

This has been failing so often that PR #18475 (comment) seems to be stuck.

https://openshift-gce-devel.appspot.com/build/origin-ci-test/pr-logs/pull/18475/test_pull_request_origin_extended_conformance_install/7162/

Feb 09 03:28:47 ip-172-18-5-65.ec2.internal origin-node[29111]: I0209 03:28:47.204835   29111 manager.go:174] CRI-O not connected: Get http://%2Fvar%2Frun%2Fcrio%2Fcrio.sock/info: dial unix /var/run/crio/crio.sock: connect: no such file or directory
Feb 09 03:28:47 ip-172-18-5-65.ec2.internal origin-node[29111]: F0209 03:28:47.217138   29111 network.go:46] SDN node startup failed: failed to validate network configuration: master has not created a default cluster network, network plugin "redhat/openshift-ovs-subnet" can not start
Feb 09 03:28:47 ip-172-18-5-65.ec2.internal systemd[1]: origin-node.service: main process exited, code=exited, status=255/n/a
Feb 09 03:28:47 ip-172-18-5-65.ec2.internal systemd[1]: Failed to start OpenShift Node.
Feb 09 03:28:47 ip-172-18-5-65.ec2.internal systemd[1]: Unit origin-node.service entered failed state.
Feb 09 03:28:47 ip-172-18-5-65.ec2.internal systemd[1]: origin-node.service failed.

@danwinship (Contributor):

The node is failing to start because the master is failing to start: origin-master-controllers.service is crash-looping, failing each time with:

Feb 08 16:57:44 ip-172-18-6-38.ec2.internal origin-master-controllers[21128]: F0208 16:57:44.937222   21128 plugins.go:234] Invalid configuration: Predicate type not found for NoVolumeNodeConflict

@aveshagarwal (Contributor, Author):

NoVolumeNodeConflict has been removed, so it should no longer be referenced.
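
In other words, the scheduler policy in use still names a predicate that no longer exists in the registry, so configuration validation fails fatally at startup. A minimal Go sketch of that lookup (the registry contents and function names are illustrative, not the actual scheduler code):

```go
package main

import "fmt"

// predicateRegistry maps scheduler predicate names to (stubbed) fit
// functions. The names are illustrative; the real registry lives in
// the scheduler's plugin factory. "NoVolumeNodeConflict" is absent
// because it was removed upstream.
var predicateRegistry = map[string]func() bool{
	"NoDiskConflict":   func() bool { return true },
	"PodFitsResources": func() bool { return true },
}

// validatePolicy rejects any policy that still references an
// unregistered predicate, mirroring the fatal
// "Invalid configuration: Predicate type not found for ..." error.
func validatePolicy(predicates []string) error {
	for _, name := range predicates {
		if _, ok := predicateRegistry[name]; !ok {
			return fmt.Errorf("Invalid configuration: Predicate type not found for %s", name)
		}
	}
	return nil
}

func main() {
	err := validatePolicy([]string{"NoDiskConflict", "NoVolumeNodeConflict"})
	fmt.Println(err)
}
```

The fix is on the configuration side: regenerate or edit the scheduler policy so it only references predicates that are still registered.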

@jwforres (Member):

@openshift/sig-master

@deads2k (Contributor) commented Feb 22, 2018:

@openshift/sig-master

@openshift/sig-pod
/assign @aveshagarwal
@jwforres I think I saw @aveshagarwal link an ansible pull somewhere.

@deads2k (Contributor) commented Feb 22, 2018:

@openshift/sig-master

@jwforres oh, also, pretty sure that's in the scheduler


@sjenning (Contributor):

This is fixed.

@jboyd01 (Contributor) commented Mar 28, 2018:

test_pull_request_origin_extended_conformance_install seems to be hitting this issue repeatedly with #19117, even though this issue was closed as fixed.

10 participants