Router displays error on startup #1394

Closed
smarterclayton opened this issue Mar 20, 2015 · 10 comments
Labels: component/routing, kind/bug, priority/P2

Comments

@smarterclayton
Contributor

$ docker logs 9e0e9b14e735
E0320 03:23:35.303123       1 router.go:126] Error reloading router: wait: no child processes
 Reload output: + config_file=/var/lib/haproxy/conf/haproxy.config
+ pid_file=/var/lib/haproxy/run/haproxy.pid
+ old_pid=
+ path_map_file=/var/lib/haproxy/conf/os_http_be.map
+ sort -r /var/lib/haproxy/conf/os_http_be.map -o /var/lib/haproxy/conf/os_http_be.map
+ '[' -f /var/lib/haproxy/run/haproxy.pid ']'
+ old_pid=18
+ '[' -n 18 ']'
+ /usr/sbin/haproxy -f /var/lib/haproxy/conf/haproxy.config -p /var/lib/haproxy/run/haproxy.pid -sf 18
[WARNING] 078/032335 (23) : Setting tune.ssl.default-dh-param to 1024 by default, if your workload permits it you should set it to at least 2048. Please set a value >= 1024 to make this warning disappear.
@smarterclayton
Contributor Author

The first reloading error shouldn't be displayed.

@pweil-
Contributor

pweil- commented Apr 1, 2015

Looks like it has to do with the reaper not having children. It probably just needs to be suppressed in all cases. I'll take a look.

@pweil-
Contributor

pweil- commented Apr 2, 2015

Actually, I tried reproducing this and my container seems fine. I deleted and recreated it a couple of times and it always came up OK. I can still put a suppression in there just in case, if you think that's necessary.

[vagrant@openshiftdev origin]$ docker logs -f 160a
I0402 18:02:49.224641       1 reaper.go:17] Launching reaper
I0402 18:02:49.240737       1 controller.go:26] Running router controller
I0402 18:02:49.280852       1 plugin.go:68] Processing 1 Endpoints for Name: kubernetes (MODIFIED)
I0402 18:02:49.280893       1 plugin.go:71]   Endpoint 0 : {10.0.2.15 %!s(int=8443) %!s(*api.ObjectReference=<nil>)}
I0402 18:02:49.280938       1 plugin.go:83] Modifying endpoints for default/kubernetes
I0402 18:02:49.280946       1 router.go:59] Commiting router changes
I0402 18:02:49.364906       1 plugin.go:68] Processing 1 Endpoints for Name: kubernetes-ro (MODIFIED)
I0402 18:02:49.364943       1 plugin.go:71]   Endpoint 0 : {10.0.2.15 %!s(int=8443) %!s(*api.ObjectReference=<nil>)}
I0402 18:02:49.364950       1 plugin.go:83] Modifying endpoints for default/kubernetes-ro
I0402 18:02:49.364977       1 router.go:59] Commiting router changes
I0402 18:02:49.365469       1 reaper.go:24] Signal received: child exited
I0402 18:02:49.365523       1 reaper.go:27] Waiting to reap
I0402 18:02:49.365552       1 reaper.go:30] No more process to reap.
I0402 18:02:49.400792       1 reaper.go:24] Signal received: child exited
I0402 18:02:49.400932       1 reaper.go:27] Waiting to reap
I0402 18:02:49.400957       1 reaper.go:34] Reaped process with pid 21
I0402 18:02:49.400962       1 reaper.go:27] Waiting to reap
I0402 18:02:49.400967       1 reaper.go:30] No more process to reap.
I0402 18:03:15.627853       1 controller.go:42] Processing Route: hello-nginx
I0402 18:03:15.627870       1 controller.go:43]            Alias: www.example.com
I0402 18:03:15.627875       1 controller.go:44]            Event: ADDED

@danmcp

danmcp commented May 11, 2015

Still an issue?

@danmcp added the kind/bug, priority/P2, and component/routing labels May 11, 2015
@danmcp

danmcp commented May 15, 2015

@smarterclayton ^^

@danmcp closed this as completed May 27, 2015
@pweil-
Contributor

pweil- commented May 27, 2015

I was unable to reproduce this, @smarterclayton. If you are no longer seeing this issue, it can be closed.

@gravis

gravis commented Dec 23, 2015

We're having this today.
OS 1.1.0.1
The router works for a few minutes, then stops completely, with this in the logs:

[...]
E1223 15:29:05.442754       1 controller.go:100] error reloading router: wait: no child processes
---
+ config_file=/var/lib/haproxy/conf/haproxy.config
+ pid_file=/var/lib/haproxy/run/haproxy.pid
+ old_pid=
+ haproxy_conf_dir=/var/lib/haproxy/conf
+ for mapfile in '"$haproxy_conf_dir"/*.map'
+ sort -r /var/lib/haproxy/conf/os_edge_http_be.map -o /var/lib/haproxy/conf/os_edge_http_be.map
+ for mapfile in '"$haproxy_conf_dir"/*.map'
+ sort -r /var/lib/haproxy/conf/os_edge_http_expose.map -o /var/lib/haproxy/conf/os_edge_http_expose.map
+ for mapfile in '"$haproxy_conf_dir"/*.map'
+ sort -r /var/lib/haproxy/conf/os_edge_http_redirect.map -o /var/lib/haproxy/conf/os_edge_http_redirect.map
+ for mapfile in '"$haproxy_conf_dir"/*.map'
+ sort -r /var/lib/haproxy/conf/os_http_be.map -o /var/lib/haproxy/conf/os_http_be.map
+ for mapfile in '"$haproxy_conf_dir"/*.map'
+ sort -r /var/lib/haproxy/conf/os_reencrypt.map -o /var/lib/haproxy/conf/os_reencrypt.map
+ for mapfile in '"$haproxy_conf_dir"/*.map'
+ sort -r /var/lib/haproxy/conf/os_sni_passthrough.map -o /var/lib/haproxy/conf/os_sni_passthrough.map
+ for mapfile in '"$haproxy_conf_dir"/*.map'
+ sort -r /var/lib/haproxy/conf/os_tcp_be.map -o /var/lib/haproxy/conf/os_tcp_be.map
+ '[' -f /var/lib/haproxy/run/haproxy.pid ']'
+ old_pid=254
+ '[' -n 254 ']'
+ /usr/sbin/haproxy -f /var/lib/haproxy/conf/haproxy.config -p /var/lib/haproxy/run/haproxy.pid -sf 254

@pweil-
Contributor

pweil- commented Dec 23, 2015

@gravis are you able to reproduce this consistently? If so, could you elaborate on the steps taken? Do you have any details about route creation or any actions taken on the pod before this showed up?

@mrunalp - this has the same symptoms as #1434, which was resolved by using syscall.WNOHANG in the reaper.
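
For context, here is a minimal sketch of what a non-blocking reap loop with syscall.WNOHANG can look like in Go on Linux. It is an illustration only, not the code from #1434 or from the router's reaper; the names reapExited and watch are hypothetical.

```go
package main

import (
	"fmt"
	"os"
	"os/signal"
	"syscall"
)

// reapExited collects every child that has already exited, without blocking,
// and returns the pids it reaped.
func reapExited() []int {
	var reaped []int
	for {
		var status syscall.WaitStatus
		pid, err := syscall.Wait4(-1, &status, syscall.WNOHANG, nil)
		if err == syscall.EINTR {
			continue // interrupted by a signal: retry
		}
		if err != nil || pid <= 0 {
			// ECHILD means there are no children at all; pid == 0 means
			// children exist but none has exited yet. Either way, stop.
			return reaped
		}
		reaped = append(reaped, pid)
	}
}

// watch runs reapExited on every SIGCHLD until stop is closed.
func watch(stop <-chan struct{}) {
	sigs := make(chan os.Signal, 1)
	signal.Notify(sigs, syscall.SIGCHLD)
	defer signal.Stop(sigs)
	for {
		select {
		case <-sigs:
			for _, pid := range reapExited() {
				fmt.Println("reaped process with pid", pid)
			}
		case <-stop:
			return
		}
	}
}

func main() {
	stop := make(chan struct{})
	close(stop) // stop immediately here; a real reaper would keep running
	watch(stop)
	fmt.Println("children reaped by a direct call:", len(reapExited()))
}
```

Note that waiting on pid -1 collects any child, so a loop like this can still take the exit status of a process that other code is waiting on by pid. WNOHANG only guarantees that the reaper itself never blocks.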

@gravis

gravis commented Dec 23, 2015

It was on a production cluster, so I can't run many tests. We just took the errors from the log one by one. Most of them were related to mounted volumes (glusterfs). We destroyed the pod many times, and finally rebooted the nodes, and the issue was gone...

@gravis

gravis commented Dec 23, 2015

I wonder if it could be related to routes using the default certificate.
I have a lot of:

W1223 21:12:22.086795       1 router.go:524] a edge terminated route with host [redacted] does not have the required certificates.  The route will still be created but no certificates will be written

I have removed these routes for the night and, so far, there has been no occurrence of the "error reloading router: wait: no child processes" message.

jpeeler added a commit to jpeeler/origin that referenced this issue Oct 23, 2017
…service-catalog/' changes from aa27078754..dabde2eb85
jpeeler added a commit to jpeeler/origin that referenced this issue Oct 24, 2017
…service-catalog/' changes from aa27078754..510060232e
sttts pushed a commit to sttts/origin that referenced this issue Aug 26, 2019