[BUG]: Not able to run more then one replicas of csm-isilon-controller after upgrading to dell-csm-operator-controller-manager 1.4.0 #1099
Labels
area/csi-powerscale
Issue pertains to the CSI Driver for Dell EMC PowerScale
needs-triage
Issue requires triage.
type/bug
Something isn't working. This is the default label associated with a bug issue.
Bug Description
The first controller start without any problems. For the second controller five containers (podmon,resizer,attacher,provisioner, snapshotter) always gets: "Waiting on connection to driver csi.sock: context deadline exceeded".
The CSM Operator was installed manually using the without OLM using: "bash scripts/install.sh" since the CSM Operator 1.4.0 doesn't seems to be available at RedHats OpertorHub at the moment.
This issue has been reproduced in 3 different cluster.
Logs
Defaulted container "podmon" out of: podmon, resizer, attacher, provisioner, snapshotter, csi-metadata-retriever, driver
time="Fri, 12 Jan 2024 15:27:49 UTC" level=info msg="parameter value after config file processing" PODMON_CONTROLLER_LOG_LEVEL=debug
time="Fri, 12 Jan 2024 15:27:49 UTC" level=info msg="parameter value after config file processing" monitor.ArrayConnectivityPollRate=1m0s
time="Fri, 12 Jan 2024 15:27:49 UTC" level=info msg="parameter value after config file processing" monitor.ArrayConnectivityConnectionLossThreshold=3
time="Fri, 12 Jan 2024 15:27:49 UTC" level=info msg="parameter value after config file processing" monitor.PodMonitor.SkipArrayConnectionValidation=false
time="Fri, 12 Jan 2024 15:27:49 UTC" level=info msg="Running in controller mode"
time="Fri, 12 Jan 2024 15:27:49 UTC" level=info msg="CSI Driver for PowerScale"
time="Fri, 12 Jan 2024 15:27:49 UTC" level=info msg="attempting k8sapi connection"
time="Fri, 12 Jan 2024 15:27:49 UTC" level=info msg="Using InClusterConfig()"
time="Fri, 12 Jan 2024 15:27:49 UTC" level=info msg="connected to k8sapi"
time="Fri, 12 Jan 2024 15:27:49 UTC" level=info msg="Attempting driver connection at: unix:/var/run/csi/csi.sock"
time="Fri, 12 Jan 2024 15:27:59 UTC" level=debug msg="grpc.Dial returned context deadline exceeded"
time="Fri, 12 Jan 2024 15:27:59 UTC" level=error msg="Waiting on connection to driver csi.sock: context deadline exceeded"
time="Fri, 12 Jan 2024 15:28:39 UTC" level=debug msg="grpc.Dial returned context deadline exceeded"
time="Fri, 12 Jan 2024 15:28:39 UTC" level=error msg="Waiting on connection to driver csi.sock: context deadline exceeded"
time="Fri, 12 Jan 2024 15:29:19 UTC" level=debug msg="grpc.Dial returned context deadline exceeded"
time="Fri, 12 Jan 2024 15:29:19 UTC" level=error msg="Waiting on connection to driver csi.sock: context deadline exceeded"
time="Fri, 12 Jan 2024 15:29:59 UTC" level=debug msg="grpc.Dial returned context deadline exceeded"
time="Fri, 12 Jan 2024 15:29:59 UTC" level=error msg="Waiting on connection to driver csi.sock: context deadline exceeded"
time="Fri, 12 Jan 2024 15:30:39 UTC" level=debug msg="grpc.Dial returned context deadline exceeded"
time="Fri, 12 Jan 2024 15:30:39 UTC" level=error msg="Waiting on connection to driver csi.sock: context deadline exceeded"
time="Fri, 12 Jan 2024 15:31:19 UTC" level=debug msg="grpc.Dial returned context deadline exceeded"
time="Fri, 12 Jan 2024 15:31:19 UTC" level=error msg="Waiting on connection to driver csi.sock: context deadline exceeded"
time="Fri, 12 Jan 2024 15:31:59 UTC" level=debug msg="grpc.Dial returned context deadline exceeded"
time="Fri, 12 Jan 2024 15:31:59 UTC" level=error msg="Waiting on connection to driver csi.sock: context deadline exceeded"
time="Fri, 12 Jan 2024 15:32:39 UTC" level=debug msg="grpc.Dial returned context deadline exceeded"
time="Fri, 12 Jan 2024 15:32:39 UTC" level=error msg="Waiting on connection to driver csi.sock: cont
Defaulted container "podmon" out of: podmon, resizer, attacher, provisioner, snapshotter, csi-metadata-retriever, driver
time="Fri, 12 Jan 2024 15:27:49 UTC" level=info msg="parameter value after config file processing" PODMON_CONTROLLER_LOG_LEVEL=debug
time="Fri, 12 Jan 2024 15:27:49 UTC" level=info msg="parameter value after config file processing" monitor.ArrayConnectivityPollRate=1m0s
time="Fri, 12 Jan 2024 15:27:49 UTC" level=info msg="parameter value after config file processing" monitor.ArrayConnectivityConnectionLossThreshold=3
time="Fri, 12 Jan 2024 15:27:49 UTC" level=info msg="parameter value after config file processing" monitor.PodMonitor.SkipArrayConnectionValidation=false
time="Fri, 12 Jan 2024 15:27:49 UTC" level=info msg="Running in controller mode"
time="Fri, 12 Jan 2024 15:27:49 UTC" level=info msg="CSI Driver for PowerScale"
time="Fri, 12 Jan 2024 15:27:49 UTC" level=info msg="attempting k8sapi connection"
time="Fri, 12 Jan 2024 15:27:49 UTC" level=info msg="Using InClusterConfig()"
time="Fri, 12 Jan 2024 15:27:49 UTC" level=info msg="connected to k8sapi"
time="Fri, 12 Jan 2024 15:27:49 UTC" level=info msg="Attempting driver connection at: unix:/var/run/csi/csi.sock"
time="Fri, 12 Jan 2024 15:27:59 UTC" level=debug msg="grpc.Dial returned context deadline exceeded"
time="Fri, 12 Jan 2024 15:27:59 UTC" level=error msg="Waiting on connection to driver csi.sock: context deadline exceeded"
time="Fri, 12 Jan 2024 15:28:39 UTC" level=debug msg="grpc.Dial returned context deadline exceeded"
time="Fri, 12 Jan 2024 15:28:39 UTC" level=error msg="Waiting on connection to driver csi.sock: context deadline exceeded"
time="Fri, 12 Jan 2024 15:29:19 UTC" level=debug msg="grpc.Dial returned context deadline exceeded"
time="Fri, 12 Jan 2024 15:29:19 UTC" level=error msg="Waiting on connection to driver csi.sock: context deadline exceeded"
time="Fri, 12 Jan 2024 15:29:59 UTC" level=debug msg="grpc.Dial returned context deadline exceeded"
time="Fri, 12 Jan 2024 15:29:59 UTC" level=error msg="Waiting on connection to driver csi.sock: context deadline exceeded"
time="Fri, 12 Jan 2024 15:30:39 UTC" level=debug msg="grpc.Dial returned context deadline exceeded"
time="Fri, 12 Jan 2024 15:30:39 UTC" level=error msg="Waiting on connection to driver csi.sock: context deadline exceeded"
time="Fri, 12 Jan 2024 15:31:19 UTC" level=debug msg="grpc.Dial returned context deadline exceeded"
time="Fri, 12 Jan 2024 15:31:19 UTC" level=error msg="Waiting on connection to driver csi.sock: context deadline exceeded"
time="Fri, 12 Jan 2024 15:31:59 UTC" level=debug msg="grpc.Dial returned context deadline exceeded"
time="Fri, 12 Jan 2024 15:31:59 UTC" level=error msg="Waiting on connection to driver csi.sock: context deadline exceeded"
time="Fri, 12 Jan 2024 15:32:39 UTC" level=debug msg="grpc.Dial returned context deadline exceeded"
time="Fri, 12 Jan 2024 15:32:39 UTC" level=error msg="Waiting on connection to driver csi.sock: context deadline exceeded"
ext deadline exceeded"
I0112 15:42:15.550926 1 main.go:93] Version : v1.9.2
I0112 15:42:15.551044 1 feature_gate.go:249] feature gates: &{map[]}
I0112 15:42:15.552151 1 connection.go:164] Connecting to unix:///var/run/csi/csi.sock
W0112 15:42:25.552252 1 connection.go:183] Still connecting to unix:///var/run/csi/csi.sock
W0112 15:42:35.552923 1 connection.go:183] Still connecting to unix:///var/run/csi/csi.sock
W0112 15:42:45.553309 1 connection.go:183] Still connecting to unix:///var/run/csi/csi.sock
F0112 15:42:45.553587 1 main.go:134] failed to connect to CSI driver: context deadline exceeded
attacher
I0112 15:42:30.555491 1 main.go:97] Version: v4.4.2
I0112 15:42:30.556813 1 connection.go:164] Connecting to unix:///var/run/csi/csi.sock
W0112 15:42:40.557690 1 connection.go:183] Still connecting to unix:///var/run/csi/csi.sock
W0112 15:42:50.557912 1 connection.go:183] Still connecting to unix:///var/run/csi/csi.sock
W0112 15:43:00.557201 1 connection.go:183] Still connecting to unix:///var/run/csi/csi.sock
E0112 15:43:00.557760 1 main.go:136] context deadline exceeded
provisioner
W0112 15:42:30.710740 1 feature_gate.go:241] Setting GA feature gate Topology=true. It will be removed in a future release.
I0112 15:42:30.710829 1 feature_gate.go:249] feature gates: &{map[Topology:true]}
I0112 15:42:30.710845 1 csi-provisioner.go:154] Version: v3.6.2
I0112 15:42:30.710849 1 csi-provisioner.go:177] Building kube configs for running in cluster...
I0112 15:42:30.711441 1 connection.go:164] Connecting to unix:///var/run/csi/csi.sock
W0112 15:42:40.712179 1 connection.go:183] Still connecting to unix:///var/run/csi/csi.sock
W0112 15:42:50.712476 1 connection.go:183] Still connecting to unix:///var/run/csi/csi.sock
W0112 15:43:00.712464 1 connection.go:183] Still connecting to unix:///var/run/csi/csi.sock
E0112 15:43:00.712581 1 csi-provisioner.go:215] context deadline exceeded
snapshotter
I0112 15:42:30.858708 1 main.go:109] Version: v6.3.2
I0112 15:42:30.859928 1 connection.go:164] Connecting to unix:///var/run/csi/csi.sock
W0112 15:42:40.860373 1 connection.go:183] Still connecting to unix:///var/run/csi/csi.sock
W0112 15:42:50.860734 1 connection.go:183] Still connecting to unix:///var/run/csi/csi.sock
W0112 15:43:00.860639 1 connection.go:183] Still connecting to unix:///var/run/csi/csi.sock
E0112 15:43:00.860672 1 main.go:174] error connecting to CSI driver: context deadline exceeded
Screenshots
Additional Environment Information
No response
Steps to Reproduce
Install the CSM Operator 1.4.0
Expected Behavior
Expected to to be able able to run two replicas of the csm-isilon-controller
CSM Driver(s)
isilon v2.9.0
Installation Type
Manually using the without OLM using: "bash scripts/install.sh"
Container Storage Modules Enabled
resiliency v1.8.0
observability v1.7.0
Container Orchestrator
OpenShift 4.13.27
Operating System
RHEL 9.2
The text was updated successfully, but these errors were encountered: