
When the csi-plugin needs to exit and restart for an upgrade or after a panic, pods receive the error 'Transport endpoint is not connected' #91

Closed
huaizong opened this issue Oct 17, 2018 · 9 comments

Comments

huaizong commented Oct 17, 2018

When the csi-plugin exits and restarts for an upgrade or after a panic, the pod receives the error 'Transport endpoint is not connected'. Does cephfs-csi plan to support remounting previously mounted paths when the csi-plugin starts?

df: ‘/var/lib/kubelet/pods/d132d662-d1d1-11e8-8297-28d24488ad30/volumes/kubernetes.io~csi/pvc-d123868ad1d111e8/mount’: Transport endpoint is not connected
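For context: once the ceph-fuse process backing the mount is gone, any access to the mountpoint fails with ENOTCONN, which is exactly the error df reports above. Below is a minimal Go sketch of how such a stale mountpoint can be detected; the helper name is illustrative and the path is just the one from the df output, this is not ceph-csi code.

```go
// Minimal sketch (not ceph-csi code): detect a FUSE mountpoint whose daemon
// has exited. Accessing such a path fails with ENOTCONN, i.e. the
// "Transport endpoint is not connected" error shown above.
package main

import (
	"errors"
	"fmt"
	"os"
	"syscall"
)

// isStaleFuseMount reports whether stat()ing the path fails with ENOTCONN.
func isStaleFuseMount(path string) bool {
	_, err := os.Stat(path)
	return errors.Is(err, syscall.ENOTCONN)
}

func main() {
	// Path copied from the df output in this issue; purely an example.
	path := "/var/lib/kubelet/pods/d132d662-d1d1-11e8-8297-28d24488ad30/volumes/kubernetes.io~csi/pvc-d123868ad1d111e8/mount"
	if isStaleFuseMount(path) {
		fmt.Println("stale FUSE mount:", path)
	}
}
```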
gman0 commented Oct 17, 2018

Please attach the plugin logs.

huaizong changed the title from "when csi-plugin exit and restart for upgrade or painc, pod will recive error msg that 'Transport endpoint is not connected'," to "when csi-plugin need exit and restart for upgrade or painc, pod will recive error msg that 'Transport endpoint is not connected'," Oct 18, 2018
huaizong (Author) commented
What I mean is that ceph-csi may need a feature to remount previously mounted paths.

rootfs commented Oct 18, 2018

The mount path is given by kubelet. If the pod is deleted, the mountpoint will be gone too.

huaizong (Author) commented

> The mount path is given by kubelet. If the pod is deleted, the mountpoint will be gone too.

If we need to upgrade ceph-csi now, we have to taint every node and drain every pod that uses the ceph-csi plugin. If the ceph-csi plugin supported remounting the last mounted paths, it could support rolling updates. (A sketch of the current drain workaround follows below.)
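For reference, a rough sketch of that cordon/drain-then-upgrade workaround; it simply shells out to kubectl, the node name is illustrative, and this is not an official ceph-csi procedure.

```go
// Hedged sketch of the upgrade workaround discussed in this issue: drain a
// node before restarting the cephfs csi-plugin on it, then uncordon it.
package main

import (
	"fmt"
	"os"
	"os/exec"
)

// run invokes kubectl with the given arguments, streaming its output.
func run(args ...string) error {
	cmd := exec.Command("kubectl", args...)
	cmd.Stdout, cmd.Stderr = os.Stdout, os.Stderr
	return cmd.Run()
}

func main() {
	node := "worker-1" // illustrative node name

	// Evict pods (including those using CephFS PVs) before the plugin
	// restarts, so no pod is left holding a mount whose FUSE daemon dies.
	if err := run("drain", node, "--ignore-daemonsets"); err != nil {
		fmt.Fprintln(os.Stderr, "drain failed:", err)
		os.Exit(1)
	}

	// ... upgrade/restart the csi-plugin pod on this node here ...

	// Allow pods to schedule back onto the node.
	if err := run("uncordon", node); err != nil {
		fmt.Fprintln(os.Stderr, "uncordon failed:", err)
		os.Exit(1)
	}
}
```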

huaizong reopened this Oct 20, 2018
tangle329 commented
I hit the same issue. Is there any solution for it? Do we need to monitor the plugin and drain the node when it restarts, panics, or is updated?

rootfs commented Nov 20, 2018

Yes, drain the node before the update. It is not the best solution, but it gives you some protection.

Madhu-1 commented Mar 18, 2019

@rootfs do we need to do something in the code to fix this issue? If not, can we close this one?

rootfs commented Mar 19, 2019

@Madhu-1 would you add an upgrade process to the README? For CephFS mounts, drain the node before the upgrade. I believe this process applies to other FUSE-mount drivers too.

huaizong commented Mar 20, 2019

@rootfs as mentioned in #217, if the CSI plugin exits unexpectedly, a pod using a CephFS PV cannot recover automatically until the pod is killed and rescheduled. I think this may be a problem. Maybe the CSI plugin can do more to remount the old path, so that when the plugin exits and restarts, the old mount path is usable again and the pod can recover automatically.

ShyamsundarR pushed a commit to ShyamsundarR/ceph-csi that referenced this issue Apr 25, 2019
issue ceph#217

Goal

We want to handle the case where the CSI plugin exits unexpectedly: a pod using a CephFS PV cannot recover automatically, because the mount relation is lost, until the pod is killed and rescheduled to another node. This may be a problem. The CSI plugin can do more to remount the old path, so that when the plugin exits and restarts, the old mount path is usable again and the pod can recover automatically.

Non-goal

The pod should exit and restart when the CSI plugin pod exits and the mount point is lost. If the pod does not exit, it will get the error **transport endpoint is not connected**.

Implementation logic

csi-plugin start:

1. Load all MountCachEntry records from the node-local directory.
2. Check whether the volID still exists in the cluster; if not, ignore the entry, otherwise continue.
3. Check whether the stagingPath exists; if yes, mount the path.
4. Check whether each targetPath exists; if yes, bind-mount it to the staging path.

NodeServer:

1. NodeStageVolume: add a MountCachEntry to the local dir, including the read-only attribute and the Ceph secret.
2. NodePublishVolume: add the pod's bind-mount path to the MountCachEntry and persist it to the local dir.
3. NodeUnpublishVolume: remove the pod's bind-mount path from the MountCachEntry and persist it to the local dir.
4. NodeUnstageVolume: remove the MountCachEntry from the local dir.
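A rough, self-contained Go sketch of the recovery flow laid out above. Every name here (mountCacheEntry, the cache directory, remountCephFS) is an illustrative assumption, not the actual ceph-csi implementation.

```go
// Sketch of a plugin-start recovery pass: reload persisted mount-cache
// entries, re-mount each staging path, and re-create per-pod bind mounts.
package main

import (
	"encoding/json"
	"log"
	"os"
	"os/exec"
	"path/filepath"
)

// mountCacheEntry is the per-volume record the NodeServer would persist on
// NodeStageVolume/NodePublishVolume and delete on NodeUnpublishVolume/
// NodeUnstageVolume. Field names are illustrative.
type mountCacheEntry struct {
	VolumeID    string   `json:"volumeID"`
	StagingPath string   `json:"stagingPath"`
	TargetPaths []string `json:"targetPaths"`
	ReadOnly    bool     `json:"readOnly"`
}

// cacheDir is an assumed node-local location for the persisted entries.
const cacheDir = "/var/lib/kubelet/plugins/cephfs.csi.ceph.com/mount-cache"

// recoverMounts runs once at plugin start.
func recoverMounts(volumeExists func(volID string) bool) error {
	files, err := filepath.Glob(filepath.Join(cacheDir, "*.json"))
	if err != nil {
		return err
	}
	for _, file := range files {
		data, err := os.ReadFile(file)
		if err != nil {
			return err
		}
		var e mountCacheEntry
		if err := json.Unmarshal(data, &e); err != nil {
			return err
		}
		// Steps 1-2: skip entries whose volume no longer exists in the cluster.
		if !volumeExists(e.VolumeID) {
			continue
		}
		// Step 3: re-mount the staging path (actual ceph-fuse/kernel mount elided).
		if err := remountCephFS(e.VolumeID, e.StagingPath, e.ReadOnly); err != nil {
			log.Printf("staging remount failed for %s: %v", e.VolumeID, err)
			continue
		}
		// Step 4: re-create the bind mount into each pod's target path. The old
		// bind mount is stale (ENOTCONN) once the FUSE daemon is gone, so detach
		// it lazily before binding again.
		for _, target := range e.TargetPaths {
			_ = exec.Command("umount", "-l", target).Run()
			if out, err := exec.Command("mount", "--bind", e.StagingPath, target).CombinedOutput(); err != nil {
				log.Printf("bind mount %s -> %s failed: %v (%s)", e.StagingPath, target, err, out)
			}
		}
	}
	return nil
}

// remountCephFS stands in for the real ceph-fuse / mount.ceph call that would
// use the cached credentials and read-only flag.
func remountCephFS(volID, stagingPath string, readOnly bool) error {
	_, _, _ = volID, stagingPath, readOnly
	return nil
}

func main() {
	// Assume every cached volume still exists; real code would ask the cluster.
	if err := recoverMounts(func(string) bool { return true }); err != nil {
		log.Fatal(err)
	}
}
```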
wilmardo pushed a commit to wilmardo/ceph-csi that referenced this issue Jul 29, 2019
issue ceph#217

Rakshith-R referenced this issue in Rakshith-R/ceph-csi May 26, 2022
sync devel branch with upstream