
rbd: don't delete volume/snapshot if metadata creation fails #201

Merged: 2 commits merged into ceph:csi-v1.0 on Feb 14, 2019

Conversation

@gman0 (Contributor) commented Feb 14, 2019

When MetadataStore.Create() failed in either CreateVolume or CreateSnapshot, the rbd volume/snapshot would get deleted and the whole creation process would start over (volume created successfully -> failed to store metadata for volume -> delete volume -> create volume again -> try to store metadata again -> ...). This is quite expensive error handling.

This PR relies on the idempotency of both rbd volume/snapshot creation and metadata creation. If the volume/snapshot was created successfully but storing its metadata failed, reuse the volume/snapshot and, on the next attempt, only try to store the metadata again (a code sketch of this flow follows the two walkthroughs below):

CreateVolume first attempt:

  1. create volume
  2. volume created successfully
  3. store volume metadata
  4. failed to store metadata, exit with error

CreateVolume second attempt:

  1. create volume
  2. volume already exists
  3. store volume metadata
  4. let's say we're now successful :)

The same goes for snapshots.
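
To make the retry contract concrete, here is a minimal, self-contained sketch of the flow described above. This is not the ceph-csi code: createIfAbsent, persistMetadata, and the maps standing in for the cluster and the metadata store are all illustrative.

package main

import (
	"errors"
	"fmt"
)

var errExists = errors.New("already exists")

// createIfAbsent stands in for the idempotent rbd image/snapshot creation.
func createIfAbsent(name string, images map[string]bool) error {
	if images[name] {
		return errExists // image is reused, not recreated
	}
	images[name] = true
	return nil
}

// persistMetadata stands in for MetadataStore.Create, which is likewise idempotent.
func persistMetadata(name string, meta map[string]bool, fail bool) error {
	if meta[name] {
		return nil // metadata already persisted
	}
	if fail {
		return errors.New("metadata store unavailable")
	}
	meta[name] = true
	return nil
}

// createVolume mirrors the new behaviour: if persisting metadata fails, return
// the error and keep the image, so the next attempt only has to store metadata.
func createVolume(name string, images, meta map[string]bool, failMeta bool) error {
	if err := createIfAbsent(name, images); err != nil && !errors.Is(err, errExists) {
		return err
	}
	return persistMetadata(name, meta, failMeta)
}

func main() {
	images, meta := map[string]bool{}, map[string]bool{}
	fmt.Println(createVolume("pvc-1", images, meta, true))  // first attempt: metadata store fails
	fmt.Println(createVolume("pvc-1", images, meta, false)) // second attempt: image reused, metadata stored
}

Running the sketch prints the metadata error on the first attempt and nil on the second, which is exactly the two-attempt sequence walked through above.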

@@ -115,6 +115,15 @@ func parseVolCreateRequest(req *csi.CreateVolumeRequest) (*rbdVolume, error) {
    return rbdVol, nil
}

func storeVolumeMetadata(vol *rbdVolume, cp util.CachePersister) error {
    if err := cp.Create(vol.VolID, vol); err != nil {
Member:
Any special handling if the err is "already exists"?

gman0 (Contributor, Author):

Already taken care of :)

_, err = k8scm.Client.CoreV1().ConfigMaps(k8scm.Namespace).Create(cm)
if err != nil {
    if apierrs.IsAlreadyExists(err) {
        klog.V(4).Infof("k8s-cm-cache: configmap already exists")
        return nil
    }
    return errors.Wrapf(err, "k8s-cm-cache: couldn't persist %s metadata as configmap", identifier)
}
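
For context, a hedged sketch of how the new storeVolumeMetadata helper shown in the diff above presumably continues past the two visible lines; only the signature and the cp.Create call come from the excerpt, while the logging and plain error return are assumptions.

func storeVolumeMetadata(vol *rbdVolume, cp util.CachePersister) error {
	if err := cp.Create(vol.VolID, vol); err != nil {
		// The k8s configmap cache already swallows "already exists" (see the
		// snippet above), so any error reaching this point is a real failure.
		klog.Errorf("failed to store metadata for volume %s: %v", vol.VolID, err)
		return err
	}
	return nil
}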

@rootfs (Member) commented Feb 14, 2019

ha, now that nolint can help:

pkg/rbd/controllerserver.go:308::warning: cyclomatic complexity 11 of function (*ControllerServer).CreateSnapshot() is high (> 10) (gocyclo)
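
For reference, the usual way to silence that warning is a nolint directive on the offending function; the exact directive syntax depends on which linter the CI runs, so treat this as an assumption rather than the change actually made.

// nolint: gocyclo
func (cs *ControllerServer) CreateSnapshot(ctx context.Context, req *csi.CreateSnapshotRequest) (*csi.CreateSnapshotResponse, error) {
	// ...
}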

@mergify (bot) merged commit 49f5d4a into ceph:csi-v1.0 on Feb 14, 2019
@gman0 mentioned this pull request Feb 15, 2019
@@ -136,6 +145,11 @@ func (cs *ControllerServer) CreateVolume(ctx context.Context, req *csi.CreateVol
    // request
    if exVol.VolSize >= req.GetCapacityRange().GetRequiredBytes() {
        // existing volume is compatible with new request and should be reused.

        if err = storeVolumeMetadata(exVol, cs.MetadataStore); err != nil {
Collaborator:

@gman0 I have a question here: as this exVol is retrieved from the metadataStore, why do we need to store it back?

gman0 (Contributor, Author):

Well, yes and no... rbdVolumes (and snapshots) is initialized from the metadata cache, but after that it's populated by the CreateVolume function itself: https://github.com/gman0/ceph-csi/blob/d9f71d3887f4eaed3cfa34de8bbd1686163d0a32/pkg/rbd/controllerserver.go#L178
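
A minimal illustrative sketch of the point above; everything except the rbdVolumes name is hypothetical. The in-memory map is seeded from the metadata cache on startup but also updated by CreateVolume at runtime, so an entry in it does not guarantee that the metadata was ever persisted.

// populated on startup from the metadata cache, and again by CreateVolume at runtime
var rbdVolumes = map[string]*rbdVolume{}

// reuseExistingVolume is a hypothetical helper showing why the existing-volume
// path still calls storeVolumeMetadata: a previous attempt may have created the
// image and added it to rbdVolumes, yet failed before the metadata was persisted.
// Storing again is safe because the metadata store is idempotent.
func reuseExistingVolume(exVol *rbdVolume, cp util.CachePersister) error {
	return storeVolumeMetadata(exVol, cp)
}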

Collaborator:

But my question is why we have to store it back. I think we can update the local cache once storing the metadata in the configmap succeeds. By doing this we can avoid the extra step of storing when the volume is already present, can't we?

gman0 (Contributor, Author):

Oops, you're right! For whatever reason I've completely missed this bit https://github.com/gman0/ceph-csi/blob/d9f71d3887f4eaed3cfa34de8bbd1686163d0a32/pkg/rbd/controllerserver.go#L172-L176

which checks for the existence of the image too... nice catch!

Collaborator:

@gman0 Even with the current design, suppose the CSI pod restarts after creating the volume in the backend and before storing the metadata in the configmap; we may end up having stale volumes in the backend. Do we have a CLI command in cephfs to list the volumes? (By checking the backend we can ensure that there won't be any stale volumes even if CSI restarts.) (Note: PVC create/delete may become a slow process.)

Collaborator:

Reading more through the code, the only issue we have with a restart (as explained above) is with the volumeID and snapshotID: because we generate a new UUID for each request, we may generate two volume IDs for the same volume if a restart happens.

#205 fixes the issue for now, but the approach suggested by @ShyamsundarR in #135 (comment) will fix the issue in the long run.

@@ -311,6 +323,10 @@ func (cs *ControllerServer) CreateSnapshot(ctx context.Context, req *csi.CreateS
    // check for the requested source volume id and already allocated source volume id
    if exSnap, err := getRBDSnapshotByName(req.GetName()); err == nil {
        if req.SourceVolumeId == exSnap.SourceVolumeID {
            if err = storeSnapshotMetadata(exSnap, cs.MetadataStore); err != nil {
Collaborator:

Same here as well for the snapshot path.
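
For completeness, the snapshot-side helper presumably mirrors storeVolumeMetadata; a hedged sketch, where the SnapID field name and the logging are assumptions:

func storeSnapshotMetadata(rbdSnap *rbdSnapshot, cp util.CachePersister) error {
	if err := cp.Create(rbdSnap.SnapID, rbdSnap); err != nil {
		klog.Errorf("failed to store metadata for snapshot %s: %v", rbdSnap.SnapID, err)
		return err
	}
	return nil
}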
