
Commit

chore(docs): add troubleshooting steps to migrate cStor pool from lost node to new node

Signed-off-by: mittachaitu <sai.chaithanya@mayadata.io>
mittachaitu committed Jul 27, 2020
1 parent d522edb commit 696f3ae
Showing 2 changed files with 233 additions and 0 deletions.
229 changes: 229 additions & 0 deletions docs/troubleshooting/migrate_cstor_pools.md
@@ -0,0 +1,229 @@
# Pool Migration when the node and underlying disks are lost

## Intro
- There can be situations where a node is lost and the disks attached to that node are lost along with it. This is very common for users who have the Kubernetes cluster autoscaler enabled, where nodes are scaled down and scaled up based on demand.

- When the new node comes up, the disks attached to it are also new (they do not contain any previous data).

Consider an example where you have the following nodes in a Kubernetes cluster, with disks attached to them:

- Worker-1 (bd-1(w1), bd-2(w1) are attached to Worker-1)
- Worker-2 (bd-1(w2), bd-2(w2) are attached to Worker-2)
- Worker-3 (bd-1(w3), bd-2(w3) are attached to Worker-3)

**NOTE**: Disks attached to a node are represented by BlockDevice (BD) resources in a cluster where OpenEBS is installed. A block device name is of the form `bd-<some-hash>`. For illustration purposes the hash is not included in the BD names used here; for example, `bd-1(w1)` represents a block device attached to Worker-1 and `bd-1(w2)` a block device attached to Worker-2.
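
For reference, the actual BD resources and the node each one is attached to can be listed in the OpenEBS namespace (this assumes OpenEBS is installed in the `openebs` namespace):
```sh
# List the blockdevice (bd) resources; the NODENAME column shows the attached node
$ kubectl get bd -n openebs
```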

## What happens if the node and underlying disks are lost?
If a node and the disks attached to it are lost in your Kubernetes cluster, the cStor pool manager pod for that node goes into the Pending state, and the cStor pool that was running on the lost node, along with the volume replicas in that pool, is marked offline. Workloads using those cStor volumes will not be able to perform read and write operations if the volumes are not configured for HA (i.e. a storage replica count of 3).

## How can this be fixed?
We can perform a few manual [steps](#steps-to-be-followed-to-move-the-pool-from-failed-node-to-new-node) to recover from this situation. Before that, this tutorial illustrates a node and underlying disk loss situation. So essentially we are trying to do the following:

**NOTE**: Performing the steps below is recommended only if the CStorVolumes are **Healthy**.

**__Migrate CStorPool when nodes and underlying disks were lost__**

## Reproduce the node and underlying disk lost situation

#### In a cloud environment
Nodes can be removed by deleting them from a Kubernetes cluster managed by an auto scaling group (ASG). If external volumes/disks are attached to the node with the option that deleting the node also deletes its disks, removing the node reproduces this situation. This has been verified on EKS and GCP managed Kubernetes clusters.
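
As an illustration only, one way to simulate this on EKS with an ASG is sketched below; the node name and instance ID are placeholders, and terminating the instance without decrementing the desired capacity makes the ASG bring up a replacement node with fresh disks:
```sh
# Drain and remove the node object from the cluster (placeholder node name)
kubectl drain ip-192-168-52-185 --ignore-daemonsets --delete-local-data
kubectl delete node ip-192-168-52-185

# Terminate the backing EC2 instance; the ASG replaces it with a new instance
# and new EBS volumes (placeholder instance ID)
aws autoscaling terminate-instance-in-auto-scaling-group \
  --instance-id i-0123456789abcdef0 \
  --no-should-decrement-desired-capacity
```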

#### On-premise
One of the nodes in the Kubernetes cluster and the disks attached to it are corrupted, or the disks consumed by the pool are corrupted by manual intervention (formatting or some other action). In this case the cStor pool pod keeps crashing roughly every 5 minutes due to the unavailability of the pool. A very common case is a corrupted disk in a striped cStor pool.

## Infrastructure details
The following are the infrastructure details of the setup where the situation was reproduced:

**Kubernetes Cluster**: Amazon(EKS)

**Kubernetes Version**: 1.15.0

**OpenEBS Version**: 1.12.0 (OpenEBS was deployed with **feature-gates="GPTBasedUUID"** enabled, since EBS volumes in EKS are virtual and some mechanism is required to identify the disks uniquely; a sketch of enabling this flag follows this list).

**Node OS**: Ubuntu 18.04
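
As referenced above, a minimal sketch of enabling the `GPTBasedUUID` feature gate, assuming the `node-disk-manager` DaemonSet from the OpenEBS operator YAML (only an excerpt; the image and other fields are elided):
```yaml
# Excerpt (assumed layout) of the node-disk-manager container in the NDM DaemonSet
containers:
  - name: node-disk-manager
    args:
      - -v=4
      - --feature-gates="GPTBasedUUID"
```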

## Installation of the cStor setup
cStor pools were created using the CSPC API by following this [doc](../quick.md), and then a cStor-CSI volume was provisioned with an HA configuration (storage replica count of 3) on top of the cStor pools.
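
For context, a minimal sketch of the kind of CSPC and StorageClass used in such a setup, assuming stripe pools and hypothetical blockdevice names (the actual manifests come from the linked doc):
```yaml
apiVersion: cstor.openebs.io/v1
kind: CStorPoolCluster
metadata:
  name: cstor-cspc
  namespace: openebs
spec:
  pools:
    - nodeSelector:
        kubernetes.io/hostname: "ip-192-168-52-185"
      dataRaidGroups:
        - blockDevices:
            - blockDeviceName: "blockdevice-<hash-w1>"   # hypothetical BD name
      poolConfig:
        dataRaidGroupType: "stripe"
    # ...one more pool entry per node, for the other two workers...
---
apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: cstor-csi-stripe
provisioner: cstor.csi.openebs.io
allowVolumeExpansion: true
parameters:
  cas-type: cstor
  cstorPoolCluster: cstor-cspc
  replicaCount: "3"   # HA configuration: 3 storage replicas
```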

### OpenEBS setup
The following resources are created after provisioning cStor pools and volumes.

- To know about the CStorPoolCluster:
```sh
$ kubectl get cspc -n openebs
NAME HEALTHYINSTANCES PROVISIONEDINSTANCES DESIREDINSTANCES AGE
cstor-cspc 3 3 3 108s
```
- To know more details about the cStor pools:
```sh
$ kubectl get cspi -n openebs
NAME HOSTNAME ALLOCATED FREE CAPACITY READONLY PROVISIONEDREPLICAS HEALTHYREPLICAS TYPE STATUS AGE
cstor-cspc-4tr5 ip-192-168-52-185 98k 9630M 9630098k false 1 1 stripe ONLINE 4m26s
cstor-cspc-xnxx ip-192-168-79-76 101k 9630M 9630101k false 1 1 stripe ONLINE 4m25s
cstor-cspc-zdvk ip-192-168-29-217 98k 9630M 9630098k false 1 1 stripe ONLINE 4m25s
```
- To know more about the cStor volume resources:
```sh
$ kubectl get cvc,cv,cvr -n openebs
NAME STATUS AGE
cstorvolumeconfig.cstor.openebs.io/pvc-81746e7a-a29d-423b-a048-76edab0b0826 Bound 7m3s

NAME USED ALLOCATED STATUS AGE
cstorvolumereplica.cstor.openebs.io/pvc-81746e7a-a29d-423b-a048-76edab0b0826-cstor-cspc-4tr5 6K 6K Healthy 7m3s
cstorvolumereplica.cstor.openebs.io/pvc-81746e7a-a29d-423b-a048-76edab0b0826-cstor-cspc-xnxx 6K 6K Healthy 7m3s
cstorvolumereplica.cstor.openebs.io/pvc-81746e7a-a29d-423b-a048-76edab0b0826-cstor-cspc-zdvk 6K 6K Healthy 7m3s

NAME STATUS AGE CAPACITY
cstorvolume.cstor.openebs.io/pvc-81746e7a-a29d-423b-a048-76edab0b0826 Healthy 7m3s 5Gi
```
Here:
- CVC refers to CStorVolumeConfig, whose controller is responsible for managing the cStor volume resources such as the target service, target deployment, CV and CVR.
- CV refers to CStorVolume, which contains the configuration required for the process running inside the cStor target pod container.
- CVR refers to CStorVolumeReplica, which contains the configuration required to create a ZFS replica in a cStor pool; this replica is responsible for reading/writing application IO to the pool. CVRs follow the naming convention `<pv_name>-<pool_name>`.

### Lost node and the underlying disks attached to the node
The node and disk loss was performed as specified [here](#reproduce-the-node-and-underlying-disk-lost-situation). Now only two nodes are present in the cluster, which means whatever data was in the corresponding cStor pool is lost. To reconstruct the data we need to make use of the rebuild feature provided by cStor (i.e. reconstructing data from the other replicas). Below are the details after losing the node and disk:
```sh
$ kubectl get cspc -n openebs
NAME HEALTHYINSTANCES PROVISIONEDINSTANCES DESIREDINSTANCES AGE
cstor-cspc 2 3 3 20m
```
- Pool Manager details
```sh
$ kubectl get po -n openebs -l app=cstor-pool
NAME READY STATUS RESTARTS AGE
cstor-cspc-4tr5-8455898c74-c7vbv 0/3 Pending 0 6m17s
cstor-cspc-xnxx-765bc8d899-7696q 3/3 Running 0 20m
cstor-cspc-zdvk-777df487c8-l62sv 3/3 Running 0 20m
```
In the above output, the cStor pool manager **cstor-cspc-4tr5**, which was scheduled on the lost node, is in the Pending state, and the CSPC output also shows only two healthy CSPI instances.

## Steps to be followed to move the pool from failed node to new node

### Step1: Remove the cStorVolumeReplicas from the lost pool

This step is required to remove the pool from the lost node. Before removing the pool we first need to remove the cStorVolumeReplicas in that pool, or else the admission server will reject the scale-down request. This can be achieved by removing the pool entry from the CStorVolumeConfig (CVC) spec section.

**Note**: This step will succeed only if the cstorvolume and target pod are in a running state.
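
A quick way to confirm this (the PV name is the one used in this tutorial; the target pod is typically named after the PV with a `-target` suffix):
```sh
# The CStorVolume should be Healthy
$ kubectl get cstorvolume pvc-81746e7a-a29d-423b-a048-76edab0b0826 -n openebs

# The corresponding target pod should be Running
$ kubectl get pods -n openebs | grep pvc-81746e7a-a29d-423b-a048-76edab0b0826-target
```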

Edit the CVC resource corresponding to the volume and change it from:
```sh
...
...
policy:
provision:
replicaAffinity: false
replica: {}
replicaPoolInfo:
- poolName: cstor-cspc-4tr5
- poolName: cstor-cspc-xnxx
- poolName: cstor-cspc-zdvk
...
...
```
to:
```sh
$ kubectl edit cvc pvc-81746e7a-a29d-423b-a048-76edab0b0826 -n openebs
...
...
policy:
provision:
replicaAffinity: false
replica: {}
replicaPoolInfo:
- poolName: cstor-cspc-xnxx
- poolName: cstor-cspc-zdvk
...
...
cstorvolumeconfig.cstor.openebs.io/pvc-81746e7a-a29d-423b-a048-76edab0b0826 edited
```
In the above spec, the **cstor-cspc-4tr5** CSPI entry has been removed from the CVC spec. Repeat the same for all the volumes that have cStor volume replicas on the lost pool, i.e. cstor-cspc-4tr5. The list of volume replicas in the lost pool can be obtained using the following command:

```sh
$ kubectl get cvr -n openebs -l cstorpoolinstance.openebs.io/name=cstor-cspc-4tr5
NAME USED ALLOCATED STATUS AGE
pvc-81746e7a-a29d-423b-a048-76edab0b0826-cstor-cspc-4tr5 6K 6K Healthy 4m7s
```

### Step2: Remove the finalizer from the cStor volume replicas
This step is required to remove the `cstorvolumereplica.openebs.io/finalizer` finalizer from the CVRs which were present on the lost cStor pool. After removing the finalizer, the CVR will be deleted from etcd. Usually the finalizer is removed by the pool-manager pod, but since that pod is not in a running state, manual intervention is required to remove it.
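
A sketch of the manual removal, assuming the CVR on the lost pool is the one identified in Step 1 (the JSON patch below drops the `finalizers` field entirely; adjust it if the CVR carries other finalizers that must be kept):
```sh
$ kubectl patch cvr pvc-81746e7a-a29d-423b-a048-76edab0b0826-cstor-cspc-4tr5 -n openebs \
    --type=json -p '[{"op": "remove", "path": "/metadata/finalizers"}]'
```

Once the finalizer is gone, the CVR on the lost pool is deleted and only the remaining replicas are listed: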
```sh
$ kubectl get cvr -n openebs
NAME USED ALLOCATED STATUS AGE
pvc-81746e7a-a29d-423b-a048-76edab0b0826-cstor-cspc-xnxx 6K 6K Healthy 52m
pvc-81746e7a-a29d-423b-a048-76edab0b0826-cstor-cspc-zdvk 6K 6K Healthy 52m
```

After this step, scaling down the CStorVolume will be successful. This can be verified from the events on the corresponding CVC:
```sh
$ kubectl describe cvc <pv_name> -n openebs
Events:
Type Reason Age From Message
---- ------ ---- ---- -------
Normal ScalingVolumeReplicas 6m10s cstorvolumeclaim-controller successfully scaled volume replicas to 2
```

### Step3: Remove the pool spec of the lost node from the CSPC
Edit the CSPC spec using `kubectl edit cspc <cspc_name> -n openebs` and remove the pool spec belonging to the node which no longer exists in the cluster (a sketch of the result follows the output below). Once the pool spec is removed, DESIREDINSTANCES becomes 2; this can be verified using `kubectl get cspc -n openebs`:
```sh
$ kubectl get cspc -n openebs
NAME HEALTHYINSTANCES PROVISIONEDINSTANCES DESIREDINSTANCES AGE
cstor-cspc 2 3 2 56m
```
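
As referenced above, after deleting the lost node's entry the remaining `spec.pools` section of the CSPC looks roughly like this (blockdevice names are placeholders):
```yaml
spec:
  pools:
    # the entry for the lost node (ip-192-168-52-185 / cstor-cspc-4tr5) has been deleted
    - nodeSelector:
        kubernetes.io/hostname: "ip-192-168-79-76"
      dataRaidGroups:
        - blockDevices:
            - blockDeviceName: "blockdevice-<hash-w2>"   # placeholder
      poolConfig:
        dataRaidGroupType: "stripe"
    - nodeSelector:
        kubernetes.io/hostname: "ip-192-168-29-217"
      dataRaidGroups:
        - blockDevices:
            - blockDeviceName: "blockdevice-<hash-w3>"   # placeholder
      poolConfig:
        dataRaidGroupType: "stripe"
```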
Since the pool manager pod is not in a running state and the pool protection finalizer, i.e. `openebs.io/pool-protection`, still exists on the CSPI, the CSPC operator is not able to delete the CSPI and keeps waiting for the finalizer to be removed. Since the CSPI is not deleted, the PROVISIONEDINSTANCES count is not updated. To fix this, the `openebs.io/pool-protection` finalizer should be removed from the CSPI that was on the lost node:
```sh
kubectl edit cspi cstor-cspc-4tr5 -n openebs
cstorpoolinstance.cstor.openebs.io/cstor-cspc-4tr5 edited
```
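
Alternatively, as a sketch, the specific finalizer can be dropped with a JSON patch; the index `0` below assumes `openebs.io/pool-protection` is the first entry in the CSPI's `metadata.finalizers` list, so inspect the list first and adjust the index if needed:
```sh
# Inspect the finalizers on the lost pool's CSPI
$ kubectl get cspi cstor-cspc-4tr5 -n openebs -o jsonpath='{.metadata.finalizers}'

# Remove the pool-protection finalizer (index assumed to be 0)
$ kubectl patch cspi cstor-cspc-4tr5 -n openebs \
    --type=json -p '[{"op": "remove", "path": "/metadata/finalizers/0"}]'
```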

After removing the finalizer, the HEALTHYINSTANCES, PROVISIONEDINSTANCES and DESIREDINSTANCES counts will match, as shown below:
```sh
$ kubectl get cspc -n openebs
NAME HEALTHYINSTANCES PROVISIONEDINSTANCES DESIREDINSTANCES AGE
cstor-cspc 2 2 2 68m
```

### Step4: Scale the cStorVolumeReplicas back to 3 using the CVC
Scale the cStorVolumeReplicas back to 3 by placing them on new or existing cStor pools that do not yet hold a replica of the corresponding volume. Since there were no spare pools in this cluster, the cStor pool cluster was scaled out by adding a pool spec for the new node.
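
A sketch of the pool entry added under `spec.pools` of the CSPC for the new node, assuming its hostname from the output below and a hypothetical blockdevice attached to it:
```yaml
    - nodeSelector:
        kubernetes.io/hostname: "ip-192-168-49-174"
      dataRaidGroups:
        - blockDevices:
            - blockDeviceName: "blockdevice-<hash-new>"   # hypothetical BD on the new node
      poolConfig:
        dataRaidGroupType: "stripe"
```

Once the CSPC has the new pool entry, a new CSPI comes up on the new node: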
```sh
$ kubectl get cspi -n openebs
NAME HOSTNAME ALLOCATED FREE CAPACITY READONLY PROVISIONEDREPLICAS HEALTHYREPLICAS TYPE STATUS AGE
cstor-cspc-bf9h ip-192-168-49-174 230k 9630M 9630230k false 0 0 stripe ONLINE 66s
```
Add the above newly created CStorPoolInstance, i.e. cstor-cspc-bf9h, under the CVC spec:
```sh
$ kubectl edit cvc pvc-81746e7a-a29d-423b-a048-76edab0b0826 -n openebs
...
...
spec:
policy:
provision:
replicaAffinity: false
replica: {}
replicaPoolInfo:
- poolName: cstor-cspc-bf9h
- poolName: cstor-cspc-xnxx
- poolName: cstor-cspc-zdvk
...
...
```
Repeat the same for all the scaled-down cStor volumes and verify that all the newly provisioned CStorVolumeReplicas (CVRs) are **Healthy**.
```sh
$ kubectl get cvr -n openebs
NAME USED ALLOCATED STATUS AGE
pvc-81746e7a-a29d-423b-a048-76edab0b0826-cstor-cspc-bf9h 6K 6K Healthy 11m
pvc-81746e7a-a29d-423b-a048-76edab0b0826-cstor-cspc-xnxx 6K 6K Healthy 96m
pvc-81746e7a-a29d-423b-a048-76edab0b0826-cstor-cspc-zdvk 6K 6K Healthy 96m
```

```sh
$ kubectl get cspi -n openebs
NAME HOSTNAME ALLOCATED FREE CAPACITY READONLY PROVISIONEDREPLICAS HEALTHYREPLICAS TYPE STATUS AGE
cstor-cspc-bf9h ip-192-168-49-174 230k 9630M 9630230k false 1 1 stripe ONLINE 66s
cstor-cspc-xnxx ip-192-168-79-76 101k 9630M 9630101k false 1 1 stripe ONLINE 4m25s
cstor-cspc-zdvk ip-192-168-29-217 98k 9630M 9630098k false 1 1 stripe ONLINE 4m25s
```
Comparing with the [previous](#openebs-setup) outputs shows that the cStor pool has been migrated from the lost node to the new node.

4 changes: 4 additions & 0 deletions docs/tutorial/intro.md
@@ -19,3 +19,7 @@ On a high level, this tutorial covers following:
2. This [link](./volumes/snapshot.md) explains the volume snapshot related operations.
3. This [link](./volumes/resize.md) explains the volume resize related operations.
4. This [link](./volumes/migration.md) explains the volume migration related operations across k8s nodes.

## FAQ and Troubleshooting

1. A node and the underlying disks in a Kubernetes cluster were lost and the cStor pools are in a pending state. How can I fix this? See [Migrate cStor Pools](../troubleshooting/migrate_cstor_pools.md).
