Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Temporarily decom MIT Brain hubs (bican, dandi, linc) #4097

Merged
merged 2 commits into from
May 22, 2024

Conversation

yuvipanda
Copy link
Member

@yuvipanda yuvipanda commented May 17, 2024

Partnerships is working on reconfiguring this relationship, but it isn't going to look like 3 separate hubs on 3 separate clusters. Decom this to save our cloud costs, as nobody is currently using these.

Based on a meeting I just had with them (along with @colliand). Will have an associated issue soon.

Ref https://github.com/2i2c-org/leads/issues/344

Partnerships is working on reconfiguring this relationship,
but it isn't going to look like 3 separate hubs on 3 separate
clusters. Decom this to save our cloud costs, as nobody is
currently using these.
@yuvipanda
Copy link
Member Author

I've terraform destroyed everything for now.

@consideRatio consideRatio changed the title Temporarily decom MIT Brain hubs Temporarily decom MIT Brain hubs (bican, dandi, linc) May 21, 2024
@consideRatio
Copy link
Contributor

These clusters still incurring cloud costs from k8s clusters (72 USD / month) a core node (~180 USD / month), as this was controlled via eksctl rather than terraform.

@yuvipanda if you think its OK to proceed i'll just delete the eksctl resources as well.

@yuvipanda
Copy link
Member Author

@consideRatio ah, I missed that. yes, please do when you get a minute.

@consideRatio
Copy link
Contributor

consideRatio commented May 22, 2024

Done, bican, dandi, linc deleted.

jsonnet $CLUSTER_NAME.jsonnet > $CLUSTER_NAME.eksctl.yaml
# WARNING: eksctl delete cluster doesn't provide a confirmation with --approve, I've opened issue 7777 in eksctl about this https://github.com/eksctl-io/eksctl/issues/7777
eksctl delete cluster --config-file $CLUSTER_NAME.eksctl.yaml
# to help drain operation from another terminal without availability of deployer creds
eksctl utils write-kubeconfig --config-file=$CLUSTER_NAME.eksctl.yaml --auto-kubeconfig
export KUBECONFIG=/home/erik/.kube/eksctl/clusters/$CLUSTER_NAME

# eksctl reports: 6 pods are unevictable from node ip-192-168-8-78.us-east-2.compute.internal
# I think four of these referred to two coredns pods and two ebs-csi-controller
# pods with protection from PDB resources
# I think another pod was from the cryptnono daemonset, which as a daemonset
# shouldn't cause issues, but maybe it had a emptyDir volume or similar that maybe requires
# a flag that it was ok to delete also such pods?

This comment was marked as outdated.

@consideRatio consideRatio added the deployer:skip-deploy Skips deployment of anything (support, staging, prod) label May 22, 2024
@consideRatio
Copy link
Contributor

  • we'll now start seeing pagerduty warnings I think
  • we'll also until this is merged risk causing additional maintenance work and confusion about bican/dandi/linc during maintenance

Due to this, I'll go for a merge - then the pagerduty healthchecks are deleted I think via automation scanning our cluster config files.

@consideRatio consideRatio merged commit e8f25c8 into 2i2c-org:main May 22, 2024
39 checks passed
Copy link

🎉🎉🎉🎉

Monitor the deployment of the hubs here 👉 https://github.com/2i2c-org/infrastructure/actions/runs/9185897951

@consideRatio
Copy link
Contributor

consideRatio commented May 22, 2024

I applied the deployer:skip-deploy label as 166e53f incorrectly seemed to have caused the wrong conclusion that all hubs need to re-deploy and test.

I've opened #4116 about this

@sgibson91
Copy link
Member

incorrectly seemed to have caused the wrong conclusion that all hubs need to re-deploy and test.

This can happen if the branch is not up-to-date with main #2766

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
deployer:skip-deploy Skips deployment of anything (support, staging, prod)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants