A proof-of-concept swiss-army knife for managing High-Performance Computing clusters built with Ubuntu and Juju.
Pluto is proof-of-concept application for demonstrating the work that the High-Performance Computing (HPC) team at Canonical has accomplished over the past several months. Pluto showcases the Charmed HPC project by automatically bootstrapping a working cluster. The following services are deployed by Pluto to build the cluster:
- SLURM + Munge -> Provides workload scheduling and resource management.
- GLAuth + SSSD -> Provides identity management service for users and groups.
- NFSv4 -> Provides shared filesystem.
After several minutes, you will have a fully-functional charmed micro-HPC cluster at your fingertips! Take the cluster for a spin and see how Ubuntu can meet your HPC needs!
Note: Pluto is proof-of-concept application to set up a small, working Charmed HPC cluster for personal experimentation. It should not be used for production-level deployments.
Your system needs to have the following requirements installed to use pluto to deploy a micro-HPC cluster:
Also, you should have access to a Juju supported cloud as well. Pluto only supports machine charms, so Kubernetes-base clouds such as microk8s or Google GKE are not supported. If you are planning on using LXD as your cloud, see the section Appendix: Using LXD for extra steps before you bootstrap the micro-HPC cluster.
After installing snapd and juju, bootstrap a cloud controller using the following command:
juju bootstrap
You will be taken an interactive dialog to configure your cloud controller. Please visit the Juju documentation for specific information on how to bootstrap a Juju controller for your target cloud.
Once the Juju controller for the cloud of you choice has been bootstrapped, use snap
to install pluto:
sudo snap install pluto
Now use the following command to bootstrap your HPC cluster:
pluto bootstrap test-cluster
In several minutes you will now have access to your very own HPC cluster! Have fun!
pluto will not initially work with LXD due to LXD containers being initially unable to mount or export NFS shares. This has to do with LXD containers AppArmor configuration. To enable NFS exporting/mounting, use the following commands to modify the default LXD profile on your system. This needs to be done before you bootstrap the HPC cluster with pluto:
lxc profile set default security.privileged true
lxc profile set default raw.apparmor 'mount fstype=nfs*, mount fstype=rpc_pipefs,'
Note: Given that you need to elevate the privileges of the LXD containers for the HPC cluster to function, it is strongly recommended to use another cloud provider such as OpenStack or MAAS.
Pluto is just a proof-of-concept for deploying charmed HPC clusters, so it most likely will not receive heavy feature development. However, if you notice any issues, feel free to open a bug report!
Pluto is free software, distributed under the GNU General Public License version 3. See LICENSE for more details.