We utilized GH200 systems at JLSE testbeds at ALCF. We use apptainers to setup vLLM.
- Build a container
$ source build-container.sh
This script builds a apptainer image vllm-gh200.def
using vllm-gh200.sif
definition file in the same directory.
You will need power_utils.py
file for power metric collectring in the same direcotry as the benchmark_power.py
- Use provided shell script
run-container-throughput.sh
in this directory to run container that runsrun-throughput-bench.sh
to invokebenchmark_throughput.py
for various configurations of input, output lengths and batch sizes.
source run-container-throughput.sh
- Use provided shell script
run-container-throughput.sh
in this directory to run container that runsrun-power-bench.sh
to invokebenchmark_power.py
for various configurations of input, output lengths and batch sizes.
source run-container-power.sh