NVIDIA-CUDA-EXPORTER

Overview

This README provides comprehensive information about the project, including its purpose, usage instructions, and configuration details.

Dockerized Python Application with NVIDIA CUDA Support

Dockerfile

The Dockerfile sets up a Docker image for running a Python application that utilizes NVIDIA CUDA for GPU computation. It includes the following steps:

Starts from the NVIDIA CUDA image with Python support (nvidia/cuda:12.3.2-cudnn9-runtime-ubuntu20.04).
Installs Python, pip, and virtualenv.
Creates a virtual environment and activates it.
Upgrades pip in the virtual environment.
Copies the requirements.txt file into the container and installs dependencies.
Makes sure the appuser owns the application directory.
Copies the gpu_metrics.py script into the container.
Exposes port 8888.
Specifies the command to run the script (CMD ["python3", "gpu_metrics.py"]).

Usage

To use the Docker image:

Clone or download the repository containing the Dockerfile and other necessary files.
Build the Docker image:

docker build -t nvidia-cuda-exporter .

Run a Docker container based on the image:

docker run --name nvidia-cuda-exporter --gpus all -p 8888:8888 -v /usr/local/nvidia:/usr/local/nvidia nvidia-cuda-exporter

Access the Python application through a web browser or programmatically: http://localhost:<host-port>

Replace <host-port> with the desired port on your host machine.

NVIDIA GPU Metrics Collector

Python Script

The gpu_metrics_collector.py script collects GPU metrics using NVIDIA Management Library (NVML) and exposes them as Prometheus metrics via an HTTP server. It includes the following functionality:

Collects various GPU metrics such as memory usage, utilization, temperature, power usage, fan speed, clock speeds, PCIe transmission speed, and more.
Exposes the metrics at http://localhost:8888/metrics.

Usage

To use the script:

Clone or download the repository containing the script.
Install the required dependencies: pip install prometheus_client pynvml
Run the script: python gpu_metrics_collector.py

Access the metrics at http://localhost:8888/metrics.

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
Dockerfile		Dockerfile
README.md		README.md
gpu_metrics.py		gpu_metrics.py
nvidia-cuda-exporter-dashboard.json		nvidia-cuda-exporter-dashboard.json
renovate.json		renovate.json
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NVIDIA-CUDA-EXPORTER

Overview

Dockerized Python Application with NVIDIA CUDA Support

Dockerfile

Usage

NVIDIA GPU Metrics Collector

Python Script

Usage

About

Releases

Packages

Contributors 2

Languages

h4ckm1n-dev/nvidia-cuda-exporter

Folders and files

Latest commit

History

Repository files navigation

NVIDIA-CUDA-EXPORTER

Overview

Dockerized Python Application with NVIDIA CUDA Support

Dockerfile

Usage

NVIDIA GPU Metrics Collector

Python Script

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages