trainium

Here are 2 public repositories matching this topic...

A high-throughput and memory-efficient inference and serving engine for LLMs

Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stack options.

Add a description, image, and links to the trainium topic page so that developers can more easily learn about it.

To associate your repository with the trainium topic, visit your repo's landing page and select "manage topics."