    Repositories list

    • felafax

      Public
      Felafax is building AI infra for non-NVIDIA GPUs
      Python
      Apache License 2.0
      2751110 · Updated Nov 28, 2024
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      4.7k000 · Updated Oct 21, 2024
    • Jupyter Notebook
      0000 · Updated Sep 3, 2024
    • gemma

      Public
      Open weights LLM from Google DeepMind.
      Python
      Apache License 2.0
      311000 · Updated Jul 30, 2024
    • felafax-gateway is a fast and lightweight proxy for LLMs, written in Rust. Designed for low latency and high scalability.
      Rust
      0200 · Updated Jul 30, 2024