Skip to content

Pinned Loading

  1. flashinfer flashinfer Public

    FlashInfer: Kernel Library for LLM Serving

    Cuda 1.1k 102

  2. whl whl Public

    Pre-built wheels for flashinfer python package.

    HTML

Repositories

Showing 8 of 8 repositories
  • flashinfer Public

    FlashInfer: Kernel Library for LLM Serving

    flashinfer-ai/flashinfer’s past year of commit activity
    Cuda 1,131 Apache-2.0 102 26 6 Updated Sep 11, 2024
  • web-data Public
    flashinfer-ai/web-data’s past year of commit activity
    0 Apache-2.0 0 0 0 Updated Sep 2, 2024
  • whl Public

    Pre-built wheels for flashinfer python package.

    flashinfer-ai/whl’s past year of commit activity
    HTML 0 0 0 0 Updated Aug 28, 2024
  • debug-print Public

    Debug print operator for cudagraph debugging

    flashinfer-ai/debug-print’s past year of commit activity
    Cuda 8 0 0 0 Updated Aug 2, 2024
  • llvm-project Public Forked from llvm/llvm-project

    The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

    flashinfer-ai/llvm-project’s past year of commit activity
    0 11,705 0 0 Updated Apr 21, 2024
  • rocMLIR Public Forked from ROCm/rocMLIR
    flashinfer-ai/rocMLIR’s past year of commit activity
    0 40 0 0 Updated Apr 19, 2024
  • candle Public Forked from huggingface/candle

    Minimalist ML framework for Rust

    flashinfer-ai/candle’s past year of commit activity
    Rust 0 Apache-2.0 899 0 0 Updated Mar 7, 2024
  • flashinfer-ai.github.io Public

    Project website of FlashInfer project

    flashinfer-ai/flashinfer-ai.github.io’s past year of commit activity
    SCSS 0 2 0 0 Updated Feb 25, 2024

Top languages

Loading…

Most used topics

Loading…