Skip to content
Change the repository type filter

All

    Repositories list

    • lmms-eval

      Public
      Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
      Python
      Other
      1982.1k1754Updated Feb 7, 2025Feb 7, 2025
    • VideoMMMU

      Public
      Python
      Other
      12011Updated Feb 7, 2025Feb 7, 2025
    • A fork to add multimodal model training to open-r1
      Python
      Apache License 2.0
      2443790Updated Feb 7, 2025Feb 7, 2025
    • Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.
      Python
      Other
      510400Updated Jan 24, 2025Jan 24, 2025
    • .github

      Public
      0100Updated Dec 11, 2024Dec 11, 2024
    • LongVA

      Public
      Long Context Transfer from Language to Vision
      Python
      Apache License 2.0
      19359270Updated Nov 20, 2024Nov 20, 2024
    • demos

      Public
      Python
      0000Updated Sep 18, 2024Sep 18, 2024
    • sglang

      Public
      SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
      Python
      Apache License 2.0
      858400Updated Sep 18, 2024Sep 18, 2024
    • Otter

      Public
      🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
      Python
      MIT License
      2153.2k612Updated Mar 5, 2024Mar 5, 2024
    • Relate Anything Model is capable of taking an image as input and utilizing SAM to identify the corresponding mask within the image.
      Python
      Apache License 2.0
      2144860Updated Jul 4, 2023Jul 4, 2023