Pinned

  1. OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 4.6k 476

  2. dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 990 108

  3. scispacy Public

    A full spaCy pipeline and models for scientific/biomedical documents.

    Python 1.7k 229

  4. ai2thor Public

    An open-source platform for Visual AI.

    C# 1.2k 217

Repositories

Showing 10 of 481 repositories
  • OLMo-core Public

    PyTorch building blocks for OLMo

    Python 18 Apache-2.0 4 0 7 Updated Nov 19, 2024
  • dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 990 Apache-2.0 108 23 9 Updated Nov 19, 2024
  • OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook 456 Apache-2.0 35 5 0 Updated Nov 19, 2024
  • rslearn Public

    A tool for developing remote sensing datasets and models.

    Python 11 Apache-2.0 0 9 3 Updated Nov 19, 2024
  • OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 4,646 Apache-2.0 476 48 50 Updated Nov 19, 2024
  • open-instruct Public
    Python 1,272 Apache-2.0 172 11 23 Updated Nov 18, 2024
  • regmixer Public
    Python 0 0 0 2 Updated Nov 18, 2024
  • wimbd Public

    What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets

    Python 190 Apache-2.0 20 0 1 Updated Nov 16, 2024
  • ir_datasets Public

    Provides a common interface to many IR ranking datasets.

    Python 323 Apache-2.0 43 73 9 Updated Nov 15, 2024
  • marg-reviewer Public

    Code/data for MARG (multi-agent review generation)

    Python 33 Apache-2.0 2 1 0 Updated Nov 14, 2024