    Repositories list

    • lmms-eval

      Public
      Accelerating the development of large multimodal models (LMMs) with a one-click evaluation module: lmms-eval.
      Python
      Other
      169000 Updated Dec 17, 2024
    • Ongoing research training transformer models at scale
      Python
      Other
      2.4k001 Updated Dec 17, 2024
    • Python
      42600 Updated Dec 12, 2024
    • HFYN

      Public
      Hopfield-Fenchel-Young Networks: A Unified Framework for Associative Memory Retrieval
      Jupyter Notebook
      MIT License
      0000 Updated Dec 3, 2024
    • A package for sampling from Gibbs distributions during inference with LLMs.
      Python
      Apache License 2.0
      1610 Updated Nov 25, 2024
    • doce

      Public
      This is the repo of DOCE
      Jupyter Notebook
      Apache License 2.0
      0200 Updated Nov 22, 2024
    • Jupyter Notebook
      0100 Updated Oct 15, 2024
    • Python
      0000 Updated Oct 10, 2024
    • axolotl

      Public
      Go ahead and axolotl questions
      Python
      Apache License 2.0
      893000 Updated Sep 26, 2024
    • nanotron

      Public
      Minimalistic large language model 3D-parallelism training
      Python
      Apache License 2.0
      132000 Updated Sep 19, 2024
    • Python
      76427 Updated Aug 29, 2024
    • DeepSPIN's submission to SIGMORPHON 2020
      Python
      MIT License
      1511 Updated Jul 25, 2024
    • Ongoing research training transformer language models at scale, including: BERT & GPT-2
      Python
      Other
      2.4k102 Updated Jul 12, 2024
    • SSHN

      Public
      Sparse and Structured Hopfield Networks
      Python
      MIT License
      0200 Updated Jul 4, 2024
    • entmax

      Public
      The entmax mapping and its loss, a family of sparse softmax alternatives.
      Python
      MIT License
      44418102 Updated Jun 22, 2024
    • COMET

      Public
      A Neural Framework for MT Evaluation
      Python
      Apache License 2.0
      82000 Updated Jun 11, 2024
    • robust-mt

      Public
      0000 Updated Mar 6, 2024
    • Repository for SPECTRA: Sparse Structured Text Rationalization, accepted at EMNLP 2021 main conference.
      Python
      MIT License
      21010 Updated Feb 14, 2024
    • 31811 Updated Jan 16, 2024
    • Code for alignment for the towerllm project.
      Python
      Apache License 2.0
      418100 Updated Nov 29, 2023
    • LP-SparseMAP: Differentiable sparse structured prediction in coarse factor graphs
      C++
      MIT License
      84131 Updated Nov 20, 2023
    • Jupyter Notebook
      MIT License
      3800 Updated Nov 10, 2023
    • Shell
      0300 Updated Oct 17, 2023
    • Jupyter Notebook
      2800 Updated Oct 9, 2023
    • Repository for "BLEU Meets COMET: Combining Lexical and Neural Metrics Towards Robust Machine Translation Evaluation", accepted at EAMT 2023.
      Jupyter Notebook
      Apache License 2.0
      01800 Updated Jul 19, 2023
    • Code and data for the paper "Disentangling Uncertainty in Machine Translation Evaluation", accepted at EMNLP 2022.
      Python
      Apache License 2.0
      32300 Updated Jun 23, 2023
    • Shell
      01700 Updated Jun 13, 2023
    • crest

      Public
      Code for CREST: A Joint Framework for Rationalization and Counterfactual Text Generation, accepted at ACL 2023.
      Python
      MIT License
      1800 Updated May 29, 2023
    • Python
      1500 Updated May 28, 2023
    • This repository provides open-source code for sparse continuous distributions and corresponding Fenchel-Young losses.
      Python
      MIT License
      41601 Updated May 10, 2023