Skip to content

Pull requests: allenai/OLMo

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Docs model ladder
#708 opened Aug 19, 2024 by IanMagnusson Draft
Add OLMoE checkpoints and run config
#707 opened Aug 19, 2024 by 2015aroras Loading…
Extend functionality of Wandb Config Diff script
#687 opened Aug 2, 2024 by kyleclo Loading…
DNM: Loss issue checkpoint with refine1b setups
#682 opened Jul 31, 2024 by undfined Loading…
[wip] Kylel/readme
#681 opened Jul 31, 2024 by kyleclo Draft
Ladder 1xC
#677 opened Jul 27, 2024 by AkshitaB Loading…
Alternative evals
#675 opened Jul 23, 2024 by AkshitaB Loading…
1 task
MoE
#639 opened Jun 30, 2024 by Muennighoff Loading…
muP implementation
#637 opened Jun 28, 2024 by AkshitaB Loading…
Unit tests
#635 opened Jun 26, 2024 by AkshitaB Loading…
Config for Amberish experiments at 1B
#621 opened Jun 12, 2024 by drschwenk Loading…
Normal baselines
#618 opened Jun 12, 2024 by AkshitaB Loading…
added git ref to the config keys
#617 opened Jun 11, 2024 by drschwenk Loading…
Optionally load trainer state
#573 opened May 13, 2024 by Muennighoff Loading…
Reverse weight decay
#567 opened May 3, 2024 by AkshitaB Loading…
1 task done
Add reorder cache for beam search
#526 opened Mar 26, 2024 by cshaib Loading…
Add scripts for Dave
#516 opened Mar 21, 2024 by epwalsh Draft
Scripts for QKV experiments
#510 opened Mar 20, 2024 by AkshitaB Loading…
hf_olmo: support flash attn 2
#471 opened Feb 29, 2024 by wade3han Loading…
integrate mock vision backbone into model
#441 opened Feb 8, 2024 by epwalsh Loading…
DeepSpeed
#384 opened Nov 27, 2023 by Muennighoff Draft
Kebab7
#360 opened Nov 3, 2023 by dirkgr Draft
ProTip! Mix and match filters to narrow down what you’re looking for.