Skip to content

Commit

Permalink
Add two data-centric papers for LLMs (#226)
Browse files Browse the repository at this point in the history
* Update awesome_llm_data.md

* Update awesome_llm_data.md
  • Loading branch information
zhenqincn authored Mar 7, 2024
1 parent e2238e5 commit 475c52b
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions docs/awesome_llm_data.md
Original file line number Diff line number Diff line change
Expand Up @@ -115,3 +115,5 @@ Data Management
| Dynamics of Instruction Tuning: Each Ability of Large Language Models Has Its Own Growth Pace | `arXiv'2310`, `Data_Usage_FineTune`, `Data_Domain_Text`, `Data_Processing_Enhancement`, `Data_Generalization`, `Data_Diversity` |
| How abilities in large language models are affected by supervised fine-tuning data composition | `arXiv'2310`, `Data_Usage_FineTune`, `Data_Domain_Text`, `Data_Scaling`, `Data_Generalization`, `Data_Processing_Selection` |
| Scaling Relationship on Learning Mathematical Reasoning with Large Language Models | `arXiv'2308`, `Data_Usage_FineTune`, `Data_Domain_Text`, `Data_Processing_Deduplication`, `Data_Quantity`, `Data_Generalization` |
| Data-Centric Financial Large Language Models | `arXiv'2310`, `Data_Usage_FineTune`, `Data_Domain_Text`, `Data_Quantity`, `Data_Processing_Enhancement` |
| Ziya2: Data-centric Learning is All LLMs Need | `arXiv 2311`, `Data_Usage_Pretrain`, `Data_Domain_Text`, `Data_Domain_Code`, `Data_Quality`

0 comments on commit 475c52b

Please sign in to comment.