This includes the starter code for the LLM-Merging competition.
The library was tested on CUDA 10.1 on an A6000.
conda env create -f environment.yml --name llm-merging
conda activate llm-merging
export PYTHONPATH=`pwd`
Authentication tokens are requied for certain models like Llama2 which require user agreeing to some terms. The authentication token can be found here
export HF_AUTH_TOKEN=""
Do not modify any files other than the new file created and setup.py
. Doing so can result in the grounds for invalidating your submission. If there is any code in the other files you need to change, feel free to open a pull request to change it.
-
To add a new merging method, create a new file in
llm_merging/merging
. This file should implement__init__.py
andmerge.py
functions and extendllm_merging/merging/Merges
. Seellm_merging/merging/FlanT5Avg.py
orllm_merging/merging/LlamaAvg.py
for examples. -
Modify
setup.py
and add an entry with the merging method inllm_merging.merging.Merges
.For example, the entry
llama_avg = llm_merging.merging.LlamaAvg:LlamaAvg
indicates the method is calledllama_avg
and the file is atllm_merging/merging/LlamaAvg
python llm_merging/setup.py install
python llm_merging/main.py -m {merging_method}
After modifying the file, zip the file in a tarball and send to ``.