📚️ Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks [Paper]
Shengbin Yue, Siyuan Wang, Wei Chen, Xuanjing Huang, and Zhongyu Wei*
SMART is a synergistic multi-agent framework that internalizes complex trajectories for knowledge-intensive tasks via Long- and Short-Trajectory Learning. This general paradigm can be extended to other complex tasks, enabling arbitrary multi-agent frameworks to internalize tailored trajectories.
In this repository, we will release:
- The constructed Trajectory Dataset.
- SMART 7B model.
- Training scripts used to train SMART.
- Evaluation datasets and scripts used in our paper.
- Clone this repository

```bash
git clone https://github.com/yueshengbin/SMART.git
cd SMART
```
- Install packages

```bash
conda create -n smart python=3.10 -y
conda activate smart
pip install --upgrade pip
pip install -r requirements.txt
```

Please use the latest version of vllm.
The Knowledge Retriever is driven by Contriever-MSMARCO and accesses knowledge documents from the official Wikipedia corpus.
Download the preprocessed passage data and the generated passage embeddings, then unpack them (e.g., `gzip -d psgs_w100.tsv.gz` and `tar -xvf wikipedia_embeddings.tar`).

```bash
cd multi_agent
wget https://dl.fbaipublicfiles.com/dpr/wikipedia_split/psgs_w100.tsv.gz
wget https://dl.fbaipublicfiles.com/contriever/embeddings/contriever-msmarco/wikipedia_embeddings.tar
```
You can run document retrieval with the command below.

```bash
cd multi_agent
python passage_retrieval.py \
    --model_name_or_path facebook/contriever-msmarco --passages psgs_w100.tsv \
    --passages_embeddings "wikipedia_embeddings/*" \
    --data YOUR_INPUT_FILE \
    --output_dir YOUR_OUTPUT_FILE \
    --n_docs 25
```
Your input file should be either a `json` or `jsonl` file. In each instance, the `instruction` field is used as the query for retrieval.
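For reference, here is a minimal Python sketch that writes such an input file (the filename and queries are illustrative; it assumes only the `instruction` field is required):

```python
import json

# Illustrative retrieval input: one JSON object per line, where the
# "instruction" field is used as the retrieval query.
examples = [
    {"instruction": "Who wrote the novel One Hundred Years of Solitude?"},
    {"instruction": "When did the Berlin Wall fall?"},
]
with open("my_input.jsonl", "w", encoding="utf-8") as f:
    for ex in examples:
        f.write(json.dumps(ex) + "\n")
```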
You can generate embeddings for your own data by running the following command.

```bash
cd retrieval_lm
for i in {0..3}; do
  export CUDA_VISIBLE_DEVICES=${i}
  python generate_passage_embeddings.py --model_name_or_path facebook/contriever-msmarco \
    --output_dir YOUR_OUTPUT_DIR \
    --passages YOUR_PASSAGE_DATA --shard_id ${i} --num_shards 4 > ./log/nohup.my_embeddings.${i} 2>&1 &
done
```
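Once the background jobs finish, you can sanity-check the shards with a short snippet (a sketch assuming each shard is a pickled `(ids, embeddings)` tuple, as in the Contriever embedding scripts; adjust if your output format differs):

```python
import glob
import pickle

# Load each embedding shard and report how many passages it covers.
for path in sorted(glob.glob("YOUR_OUTPUT_DIR/*")):
    with open(path, "rb") as f:
        ids, embeddings = pickle.load(f)
    print(f"{path}: {len(ids)} passages, embedding shape {embeddings.shape}")
```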
The Trajectory Dataset contains two components: the long-trajectory subset and the short-trajectory subset. Their construction follows two distinct principles:
- Collect long-trajectory data
- Collect short-trajectory data
The code to create the Trajectory Dataset is under `data_creation`; see the instructions in its README.md.
🚀 You can download our dataset on HuggingFace.
Long- and Short-Trajectory Learning optimizes our multi-agent framework in two stages: Short-Trajectory Learning followed by Long-Trajectory Learning. The training code is under `multi_agent`.
Stage 1 uses the short-trajectory subset to train the pre-trained LLM.

```bash
bash script_short_learning.sh
```

Stage 2 uses the long-trajectory subset to train the model obtained after short-trajectory learning.

```bash
bash script_long_learning.sh
```
If you train with LoRA by setting `--use_lora`, please merge the LoRA weights with the original model:

```bash
python merge_lora.py --base BASE_MODEL_NAME \
    --target OUTPUT \
    --lora LORA_NAME
```
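Conceptually, the merge folds the low-rank adapter updates back into the base weights. Below is a minimal peft-based sketch of the same operation (this assumes the adapter was trained with peft; the repo's `merge_lora` script is the authoritative path):

```python
# Sketch of LoRA weight merging with peft; "BASE_MODEL_NAME", "LORA_NAME",
# and "OUTPUT" are placeholders matching the command above.
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base = AutoModelForCausalLM.from_pretrained("BASE_MODEL_NAME")
merged = PeftModel.from_pretrained(base, "LORA_NAME").merge_and_unload()
merged.save_pretrained("OUTPUT")
AutoTokenizer.from_pretrained("BASE_MODEL_NAME").save_pretrained("OUTPUT")
```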
Please see `eval` for details about the evaluation datasets and evaluation scripts.
To improve inference efficiency, we use a static retrieval approach, i.e., we first retrieve all documents and then run inference on the cached results.
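In outline, the static approach decouples the two phases; here is a minimal sketch with hypothetical `retrieve` and `generate` callables (not the repo's actual API):

```python
# Static retrieval: fetch documents for every query up front, then run
# model inference against the cached contexts only.
def run_static_pipeline(queries, retrieve, generate, n_docs=25):
    contexts = {q: retrieve(q, n_docs=n_docs) for q in queries}
    return [generate(q, contexts[q]) for q in queries]
```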
If you find our work useful, please cite our paper:
```bibtex
@article{yue2024synergistic,
  title={Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks},
  author={Yue, Shengbin and Wang, Siyuan and Chen, Wei and Huang, Xuanjing and Wei, Zhongyu},
  journal={arXiv preprint arXiv:2407.09893},
  year={2024}
}
```