This is the official repository of Instruct Large Language Models to Drive like Humans.
Our approach transforms scenario data into textual descriptions and, by setting specific instructions, enables a fine-tuned LLM to generate InstructChain and trajectories that align with human driving behavior. The trajectory is subsequently applied in a simulated environment.
Follow the official documentation to set up the nuPlan dataset.
- Create an environment using Python 3.10
```bash
conda create -n instruct_driver python=3.10
source activate instruct_driver
```
- Follow the official documentation to set up the LLaMA2-Accessory environment.
- Follow the official documentation to set up the nuplan-devkit environment. Make sure to set the following variables correctly (example export commands are shown after this list):
  - `NUPLAN_DATA_ROOT`
  - `NUPLAN_MAPS_ROOT`
  - `NUPLAN_EXP_ROOT`
- Clone the `instruct_driver` repository:
```bash
cd nuplan-devkit
git clone https://github.com/bonbon-rj/InstructDriver.git
```
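For reference, a minimal sketch of the variable setup mentioned above; the paths are placeholders, so point them at your own nuPlan dataset, maps, and experiment directories:
```bash
# Placeholder paths -- replace with your actual nuPlan locations.
export NUPLAN_DATA_ROOT=/path/to/nuplan/dataset
export NUPLAN_MAPS_ROOT=/path/to/nuplan/maps
export NUPLAN_EXP_ROOT=/path/to/nuplan/exp
```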
After setting up the environment, your directory structure should appear as follows:
```
├── LLaMA2-Accessory
├── nuplan-devkit
│   └── instruct_driver
```
This section preprocesses the dataset to enable faster subsequent data retrieval.
The caching pipeline follows the implementation of planTF.
Execute the command below to generate 1M frames of training data in `cache.cache_path`.
You may need to:
- Modify `cache.cache_path` according to your setup.
- Adjust `worker.threads_per_node` based on your RAM and CPU capacity.
Please note that this step is time-intensive and may take dozens of hours to complete.
```bash
export PYTHONPATH=$(pwd)/nuplan-devkit:$PYTHONPATH
export PYTHONPATH=$(pwd)/nuplan-devkit/instruct_driver:$PYTHONPATH

cd ./nuplan-devkit/instruct_driver
python run_cache.py \
    +caching=cache_llm \
    scenario_builder=nuplan \
    cache.cache_path=/path/to/cache_1M \
    cache.cleanup_cache=true \
    scenario_filter=training_scenarios_1M \
    worker.threads_per_node=40
```
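After caching completes, a quick sanity check of the generated cache can save time before moving on; this is only a sketch and assumes the same `cache.cache_path` as in the command above:
```bash
# Rough sanity check: total size and number of files in the cache directory.
du -sh /path/to/cache_1M
find /path/to/cache_1M -type f | wc -l
```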
This section of the code transforms the cached data into a JSON file formatted for LLM training.
Use the command below to process `limit_num` cache entries (modifiable in the code). It converts these entries into a JSON file and saves it at `/path/to/cache_1M/training_json/train.json`:
```bash
cd ./nuplan-devkit/instruct_driver
python cache2json.py \
    +caching=cache_llm \
    cache.cache_path=/path/to/cache_1M
```
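A quick way to verify the output is to load it back and check its size; this is only a sketch and assumes the default output path from the step above:
```bash
# Load the generated training file and report how many entries it contains.
python -c "
import json
with open('/path/to/cache_1M/training_json/train.json') as f:
    data = json.load(f)
print(type(data).__name__, len(data) if hasattr(data, '__len__') else 'n/a')
"
```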
Following the steps outlined above, you will obtain the `train.json` file suitable for LLM training.
For guidance on fine-tuning the model, please consult the official documentation.
After training, populate the following parameters in the `llm_patches/llm_singleton.py` file:
```python
llama_config=''
lora_config=''
tokenizer_path=''
pretrained_path=''
```
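For illustration only, filled-in values might look like the following; every path here is hypothetical and depends on where your LLaMA2-Accessory model config, tokenizer, and fine-tuned checkpoint are stored:
```python
# Hypothetical paths -- substitute the files produced by your own setup and fine-tuning run.
llama_config='/path/to/LLaMA2-Accessory/params.json'
lora_config='/path/to/LLaMA2-Accessory/lora_config.json'
tokenizer_path='/path/to/llama2/tokenizer.model'
pretrained_path='/path/to/finetuned_checkpoint'
```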
Then, execute the command below to initiate the simulation. You can run various types of simulations by modifying the `simulation_type` parameter:
```bash
export PYTHONPATH=$(pwd)/nuplan-devkit:$PYTHONPATH
export PYTHONPATH=$(pwd)/nuplan-devkit/instruct_driver:$PYTHONPATH
export PYTHONPATH=$(pwd)/LLaMA2-Accessory:$PYTHONPATH

cd ./nuplan-devkit/instruct_driver
simulation_type=open_loop_boxes # closed_loop_nonreactive_agents closed_loop_reactive_agents
sh ./script/benchmarks_test14-hard.sh $simulation_type
```
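To run all three benchmark settings back to back, a simple loop over the supported values works (same environment and working directory as above):
```bash
# Run every supported simulation type in sequence.
for simulation_type in open_loop_boxes closed_loop_nonreactive_agents closed_loop_reactive_agents; do
    sh ./script/benchmarks_test14-hard.sh $simulation_type
done
```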
This project builds on nuplan-devkit, LLaMA2-Accessory, and planTF.
```bibtex
@misc{zhang2024instruct,
      title={Instruct Large Language Models to Drive like Humans},
      author={Ruijun Zhang and Xianda Guo and Wenzhao Zheng and Chenming Zhang and Kurt Keutzer and Long Chen},
      year={2024},
      eprint={2406.07296},
      archivePrefix={arXiv},
      primaryClass={cs.RO}
}
```
Note: This code is intended for academic use only; it may not be used for any purpose that could be considered commercial.