TreeDQN: Learning variable selection rules for combinatorial optimization problems

This repository is the official implementation of TreeDQN: Learning variable selection rules for combinatorial optimization problems.

Setup environment

# pull docker image
docker pull idono/rlbnb:release

# run docker container
docker run -dit --gpus all --shm-size=10g --name rlbnb idono/rlbnb:release /bin/bash

# enter docker container
docker exec -it rlbnb /bin/bash

# work with TreeDQN
git clone https://github.com/dmitrySorokin/treedqn.git
conda activate bb

# work with baseline rl2branch
git clone https://github.com/lascavana/rl2branch.git
conda activate rl2branch

Training

To train the RL agent, run these commands:

# generate validation data
python gen_instances.py --config-name <cfg from configs>

# run training
python main.py --config-name <cfg from configs>

To train the IL agent, run these commands:

# generate training data
python gen_imitation_data.py --config-name <cfg from configs>

# run training
python il_train.py --config-name <cfg from configs>

Evaluation

To evaluate the agent, run:

python eval.py --config-name <cfg from configs> agent.name={agent_name}

Here agent_name is one of: strong, dqn, il, random.
Results are saved to results/{task_name}/{agent_name}.csv.
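
To evaluate every agent in one go, the command above can be generated per agent name. The following is a minimal sketch, not part of the repository: the helper names `build_eval_command` and `result_path` and the config name `my_config` are hypothetical placeholders, and only the `eval.py` invocation and the results path pattern are taken from this README.

```python
import shlex

# Agent names listed in this README.
AGENT_NAMES = ["strong", "dqn", "il", "random"]


def build_eval_command(config_name: str, agent_name: str) -> list[str]:
    """Return the eval.py invocation for one agent as an argument list."""
    return [
        "python",
        "eval.py",
        "--config-name",
        config_name,
        f"agent.name={agent_name}",
    ]


def result_path(task_name: str, agent_name: str) -> str:
    """Return the CSV path where results for this agent are expected."""
    return f"results/{task_name}/{agent_name}.csv"


if __name__ == "__main__":
    config_name = "my_config"  # hypothetical: replace with a config from configs/
    for agent in AGENT_NAMES:
        # Print the shell command; nothing is executed here.
        print(shlex.join(build_eval_command(config_name, agent)))
```

Each argument list can be passed to `subprocess.run(cmd, check=True)` from the repository root to actually run the evaluation.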

Pre-trained Models

Pretrained weights for the IL, TreeDQN, and REINFORCE agents are in the models/ directory.

P-P plots

To plot results, run:

python plot.py results/<task name>
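
For readers unfamiliar with P-P plots: such a plot compares two distributions by plotting one cumulative distribution function against the other. The sketch below illustrates the idea on empirical CDFs of tree sizes. It is an assumption about the general technique, not a description of what plot.py does; the function names and the sample data are hypothetical.

```python
from bisect import bisect_right


def ecdf(sample, x):
    """Fraction of values in `sample` that are <= x."""
    ordered = sorted(sample)
    return bisect_right(ordered, x) / len(ordered)


def pp_points(sample_a, sample_b):
    """Return (F_a(x), F_b(x)) pairs evaluated at every observed value x."""
    thresholds = sorted(set(sample_a) | set(sample_b))
    return [(ecdf(sample_a, x), ecdf(sample_b, x)) for x in thresholds]


if __name__ == "__main__":
    # Hypothetical tree sizes for two agents on the same four instances.
    agent_a = [10, 20, 30, 40]
    agent_b = [20, 40, 60, 80]
    for fa, fb in pp_points(agent_a, agent_b):
        print(f"{fa:.2f}\t{fb:.2f}")
```

With agent A on the horizontal axis, points below the diagonal mean that A solves a larger fraction of instances within any given tree size, that is, A tends to produce smaller trees.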

Results

Geometric mean of tree sizes (lower is better):

Model	Comb. Auct.	Set Cover	Max. Ind. Set	Facility Loc.	Mult. Knap.
Strong Branching	48 $\pm$ 3	43 $\pm$ 2	40 $\pm$ 4	294 $\pm$ 9	700 $\pm$ 10
IL	56 $\pm$ 3	53 $\pm$ 2	42 $\pm$ 5	323 $\pm$ 8	670 $\pm$ 9
TreeDQN	58 $\pm$ 3	56 $\pm$ 2	42 $\pm$ 6	324 $\pm$ 8	290 $\pm$ 6
FMCTS	65 $\pm$ 3	76 $\pm$ 3	96 $\pm$ 8	499 $\pm$ 10	299 $\pm$ 6
tmdp+DFS	93 $\pm$ 3	204 $\pm$ 3	88 $\pm$ 4	521 $\pm$ 10	308 $\pm$ 6
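
The geometric mean is the exponential of the mean of the logarithms, so a few very large trees influence it less than they would an arithmetic mean. A minimal sketch of the computation follows; the function name and the example tree sizes are hypothetical, and the repository's own aggregation code may differ in detail.

```python
import math


def geometric_mean(values):
    """Geometric mean of strictly positive numbers, computed in log space."""
    values = list(values)
    if not values:
        raise ValueError("geometric_mean requires at least one value")
    if any(v <= 0 for v in values):
        raise ValueError("geometric_mean requires strictly positive values")
    return math.exp(sum(math.log(v) for v in values) / len(values))


if __name__ == "__main__":
    # Hypothetical tree sizes, not taken from the table above.
    tree_sizes = [1, 10, 100]
    print(geometric_mean(tree_sizes))
```

For comparison, the arithmetic mean of the same three hypothetical values is 37, while the geometric mean is 10.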

Contributing

Submit a GitHub issue if you have any questions or want to contribute.
