- Environments
- MS-SL++ on TVR
- MS-SL++ on Activitynet
- MS-SL++ on Charades-STA
- Reference
- Acknowledgement
- python 3.8
- pytorch 1.9.0
- torchvision 0.10.0
- tensorboard 2.6.0
- tqdm 4.62.0
- easydict 1.9
- h5py 2.10.0
- cuda 11.1
We used Anaconda to setup a deep learning workspace that supports PyTorch. Run the following script to install the required packages.
conda create --name ms_sl_pp python=3.8
conda activate ms_sl_pp
git clone https://github.com/HuiGuanLab/ms-sl-pp.git
cd ms-sl-pp
pip install -r requirements.txt
conda deactivate
The data can be downloaded from Baidu pan or Google drive. Please refer to here for more description of the dataset. Run the following script to place the data in the specified path.
# download the data of TVR
ROOTPATH=$HOME/VisualSearch
mkdir -p $ROOTPATH && cd $ROOTPATH
unzip tvr.zip -d $ROOTPATH
Run the following script to train MS-SL++
network on TVR. It will save the chechpoint that performs best on the validation set as the final model.
#Add project root to PYTHONPATH (Note that you need to do this each time you start a new session.)
source setup.sh
conda activate ms_sl_pp
ROOTPATH=$HOME/VisualSearch
RUN_ID=runs_0
GPU_DEVICE_ID=0
./do_tvr.sh $RUN_ID $ROOTPATH $GPU_DEVICE_ID
$RUN_ID
is the name of the folder where the model is saved in.
$GPU_DEVICE_ID
is the index of the GPU where we train on.
The model is placed in the directory $ROOTPATH/$DATASET/results/$MODELDIR after training. To evaluate it, please run the following script:
DATASET=tvr
FEATURE=i3d_resnet
ROOTPATH=$HOME/VisualSearch
MODELDIR=tvr-pos-256_cluster-32-2023_10_03_20_26_58
./do_test.sh $DATASET $FEATURE $ROOTPATH $MODELDIR
We also provide the trained checkpoint on TVR, run the following script to evaluate it. The model can also be downloaded from Here.
DATASET=tvr
FEATURE=i3d_resnet
ROOTPATH=$HOME/VisualSearch
MODELDIR=checkpoint_tvr
wget http://8.210.46.84:8787/prvr/checkpoints/ms_slpp_checkpoint_tvr.tar
tar -xvf ms_slpp_checkpoint_tvr.tar -C $ROOTPATH/$DATASET/results
./do_test.sh $DATASET $FEATURE $ROOTPATH $MODELDIR
$DATASET
is the dataset that the model trained and evaluate on.
$FEATURE
is the video feature corresponding to the dataset.
$MODELDIR
is the path of checkpoints saved.
R@1 | R@5 | R@10 | R@100 | SumR | |
---|---|---|---|---|---|
Text-to-Video | 13.6 | 33.1 | 44.2 | 83.5 | 174.5 |
The data can be downloaded from Baidu pan or Google drive. Please refer to here for more description of the dataset. Run the following script to place the data in the specified path.
ROOTPATH=$HOME/VisualSearch
mkdir -p $ROOTPATH && cd $ROOTPATH
unzip activitynet.zip -d $ROOTPATH
Run the following script to train MS-SL++
network on Activitynet.
#Add project root to PYTHONPATH (Note that you need to do this each time you start a new session.)
source setup.sh
conda activate ms_sl_pp
ROOTPATH=$HOME/VisualSearch
RUN_ID=runs_0
GPU_DEVICE_ID=0
./do_activitynet.sh $RUN_ID $ROOTPATH $GPU_DEVICE_ID
The model is placed in the directory $ROOTPATH/$DATASET/results/$MODELDIR after training. To evaluate it, please run the following script:
DATASET=activitynet
FEATURE=i3d
ROOTPATH=$HOME/VisualSearch
MODELDIR=activitynet-runs_0-2022_07_11_20_27_02
./do_test.sh $DATASET $FEATURE $ROOTPATH $MODELDIR
We also provide the trained checkpoint on Activitynet, run the following script to evaluate it. The model can also be downloaded from Here.
DATASET=activitynet
FEATURE=i3d
ROOTPATH=$HOME/VisualSearch
MODELDIR=checkpoint_activitynet
wget http://8.210.46.84:8787/prvr/checkpoints/ms_slpp_checkpoint_activitynet.tar
tar -xvf ms_slpp_checkpoint_activitynet.tar -C $ROOTPATH/$DATASET/results
./do_test.sh $DATASET $FEATURE $ROOTPATH $MODELDIR
R@1 | R@5 | R@10 | R@100 | SumR | |
---|---|---|---|---|---|
Text-to-Video | 7.0 | 23.1 | 35.2 | 75.8 | 141.1 |
The data can be downloaded from Baidu pan or Google drive. Please refer to here for more description of the dataset. Run the following script to place the data in the specified path.
ROOTPATH=$HOME/VisualSearch
mkdir -p $ROOTPATH && cd $ROOTPATH
unzip charades.zip -d $ROOTPATH
Run the following script to train MS-SL++
network on Charades-STA.
#Add project root to PYTHONPATH (Note that you need to do this each time you start a new session.)
source setup.sh
conda activate ms_sl_pp
ROOTPATH=$HOME/VisualSearch
RUN_ID=runs_0
GPU_DEVICE_ID=0
./do_charades.sh $RUN_ID $ROOTPATH $GPU_DEVICE_ID
The model is placed in the directory $ROOTPATH/$DATASET/results/$MODELDIR after training. To evaluate it, please run the following script:
DATASET=charades
FEATURE=i3d_rgb_lgi
ROOTPATH=$HOME/VisualSearch
MODELDIR=charades-runs_0-2022_07_11_20_27_02
./do_test.sh $DATASET $FEATURE $ROOTPATH $MODELDIR
We also provide the trained checkpoint on Charades-STA, run the following script to evaluate it. The model can also be downloaded from Here.
DATASET=charades
FEATURE=i3d_rgb_lgi
ROOTPATH=$HOME/VisualSearch
MODELDIR=checkpoint_charades
wget http://8.210.46.84:8787/prvr/checkpoints/ms_slpp_checkpoint_charades.tar
tar -xvf ms_sl_pp_checkpoint_charades.tar -C $ROOTPATH/$DATASET/results
./do_test.sh $DATASET $FEATURE $ROOTPATH $MODELDIR
R@1 | R@5 | R@10 | R@100 | SumR | |
---|---|---|---|---|---|
Text-to-Video | 1.8 | 7.6 | 12.0 | 48.4 | 69.7 |
#
The codes are modified from TVRetrieval and ReLoCLNet.