This repository implements a baseline method for OWTAL. It is not the complete version yet.
We use the 2048-d features provided by the MM 2021 paper "Cross-modal Consensus Network for Weakly Supervised Temporal Action Localization". You can download the dataset from Google Drive or Baidu Disk. The annotations are included in this package.
Pre-trained models can be downloaded from Google Drive. Unzip them and place them in the directory './ckpt/'.
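Before running the test scripts, it can help to confirm that the checkpoints ended up where they are expected. The following is a minimal sketch, not part of this repository: the function name `check_ckpt_dir` is hypothetical, and it only checks that the directory exists and is non-empty, since the checkpoint file names are not listed here.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of this repo): verify that the checkpoint
# directory exists and contains at least one entry.
check_ckpt_dir() {
    local dir="${1:-./ckpt}"   # default matches the directory named above
    if [ ! -d "$dir" ]; then
        echo "missing directory: $dir" >&2
        return 1
    fi
    # ls -A lists all entries except . and ..; empty output means empty dir
    if [ -z "$(ls -A "$dir")" ]; then
        echo "empty directory: $dir" >&2
        return 1
    fi
    echo "checkpoints present in $dir"
}
```

Calling `check_ckpt_dir` with no argument checks './ckpt' relative to the current directory; pass another path to check a different location.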
To test the pre-trained models, run the script for the split you want (0, 1, or 2):
cd scripts
bash test_split0.sh   # or test_split1.sh / test_split2.sh
To train from scratch, run the script for the split you want (0, 1, or 2):
cd scripts
bash train_split0.sh   # or train_split1.sh / train_split2.sh
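To run all three splits in one go, a small loop is enough. The sketch below is an assumption-laden convenience, not something shipped with this repository: `run_all_splits` is a hypothetical name, and it assumes the scripts are named `<mode>_split0.sh`, `<mode>_split1.sh`, and `<mode>_split2.sh` and live in the current directory.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of this repo): run the test or train script
# for every split in order, stopping at the first failure.
run_all_splits() {
    local mode="$1"   # "test" or "train"
    local split script
    for split in 0 1 2; do
        script="${mode}_split${split}.sh"
        if [ ! -f "$script" ]; then
            echo "missing script: $script" >&2
            return 1
        fi
        echo "running $script"
        bash "$script" || return 1
    done
}

# Usage, from inside the scripts directory:
#   run_all_splits test
#   run_all_splits train
```

Stopping at the first failing split keeps a broken run from silently continuing; remove the `|| return 1` if you would rather attempt every split regardless.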