LoCoNet: Long-Short Context Network for Active Speaker Detection (2023 CVPR)

DongKeon/LoCoNet-ASD


LoCoNet

Code Usage

  1. Download the audio-visual features and the annotation CSV files from Google Drive. The directory layout should look as follows:
|-- features
    |-- resnet18-tsm-aug
        |-- train_forward
        |-- val_forward
    |-- resnet50-tsm-aug
        |-- train_forward
        |-- val_forward
|-- csv_files
    |-- ava_activespeaker_train.csv
    |-- ava_activespeaker_val.csv
  2. Training / Validation
bash run.sh conf/LoCoNet/ResNet18/large [gpu_id] 1
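
Before launching training, it may help to confirm that the layout from step 1 is in place. The following is a minimal sketch, not part of the repository: it assumes `features` and `csv_files` sit under a common root directory (the README does not say where they must live), and the helper name `find_missing_paths` is hypothetical.

```python
from pathlib import Path

# Expected layout, copied from step 1 of this README.
EXPECTED_DIRS = [
    "features/resnet18-tsm-aug/train_forward",
    "features/resnet18-tsm-aug/val_forward",
    "features/resnet50-tsm-aug/train_forward",
    "features/resnet50-tsm-aug/val_forward",
]
EXPECTED_FILES = [
    "csv_files/ava_activespeaker_train.csv",
    "csv_files/ava_activespeaker_val.csv",
]


def find_missing_paths(root):
    """Return the expected paths (relative to root) that are absent.

    `root` is an assumed common parent of `features` and `csv_files`.
    """
    root = Path(root)
    # Feature folders must exist as directories.
    missing = [d for d in EXPECTED_DIRS if not (root / d).is_dir()]
    # Annotation files must exist as regular files.
    missing += [f for f in EXPECTED_FILES if not (root / f).is_file()]
    return missing
```

Calling `find_missing_paths(".")` from the assumed data root returns an empty list when everything from step 1 is present, and otherwise lists the relative paths that still need to be downloaded.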

Note

  • This repository contains code for training only the LoCoNet backend, using the pre-trained features provided by SPELL.
