Official implementation of Annotation-free Audio-Visual Segmentation.
This paper has been accepted by WACV 2024; the project page is https://jinxiang-liu.github.io/anno-free-AVS/.
Create a conda environment and install dependencies:
conda create -n sama python=3.10.11
conda activate sama
pip install -r requirements.txt
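A quick sanity check after installation, assuming `requirements.txt` pulls in PyTorch (which the SAM and VGGish backbones rely on):

```bash
# Verify the environment activates and PyTorch sees the GPU
# (assumes torch is installed via requirements.txt).
python -c "import torch; print(torch.__version__, torch.cuda.is_available())"
```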
- AVSBench
  - Please refer to https://github.com/OpenNLPLab/AVSBench to download the AVSBench dataset.
  - The re-organized split files can be downloaded from the OneDrive link, which might be helpful.
- AVS-Synthetic
  - Please download the dataset from https://zenodo.org/record/8125822 (see the download sketch after this list).
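For AVS-Synthetic, a minimal download sketch might look like the following; the archive name and target path are assumptions, so check the Zenodo record for the actual file list:

```bash
# Hypothetical download sketch -- the archive name below is assumed, not confirmed;
# check https://zenodo.org/record/8125822 for the actual file name(s).
wget https://zenodo.org/record/8125822/files/AVS-Synthetic.zip
unzip AVS-Synthetic.zip -d /path/to/datasets/AVS-Synthetic
```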
After downloading the datasets with their annotations, please specify the directory and file locations in the `configs/sam_avs_adapter.yaml` file.
Model weights: all the weights, including the SAM image backbone, the VGGish audio backbone, and our pretrained models, can be obtained via the OneDrive link.
- Please place `vggish-10086976.pth` and `sam_vit_h_4b8939.pth` in the `assets` sub-folder.
- Please place the pretrained model weights in the `ckpts` sub-folder (see the placement sketch after this list).
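A minimal placement sketch, assuming the weights were saved to a local download folder; the source paths and the pretrained checkpoint name are assumptions, so adjust them to your actual files:

```bash
# Create the expected sub-folders and move the downloaded weights into place.
# The source paths below are assumptions -- adjust them to your download location.
mkdir -p assets ckpts
mv /path/to/downloads/vggish-10086976.pth /path/to/downloads/sam_vit_h_4b8939.pth assets/
mv /path/to/downloads/pretrained_avs_adapter.pth ckpts/  # hypothetical name for our pretrained checkpoint
```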
- Test on the AVS-Synthetic test set: `bash scripts/synthetic_test.sh`
- Test on the AVSBench S4 test set: `bash scripts/s4_test.sh`
- Test on the AVSBench MS3 test set: `bash scripts/ms3_test.sh`
- Train on AVS-Synthetic: `bash scripts/synthetic_train.sh`
- Train on AVSBench S4: `bash scripts/s4_train.sh`
- Train on AVSBench MS3: `bash scripts/ms3_train.sh`
@inproceedings{liu2024annotation,
title={Annotation-free audio-visual segmentation},
author={Liu, Jinxiang and Wang, Yu and Ju, Chen and Ma, Chaofan and Zhang, Ya and Xie, Weidi},
booktitle={Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision},
pages={5604--5614},
year={2024}
}
If you have any questions, feel free to contact jinxliu#sjtu.edu.cn (replace # with @).