Domain-Agnostically Aligned Optimal Transport for Domain-Adaptive Crowd Counting

News

[Project page] [paper]
An official implementation of "Domain-Agnostically Aligned Optimal Transport for Domain-Adaptive Crowd Counting" (accepted by ACM MM 2023).
We propose a novel domain adaptation method named DAOT, which aligns domain-agnostic factors to bridge the source-target domain gap.

Overview

Our work proposes a domain-adaptive framework for crowd counting based on optimal transport (OT). Training produces two target domain models, 𝑀𝑇0 and 𝑀𝑇. Inference is divided into two stages. In the individual-level measurement stage (stage 1), region information is collected from both the source and target domains. The source domain model 𝑀𝑆 and the initial target domain model 𝑀𝑇0, which is trained with pseudo-labels from 𝑀𝑆, are used for distribution perception, yielding the source domain distribution 𝐷𝑆 and the target domain distribution 𝐷𝑇. A distance matrix is calculated using SSIM to measure the distance between the source and target distributions, and is then extended to form the cost matrix 𝐶. In the domain-level alignment stage (stage 2), we use the Sinkhorn algorithm with iterative updates to obtain the optimal transport matrix 𝑃 and the final simulated distribution. We then fine-tune the initial model 𝑀𝑇0 using the simulated distribution to obtain the target domain model 𝑀𝑇.
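
The two stages above can be sketched in a few lines of NumPy. This is only an illustration, not the repository's code (feature_extract.py and oot2.py may work differently). It assumes a single-window SSIM, a cost of 1 - SSIM, and uniform marginals; the function names and the values of reg and n_iters are hypothetical.

```python
import numpy as np

def global_ssim(x, y, data_range=1.0):
    """Single-window SSIM between two equally shaped arrays (a simplification)."""
    c1 = (0.01 * data_range) ** 2
    c2 = (0.03 * data_range) ** 2
    mu_x, mu_y = x.mean(), y.mean()
    var_x, var_y = x.var(), y.var()
    cov = ((x - mu_x) * (y - mu_y)).mean()
    numerator = (2 * mu_x * mu_y + c1) * (2 * cov + c2)
    denominator = (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator

def ssim_cost_matrix(source, target):
    """Assumed cost: C[i, j] = 1 - SSIM(source[i], target[j])."""
    cost = np.zeros((len(source), len(target)))
    for i, s in enumerate(source):
        for j, t in enumerate(target):
            cost[i, j] = 1.0 - global_ssim(s, t)
    return cost

def sinkhorn(cost, a, b, reg=0.1, n_iters=500):
    """Entropy-regularised transport plan with row marginals a and column marginals b."""
    kernel = np.exp(-cost / reg)
    u = np.ones_like(a)
    for _ in range(n_iters):
        v = b / (kernel.T @ u)   # rescale columns towards b
        u = a / (kernel @ v)     # rescale rows towards a
    return u[:, None] * kernel * v[None, :]

# Toy data standing in for the source and target distributions.
rng = np.random.default_rng(0)
source = rng.random((4, 8, 8))
target = rng.random((3, 8, 8))
C = ssim_cost_matrix(source, target)
P = sinkhorn(C, np.full(4, 1 / 4), np.full(3, 1 / 3))
```

Each row of P sums to the corresponding source marginal and each column to the target marginal, so P[i, j] can be read as how much of source region i is matched to target region j.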

Visualizations

Visualization results in cross-domain settings, including Q2A, A2B, and A2Q.

Environment

We take FIDT as our baseline, so see [FIDT] for the requirements.

Datasets

Download the datasets into the data folder.

Download ShanghaiTech dataset from Baidu-Disk, password: cjnx; or Google-Drive
Download UCF-QNRF dataset from here
Download JHU-CROWD++ dataset from here
Download NWPU-CROWD dataset from Baidu-Disk, password: 3awa; or Google-Drive

Generate Ground-Truth

cd data
Generate the FIDT map: python fidt_generate_xx.py
"xx" is the dataset name: sh, jhu, qnrf, or nwpu. You should change the dataset path to match your setup.

Generate the image file list for training or testing: python make_npydata.py
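
For readers unfamiliar with npy file lists, the idea can be sketched as follows. This is not the repository's make_npydata.py; the function name build_file_list, the flat directory layout, and the accepted extensions are assumptions for illustration only.

```python
import os
import numpy as np

def build_file_list(image_dir, out_path, exts=(".jpg", ".png")):
    """Collect image paths under image_dir and save them as an .npy array."""
    paths = sorted(
        os.path.join(image_dir, name)
        for name in os.listdir(image_dir)
        if name.lower().endswith(exts)
    )
    # A string array can be saved and reloaded with np.load without pickling.
    np.save(out_path, np.array(paths))
    return paths
```

A call such as build_file_list("path/to/images", "train.npy") would write the list, which a dataset loader can later read back with np.load.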

How to train

The model is trained in two stages.

Retrain the model with pseudo labels
- Run python pseudo_generate.py to generate pseudo labels.
- Don't forget to set your custom paths, such as --save_pseudo, in config.py.
- For better performance, you can download the pretrained source model from Baidu-Disk, password: gqqm, or OneDrive.
- Run python train_baseline.py to retrain the model. Before training, make the npy file list as for the other datasets and check the configuration.
Fine-tune the model with source patches selected by OT
- Run slide.py to divide source images and GT into patches.
- Run feature_extract.py to obtain the distributions of the source and target domains.
- Run oot2.py to select the best-aligned patches from the source domain and build the fine-tuning dataset.
- Run python train_baseline.py to fine-tune the model. Before training, make the npy file list as for the other datasets and check the configuration.
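
The patch division and patch selection steps can be pictured with the sketch below. It is not the code in slide.py or oot2.py: the patch size, the stride, the function names, and the selection rule (for each target region, keep the source patch that receives the most transport mass) are all assumptions made for illustration.

```python
import numpy as np

def slide_patches(image, gt_map, patch=256, stride=256):
    """Cut an image and its ground-truth map into aligned square patches.

    Border regions smaller than a full patch are dropped in this sketch.
    """
    height, width = gt_map.shape[:2]
    patches = []
    for top in range(0, height - patch + 1, stride):
        for left in range(0, width - patch + 1, stride):
            window = (slice(top, top + patch), slice(left, left + patch))
            patches.append((image[window], gt_map[window]))
    return patches

def select_source_patches(plan):
    """Return sorted, unique source indices that win at least one target column.

    plan has shape (n_source, n_target); the argmax rule is an assumed criterion.
    """
    return np.unique(plan.argmax(axis=0))
```

With a transport plan of shape (number of source patches, number of target regions), select_source_patches returns the indices of the source patches that would go into the fine-tuning set under this assumed rule.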
