This is the code repository for our paper "Attribute-aware Identity-hard Triplet Loss for Video-based Person Re-identification" (https://arxiv.org/pdf/2006.07597.pdf). If you find it helpful for your research, please cite it:
```
@article{chen2020attribute,
  title={Attribute-aware Identity-hard Triplet Loss for Video-based Person Re-identification},
  author={Chen, Zhiyuan and Li, Annan and Jiang, Shilu and Wang, Yunhong},
  journal={arXiv preprint arXiv:2006.07597},
  year={2020}
}
```
This repository contains the first project to introduce pedestrian attribute information into video-based Re-ID. We address the problem with a new metric learning method called Attribute-aware Identity-hard Triplet Loss (AITL), which reduces the intra-class variation among positive samples by taking attribute distance into account. To build a complete video-based person Re-ID model, we also propose a multi-task framework with an Attribute-driven Spatio-Temporal Attention (ASTA) mechanism.
- The batch-hard triplet loss frequently used in video-based person Re-ID suffers from the Distance Variance among Different Positives (DVDP) problem.
- We propose the Attribute-aware Identity-hard Triplet Loss (AITL) to solve DVDP (see the sketch below).
- We introduce the spatio-temporal attention learned in the attribute recognition process into the Re-ID process.
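To make the idea concrete, here is a minimal PyTorch sketch of attribute-aware positive mining inside a batch-hard triplet loss. This is an illustrative reading, not the exact loss implemented in this repository: the helper name, the Euclidean attribute distance, and the mining rule are all assumptions; see the paper for the real formulation.

```python
import torch
import torch.nn.functional as F

def attribute_aware_hard_triplet(feats, attrs, labels, margin=0.3):
    """Illustrative batch-hard triplet loss with attribute-driven positive mining.

    feats:  (B, D) clip features
    attrs:  (B, A) attribute vectors (predicted or ground-truth)
    labels: (B,)   identity labels; assumes a PK sampler, i.e. every
            identity occurs at least twice in the batch
    """
    feat_dist = torch.cdist(feats, feats)  # (B, B) pairwise feature distances
    attr_dist = torch.cdist(attrs, attrs)  # (B, B) pairwise attribute distances

    eye = torch.eye(len(labels), dtype=torch.bool, device=feats.device)
    same_id = labels.unsqueeze(0) == labels.unsqueeze(1)
    pos_mask = same_id & ~eye   # same identity, different sample
    neg_mask = ~same_id         # different identity

    # Mine the hard positive by ATTRIBUTE distance: among samples of the
    # same identity, pick the one whose attributes differ most, and
    # penalise its feature distance, shrinking intra-class variation.
    hard_pos = attr_dist.masked_fill(~pos_mask, float('-inf')).argmax(dim=1)
    d_ap = feat_dist.gather(1, hard_pos.unsqueeze(1)).squeeze(1)

    # Negative mining keeps the standard batch-hard rule in feature space.
    d_an = feat_dist.masked_fill(~neg_mask, float('inf')).min(dim=1).values

    return F.relu(d_ap - d_an + margin).mean()
```

The only change from vanilla batch-hard is the positive-mining line: positives are ranked by attribute discrepancy, so same-identity pairs that differ strongly in pose or accessories are pulled together hardest, which is exactly the intra-class variation behind DVDP.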
It is mainly forked from video-person-reid and reid-strong-baseline. Since I suffer from severe poverty, I introduced nvidia-apex to train the model in FP16 (the usual pattern is sketched after the requirements list), so the training code can be run directly on a single RTX 2070s, which is very friendly to proletarians like me. If you own a 32GB V100 or 2 * GTX 1080Ti cards, you can simply skip the apex operations, run the code on a single card, and increase the batch size to 64; then you can get higher performance :).
Requirements:
- pytorch >= 0.4.1 and < 1.5.0 (apex is not friendly to pytorch 1.5.0 in my experience)
- torchvision >= 0.2.1
- tqdm
- [nvidia-apex](https://github.com/NVIDIA/apex); please follow its detailed install instructions
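For reference, the apex mixed-precision pattern looks like the sketch below. This is the generic apex `amp` API rather than a copy of this repo's training loop; the linear model and random data are stand-ins:

```python
import torch
from apex import amp

model = torch.nn.Linear(128, 10).cuda()                  # stand-in for the real network
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)

# Wrap model and optimizer once before the loop; "O1" is the usual
# mixed-precision mode, which is what makes a single 2070s sufficient.
model, optimizer = amp.initialize(model, optimizer, opt_level="O1")

for step in range(10):                                   # stand-in for the data loader
    x = torch.randn(32, 128).cuda()
    loss = model(x).mean()
    optimizer.zero_grad()
    # Scale the loss so FP16 gradients do not underflow.
    with amp.scale_loss(loss, optimizer) as scaled_loss:
        scaled_loss.backward()
    optimizer.step()
```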
Experiments are conducted on MARS, as it is the largest dataset available to date for video-based person re-ID. Please follow deep-person-reid to prepare the data. The instructions are copied here:
- Create a directory named `mars/`.
- Download the dataset to `mars/` from http://www.liangzheng.com.cn/Project/project_mars.html.
- Extract `bbox_train.zip` and `bbox_test.zip`.
- Download the split information from https://github.com/liangzheng06/MARS-evaluation/tree/master/info and put `info/` in `data/mars` (we want to follow the standard split in [8]).
- Download `mars_attributes.csv` from http://irip.buaa.edu.cn/mars_duke_attributes/index.html and put the file in `data/mars`. The data structure would look like:

```
mars/
    bbox_test/
    bbox_train/
    info/
    mars_attributes.csv
```

- Change the global variables `_C.DATASETS.ROOT_DIR` to `/path2mars/mars` and `_C.DATASETS.NAME` to `mars` in config or configs.
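With the yacs-style configuration inherited from reid-strong-baseline, that last step amounts to editing two lines. The snippet below is a sketch; the exact location of the defaults file in this repository may differ:

```python
# In the yacs-style defaults file (path is an assumption; edit the
# actual config file shipped with this repository):
_C.DATASETS.NAME = 'mars'
_C.DATASETS.ROOT_DIR = '/path2mars/mars'  # directory containing bbox_train/ etc.
```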
- Create a directory named `duke/` under `data/`.
- Download the dataset to `data/duke/` from http://vision.cs.duke.edu/DukeMTMC/data/misc/DukeMTMC-VideoReID.zip.
- Extract `DukeMTMC-VideoReID.zip`.
- Download `duke_attributes.csv` from http://irip.buaa.edu.cn/mars_duke_attributes/index.html and put the file in `data/duke`. The data structure would look like:

```
duke/
    train/
    gallery/
    query/
    duke_attributes.csv
```

- Change the global variables `_C.DATASETS.ROOT_DIR` to `/path2duke/duke` and `_C.DATASETS.NAME` to `duke` in config or configs.
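Before a long training run, it can be worth sanity-checking both layouts. Below is a small hypothetical helper, not part of this repository:

```python
import os

def check_layout(root, expected):
    """Hypothetical helper: verify that the prepared dataset directory
    contains the entries listed in the steps above."""
    missing = [e for e in expected if not os.path.exists(os.path.join(root, e))]
    if missing:
        raise FileNotFoundError(f"{root} is missing {missing}")

check_layout('/path2mars/mars', ['bbox_train', 'bbox_test', 'info', 'mars_attributes.csv'])
check_layout('/path2duke/duke', ['train', 'gallery', 'query', 'duke_attributes.csv'])
```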
To train the model, please run

```
python main_baseline.py
```

Please modify the settings directly in the config files.
The above performance is achieved with 2 * GTX 1080Ti and a train batch size of 64. (Once I was a middle-class deep-network finetuner, back when I was in school.)

Best performance on a lower-end device (MARS, 1 * RTX 2070s, train batch size 32): mAP 82.5%, Rank-1 86.5%. (Now I'm a proletarian. We must fight for the truth!)

More experimental results can be found in the paper.