Code for Unsupervised Variational Translator for Bridging Image Restoration and High-Level Vision Tasks.
Recent research tries to extend image restoration capabilities from human perception to machine perception, thereby enhancing the performance of high-level vision tasks in degraded environments. These methods, primarily based on supervised learning, typically involve retraining either the restoration networks or the high-level vision networks. However, collecting paired data in real-world scenarios and retraining large-scale models are both challenging. To this end, we propose an unsupervised learning method called Variational Translator (VaT), which does not require retraining existing restoration and high-level vision networks. Instead, it establishes a lightweight network that serves as an intermediate bridge between them. Through variational inference, VaT approximates the joint distribution of restoration output and high-level vision input, dividing the optimization objective into preserving content and maximizing the marginal likelihood associated with high-level vision tasks. By leveraging self-training paradigms, VaT achieves this optimization objective without requiring labels. As a result, the translated images closely resemble their original content while also performing well on high-level vision tasks. Extensive experiments in dehazing and low-light enhancement for detection and classification show the superiority of our method over other state-of-the-art unsupervised counterparts, even significantly surpassing supervised methods in some complex real-world scenarios.
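The two-term objective described above (a content-preservation term plus a label-free task term scored against the frozen high-level network's own pseudo-labels) can be sketched roughly as follows. The L1 and cross-entropy choices, the weighting `lam`, and all function names are illustrative assumptions, not the paper's exact formulation:

```python
# Rough, illustrative sketch of the two-term objective:
# (1) keep the translated image close to the restored one (content term);
# (2) make the frozen high-level network confident, scored against its own
#     argmax prediction used as a pseudo-label (self-training term).
# The L1/cross-entropy forms and all names here are assumptions.
import math

def l1_content_loss(translated, restored):
    """Content preservation: mean absolute difference between images."""
    return sum(abs(t - r) for t, r in zip(translated, restored)) / len(restored)

def pseudo_label_loss(logits, pseudo_label):
    """Self-training: negative log-softmax of the pseudo-label class."""
    m = max(logits)  # stabilize the log-sum-exp
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[pseudo_label]

def vat_objective(translated, restored, logits, pseudo_label, lam=1.0):
    """Combined label-free objective: content term + weighted task term."""
    return l1_content_loss(translated, restored) + lam * pseudo_label_loss(logits, pseudo_label)

# Toy example: a 3-pixel "image" and 3-class logits from a frozen classifier.
restored = [0.2, 0.5, 0.8]
translated = [0.25, 0.45, 0.8]
logits = [2.0, 0.1, -1.0]
pseudo_label = logits.index(max(logits))  # argmax prediction as pseudo-label
loss = vat_objective(translated, restored, logits, pseudo_label)
```

Both terms are differentiable with respect to the translator's output, so in the actual training loop they would be minimized jointly while the restoration and high-level networks stay frozen.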
pip install -r requirement.txt
python getNUQ_f.py
python -m visdom.server
python main.py
Training data can be found on Baidu network disk (pw: rhqs). You may need to modify the data paths in the code to match where you place it.
The pre-trained object detection model follows the official implementation exactly and is trained on clean datasets.
Since our method is unpaired, you can merge synthetic low-quality images and real low-quality images into the trainA folder to achieve better real-world detection performance.
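One simple way to do that merge is to copy both image sets into trainA. A minimal sketch, where every path except trainA is a placeholder for your own layout:

```python
# Minimal sketch: copy synthetic and real low-quality images into the
# unpaired trainA folder. All paths except "trainA" are illustrative.
import shutil
from pathlib import Path

def merge_into_trainA(sources, train_a="datasets/trainA"):
    """Copy every .png from each source folder into the trainA folder."""
    dest = Path(train_a)
    dest.mkdir(parents=True, exist_ok=True)
    for src in sources:
        for img in Path(src).glob("*.png"):
            shutil.copy(img, dest / img.name)

# Example call; point the source folders at your own image directories.
merge_into_trainA(["synthetic_low_quality", "real_low_quality"])
```

Adjust the glob pattern if your images are stored in another format (e.g. .jpg).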
The pretrained weight (pw: ggbu) was trained for 17 epochs; continuing training might yield better results.
python val.py
If you are interested in our work, please consider citing the following:
@inproceedings{Wu2025VaT,
  title={Unsupervised Variational Translator for Bridging Image Restoration and High-Level Vision Tasks},
  author={Wu, Jiawei and Jin, Zhi},
  booktitle={European Conference on Computer Vision},
  pages={214--231},
  year={2025},
  organization={Springer}
}