Transformer-based method with Deformable Convolution and Central Difference Convolution for Object Detection
This study aims to enhance the detection capability of the real-time detection transformer (RT-DETR) by incorporating two adapter modules: a deformable convolutional network (DeformConv) adapter and a central difference convolution (CDC) adapter. These modules are integrated into the backbone and Transformer network of RT-DETR to improve the model's ability to accurately locate and classify objects.
To evaluate the effectiveness of the proposed adapter modules, comprehensive experiments were conducted on two benchmark datasets: the NEU-DET steel plate crack dataset and the COCO dataset. On NEU-DET, compared to RT-DETR, the DeformConv adapter achieved a significant 1% improvement in mean average precision (mAP) for medium-sized defects and a 0.1% improvement in average recall (AR) for large-sized defects, highlighting its ability to capture defects with complex shapes. On COCO, the CDC adapter yielded a 0.1% mAP gain and a 0.5% AR improvement for medium-sized objects, demonstrating its effectiveness at extracting fine-grained details and distinguishing objects from the background. In summary, both the DeformConv and CDC adapter modules can enhance the object detection capability of RT-DETR in different application scenarios: DeformConv effectively captures object shape variations for complex-shaped object detection, while CDC distinguishes target objects from the background in situations where objects are obscured or the background is cluttered.
Install
pip install -r requirements.txt
Adapter
- To use central difference convolution at s5, replace `Adapter` with `CDCadapter`.
- To use deformable convolution at s5, replace `Adapter` with `Deformadapter`.
- To use the original RT-DETR, keep `Adapter` unchanged.
- Modify the config accordingly: `Adapter`, `CDCadapter`, `Deformadapter`.
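As a rough illustration of what the CDC adapter computes, the sketch below shows a minimal central difference convolution block in PyTorch. This is a hypothetical re-implementation for clarity, not the repository's actual module: the class name `CDCAdapter`, the residual connection, and the default `theta` mixing factor are assumptions; the central-difference term follows the common trick of subtracting a 1x1 convolution whose kernel is the spatial sum of the 3x3 weights.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CDCAdapter(nn.Module):
    """Sketch of a central difference convolution adapter (hypothetical).

    Mixes a vanilla 3x3 convolution with a central-difference term
    weighted by `theta`, then adds a residual connection.
    """

    def __init__(self, channels: int, theta: float = 0.7):
        super().__init__()
        self.conv = nn.Conv2d(channels, channels, kernel_size=3,
                              padding=1, bias=False)
        self.theta = theta

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        out = self.conv(x)
        if self.theta > 0:
            # Summing each 3x3 kernel spatially gives a 1x1 kernel;
            # convolving with it subtracts the center-weighted response,
            # which yields the central-difference behavior.
            kernel_sum = self.conv.weight.sum(dim=(2, 3), keepdim=True)
            out = out - self.theta * F.conv2d(x, kernel_sum)
        return x + out  # residual adapter connection
```

With `theta=0` the block reduces to a plain residual convolution, which is why `theta` can be tuned per deployment to trade gradient-like (edge-sensitive) features against standard intensity features.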
Data
- Download and extract COCO 2017 train and val images.
path/to/coco/
annotations/ # annotation json files
train2017/ # train images
val2017/ # val images
- Modify the config: `img_folder`, `ann_file`
Training & Evaluation
- Training on a Single GPU:
# training on single-gpu
export CUDA_VISIBLE_DEVICES=0
python tools/train.py -c configs/rtdetr/rtdetr_r50vd_6x_coco.yml
- Training on Multiple GPUs:
# train on multi-gpu
export CUDA_VISIBLE_DEVICES=0,1,2,3
torchrun --nproc_per_node=4 tools/train.py -c configs/rtdetr/rtdetr_r50vd_6x_coco.yml
- Evaluation on Multiple GPUs:
# val on multi-gpu
export CUDA_VISIBLE_DEVICES=0,1,2,3
torchrun --nproc_per_node=4 tools/train.py -c configs/rtdetr/rtdetr_r50vd_6x_coco.yml -r path/to/checkpoint --test-only
Export
python tools/export_onnx.py -c configs/rtdetr/rtdetr_r18vd_6x_coco.yml -r path/to/checkpoint --check