Gaussian Focal Loss: Learning Distribution Polarized Angle Prediction for Rotated Object Detection in Aerial Images
With the increasing availability of aerial data, object detection in aerial images has attracted growing attention in the remote sensing community. The difficulty lies in accurately predicting the angle of each target when oriented bounding boxes are used to represent arbitrarily oriented objects, because the periodicity of the angle can cause inconsistency between target angle values. To resolve this problem, recent works recast angle prediction from a regression problem as a classification task with circular smooth labels. However, we find that current loss functions applied to binary soft labels need to approximate the soft label value at each position. When summed over all the negative angle categories, these individually insignificant loss values can overwhelm the loss of the target angle category, preventing the network from predicting precise angle information. In this paper, we propose a novel loss function that serves as a more effective alternative for classification-based rotated detectors. By constructing the classification loss with adaptive Gaussian attenuation on the negative locations, our training objective not only avoids discontinuous angle boundaries but also enables the network to obtain more accurate angle predictions with higher responses at the peaks. Moreover, we propose an aspect-ratio-aware factor based on our loss function to make the model more robust when determining the orientation of square-like objects. Extensive experiments on the aerial image datasets DOTA, HRSC2016, and UCAS-AOD demonstrate the effectiveness and superior performance of our approach.
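The idea of attenuating the loss on negative angle bins can be illustrated with a small sketch. It assumes a Gaussian window for the circular smooth label and a CornerNet-style penalty-reduced focal loss for the attenuation; the function names, the 180 angle bins, and the values of `sigma`, `alpha`, and `beta` are illustrative assumptions, not the implementation in this repository or the exact formulation in the paper.

```python
import numpy as np


def circular_smooth_label(angle_bin, num_bins=180, sigma=6.0):
    """Gaussian-windowed label over angle bins (hypothetical helper).

    The label is 1.0 at `angle_bin` and decays with the circular distance,
    so bins on either side of the angle boundary are treated as neighbours.
    """
    bins = np.arange(num_bins)
    diff = np.abs(bins - angle_bin)
    circ = np.minimum(diff, num_bins - diff)  # wrap around the boundary
    return np.exp(-(circ ** 2) / (2.0 * sigma ** 2))


def gaussian_attenuated_focal_loss(prob, label, alpha=2.0, beta=4.0, eps=1e-12):
    """Focal-style loss whose negative term is damped by (1 - label)^beta.

    Assumed form (penalty-reduced focal loss): only the peak bin is a
    positive; bins close to the peak have labels near 1 and therefore
    contribute very little negative loss.
    """
    pos = label == 1.0
    pos_loss = -((1.0 - prob) ** alpha) * np.log(prob + eps) * pos
    neg_loss = -((1.0 - label) ** beta) * (prob ** alpha) * np.log(1.0 - prob + eps) * (~pos)
    return float(pos_loss.sum() + neg_loss.sum())


if __name__ == "__main__":
    label = circular_smooth_label(angle_bin=2)

    # A prediction that copies the soft label everywhere.
    soft_pred = label.copy()
    # A polarized prediction: high response at the peak, low elsewhere.
    sharp_pred = np.full_like(label, 0.01)
    sharp_pred[2] = 0.99

    print("loss (soft prediction): ", gaussian_attenuated_focal_loss(soft_pred, label))
    print("loss (sharp prediction):", gaussian_attenuated_focal_loss(sharp_pred, label))
```

Under these assumptions the polarized prediction receives the lower loss, so the objective does not push the network to reproduce the soft label value at every negative bin.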
Please refer to install.md for installation and dataset preparation.
Please see getting_started.md for the basic usage of GF-CSL.
The repository is still under maintenance. If you encounter a bug, please first update the corresponding code to the current version. If the issue persists, feel free to leave a comment and we will reply as soon as possible.
- 2022.06.08: Uploaded training and evaluation code and pretrained weights on HRSC2016 (see Getting Started for details).
- Pretrained weights on DOTA
| Model | Backbone | MS Test | Training size | Training scales | mAP | Download |
|---|---|---|---|---|---|---|
| GF-CSL | ResNet50 | × | 1024×1024 | [0.5,1.0] | 75.61% | https://drive.google.com/file/d/17Z-0i-ifP_fY58CfoBr8LGBsfLLklm1l/view?usp=sharing |
| | | ✓ | | | 77.54% | |
| GF-CSL | ResNet101 | × | 1024×1024 | [0.5,1.0] | 75.52% | https://drive.google.com/file/d/1NU5ypyioIIpqCFBLT_87eT-_7K-gYzzS/view?usp=sharing |
| | | ✓ | | | 78.34% | |
| GF-CSL | ResNet152 | × | 1024×1024 | [0.5,1.0] | 76.35% | https://drive.google.com/file/d/1GgHAI57HFkhw_an3ONGt9Syttfrjg683/view?usp=sharing |
| | | ✓ | | | 78.12% | |
| GF-CSL | ResNet101 | × | 1024×1024 | [0.5,1.0,1.5] | 76.05% | https://drive.google.com/file/d/1eAz5l-M4IqycL9mW2zegwN6wzVMIdJgM/view?usp=sharing |
| | | ✓ | | | 79.94% | |
- Pretrained weights on HRSC2016
| Model | Backbone | MS Test | Training size | mAP12 | Download |
|---|---|---|---|---|---|
| GF-CSL | ResNet50 | × | 640×640 | 97.00% | https://drive.google.com/file/d/1Nzwp7OHFn2LHVMyQnd11D2i7fe_0ASP4/view?usp=sharing |
| | | ✓ | | 97.94% | |
Note that the performance on HRSC2016 is slightly higher than reported in the paper because we add an angle-branch warm-up technique during training (see Getting Started for details).
Predicted angular distribution
@article{wang2022gaussian,
title={Gaussian Focal Loss: Learning Distribution Polarized Angle Prediction for Rotated Object Detection in Aerial Images},
author={Wang, Jian and Li, Fan and Bi, Haixia},
journal={IEEE Transactions on Geoscience and Remote Sensing},
year={2022},
publisher={IEEE}
}