This is an official implementation for "Contextual Transformer Networks for Visual Recognition".
imagenet image-classification object-detection semantic-segmentation instance-segmentation mscoco mask-rcnn cotnet vision-transformer contextual-transformer
-
Updated
Aug 8, 2021 - Python