Tracker image input size vs. Detector image size #602
-
Here, YOLO takes a resized/preprocessed image. Here, the tracker takes the original image. Why is this the case?
Replies: 2 comments
-
The only reason is to give the ReID model high-resolution inputs to start from; its preprocessing then resizes each detection crop to 128x256. I will run an experiment on MOT17 to measure the metric difference between starting from the full-resolution image and from the image rescaled to 1280.
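To make the resolution argument concrete, here is a rough sketch of how much a detection crop has to be upsampled to reach the ReID input size, depending on which image it is cut from. The 1920x1080 frame, the 60x150 pedestrian box, and the helper names are made up for illustration, and I am reading 128x256 as width x height.

```python
# Assumption: the ReID input is 128 wide and 256 tall.
REID_W, REID_H = 128, 256


def crop_size_after_rescale(box_wh, image_wh, target_long_side):
    """Size of a box after the frame is rescaled so that its longer side
    equals target_long_side (aspect ratio preserved)."""
    box_w, box_h = box_wh
    long_side = max(image_wh)
    return (box_w * target_long_side / long_side,
            box_h * target_long_side / long_side)


def upsampling_factor(crop_wh):
    """Per-axis factor needed to stretch a crop to the ReID input size."""
    crop_w, crop_h = crop_wh
    return REID_W / crop_w, REID_H / crop_h


# Hypothetical pedestrian box in a hypothetical 1920x1080 frame.
full = (60, 150)
small = crop_size_after_rescale(full, (1920, 1080), 1280)

print("crop from full frame:", full, "upsampling:", upsampling_factor(full))
print("crop from 1280 frame:", small, "upsampling:", upsampling_factor(small))
```

The crop taken from the rescaled frame has fewer real pixels, so it needs more interpolation to fill the same 128x256 input. That is the information the ReID model loses when it does not start from the original image.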
-
Here are the results: https://github.com/mikel-brostrom/Yolov5_StrongSORT_OSNet/wiki/Effect-of-passing-full-image-input-vs-1280-re-scaled-to-StrongSORT-on-MOT17
IDF1 takes the largest hit by far, which makes sense. This behavior would be further accentuated by lowering the resolution even more in `val.py`. The execution time is barely affected by using the full image, because the detections are resized to 128x256 by the ReID preprocessing before they are fed to the ReID model.
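For anyone wondering how the tracker can crop from the original image when the boxes come from the resized detector input: the boxes have to be mapped back to original-image coordinates first. Below is a minimal sketch of that inverse mapping, assuming a centered letterbox resize (uniform scale plus symmetric padding). The function names and the example sizes are hypothetical; this is not the repository's own code.

```python
def letterbox_params(orig_wh, det_wh):
    """Gain and padding of a centered letterbox resize from orig_wh to det_wh."""
    orig_w, orig_h = orig_wh
    det_w, det_h = det_wh
    gain = min(det_w / orig_w, det_h / orig_h)
    pad_x = (det_w - orig_w * gain) / 2
    pad_y = (det_h - orig_h * gain) / 2
    return gain, pad_x, pad_y


def to_original_coords(box_xyxy, orig_wh, det_wh):
    """Map an (x1, y1, x2, y2) box from detector-input pixels back to
    original-image pixels, clipped to the image bounds."""
    gain, pad_x, pad_y = letterbox_params(orig_wh, det_wh)
    orig_w, orig_h = orig_wh
    x1, y1, x2, y2 = box_xyxy

    def clip(value, upper):
        return max(0.0, min(float(upper), value))

    return (clip((x1 - pad_x) / gain, orig_w),
            clip((y1 - pad_y) / gain, orig_h),
            clip((x2 - pad_x) / gain, orig_w),
            clip((y2 - pad_y) / gain, orig_h))


# Hypothetical example: 1920x1080 frame letterboxed into a 1280x1280 input.
box = to_original_coords((400, 480, 440, 580), (1920, 1080), (1280, 1280))
print(box)
```

Once the boxes are in original coordinates, each crop is cut from the full frame and resized to 128x256. The ReID model therefore sees the same input size per detection either way, which is consistent with the runtime barely changing.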