-
Notifications
You must be signed in to change notification settings - Fork 2.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Suggestion to reduce Training time for YoloX-S ,YoloX-M & YoloX-L #770
Comments
I updated data workers count + added cache to the training command. yolox/exp/yolox_base.py My training time has reduced to 1 hour 30 mins / epoch. |
I've encountered same problems, too.I'm training custom dataset(5k medical images, much easier and smaller datset comparing to COCO) but my calculation power is much more limited.I wonder if I could increase learning rate and decrease training epochs to achieve an adequate performance. |
@Joker316701882 can you suggest some technique to reduce our training time ? |
What's your data time and train time if your don't change num_workers? @ajtvsv07 |
I am trying to reproduce the results with COCO dataset. Train time : 2 hours 30 mins / epoch |
It's log info on your terminal, also some loss value is logged. Could you please check it? |
@FateScript found it.
|
Your iter time is too long, this might caused by limited computation power or wrong setting of environment. |
Thanks @FateScript @Joker316701882 for your logs. |
Hi All,
I am trying to train YoloX-M model with COCO dataset it takes around 2 hours 30 mins for a single epoch with 6 V100 GPUs.
ETA is shown as approx 30 days.
command used to train model :
python tools/train.py -n yolox-m -d 6 -b 48 --fp16 -o
I tried increasing it data workers & batch size. Training time is not reducing drastically. I can reduce it to 2 hours 20mins
I also tried adding --cache to the train command
python tools/train.py -n yolox-m -d 6 -b 48 --fp16 -o --cache
I do not see a significant reduction in training time.
Any suggestion to reduce it to 10 - 15 mins / epoch.
Note :
yolov5 training time for COCO dataset is 10-15 mins / epoch with 6 V100 GPUs.
Regards,
Arunjeyan TVSV
The text was updated successfully, but these errors were encountered: