-
-
Notifications
You must be signed in to change notification settings - Fork 410
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question about network updating #25
Comments
Hi, @zen-d. Thanks for your question. We will check on this soon. By the way, do they(mem_bank and query_interact) receive gradients and function normally when you unset this modification? |
Hi @zen-d, could you please tell me which config you are experimenting with? The 1-track-map config or the 2-e2e config? |
@YTEP-ZHI I use the 1-track-map config. Sorry, I will update my findings: "memory_bank and query_interact do not receive gradients" does not depend on temporal fusion or not. That is to say, even the original repo without any modification might have such a problem. |
Hi @zen-d, thanks for pointing out this question. Let me explain the error in detail. The error of "not receiving gradients" is correlated with the setting: So where did those unused parameters come from in UniAD?
|
@YTEP-ZHI Thanks for your detailed answer. But I have already tried the following two operations. I think I have sidestepped the two things respectively in your last reply.
prev_img, prev_img_metas = None, None Still, PyTorch DDP training shows the error that mem_bank and query_interact receive no grad. |
To complement, the second point is inherited from BEVFormer, but to the best of my knowledge, BEVFormer does not have such an issue. Correct me if I miss anything. |
Yes, it's inherited from BEVFormer. It's strange that a lot of people have used this repo to train models, without encoutering the gradient problem. I'll check on this. If you have any updated information, please report them in this thread. Thanks. |
Overall, look forward to your in-depth checked results and potential fix soon. Thanks. |
@zen-d |
Hi, @zen-d. The gradient problem, that the
However, I think it's acceptable to set Moreover, our trackformer is modified from MOTR. You can find that they also set |
The third issue is resolved as mentioned here: #21 (comment), and the performance of stage1 model could be reproduced when trained from scratch. It's actually nothinig to do with the gradient issue. Still, thanks for your feedback. |
I'm closing this issue as it's resolved, feel free to reopen it if needed @zen-d. |
@YTEP-ZHI Hello, thanks for your great work. When I set https://github.com/OpenDriveLab/UniAD/blob/main/projects/mmdet3d_plugin/uniad/detectors/uniad_track.py#L545-L546 to
I find that
memory_bank
andquery_interact
do not receive gradients. It is a bit hard for me to understand, could you please explain that? What confuses me more is that MUTR3D does not use temporal feature fusion and could run without such problems.The text was updated successfully, but these errors were encountered: