Excuse me, to run your project, what are the requirements for computer configuration (graphics card, memory) #9

xjyp · 2023-04-09T00:53:03Z

No description provided.

chensong1995 · 2023-04-09T01:00:25Z

Hello xjyp,

Thanks for your question! In our experiments, we train E-CIR using three Tesla V100-SXM2-32GB GPUs for approximately 100 hours (50 epochs). The batch_size is set to 96. With a smaller batch size, you should be able to train the model on most commercial GPUs. In my experience, you start to get an overall acceptable performance way before reaching the 50th epoch. I hope this helps! Let me know if you have further concerns.

xjyp · 2023-04-09T01:13:29Z

Hello chensong1995，

Thank you for your answer,

I currently only have one RTX 3080 (10GB) * 1 GPU with 40G of memory. Can I train?
I see that you are using 3 GPUs. Are you using distributed training?

chensong1995 · 2023-04-09T01:19:03Z

Thanks for the follow-up!

RTX 3080 (10GB) should be good enough for training if you shrink the batch size. I encourage you to also check out our latest work DeblurSR, which has lower computational requirements. With a batch size of 36, DeblurSR only needs about 72 hours and two Tesla V100-SXM2-32GB GPUs for 50 epochs.
We use the DataParallel wrapper from PyTorch instead of DistributedDataParallel mainly because of its simplicity.

I hope this helps! Let me know if you have further concerns.

xjyp · 2023-04-09T01:26:18Z

Okay, thank you very much for your patient answer, I will pay close attention to your work.

chensong1995 · 2023-04-09T03:37:01Z

Thanks for your follow-up! It really depends on what your goal is. Are you searching for an event-based motion deblurring model as an inference model in your application? Are you trying to develop another model that improves E-CIR? I will be in a better position to offer help if you can fill me in with more specifics. If you are hesitant to share the details of your project in public, you can also send me an email privately.

chensong1995 · 2023-04-09T03:47:59Z

Thanks for your reply! If you are developing a follow-up work in event-based motion deblurring, you do not have to run the entire training code. My suggestion is to download only train_0.hdf5, which will allow you to have an overall idea of how each component of our code works. If you change the number 16 to 0 on this line, the program will only load train_0.hdf5 as the training data. You may want to download val_0.hdf5 and val_1.hdf5 as well since they allow you to evaluate the model on the testing split. I hope this helps! Let me know if you need further assistance.

xjyp · 2023-04-09T06:19:34Z

thank you very much

chensong1995 added the question Further information is requested label Apr 9, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Excuse me, to run your project, what are the requirements for computer configuration (graphics card, memory) #9

Excuse me, to run your project, what are the requirements for computer configuration (graphics card, memory) #9

xjyp commented Apr 9, 2023

chensong1995 commented Apr 9, 2023

xjyp commented Apr 9, 2023

chensong1995 commented Apr 9, 2023

xjyp commented Apr 9, 2023

chensong1995 commented Apr 9, 2023

chensong1995 commented Apr 9, 2023

xjyp commented Apr 9, 2023

Excuse me, to run your project, what are the requirements for computer configuration (graphics card, memory) #9

Excuse me, to run your project, what are the requirements for computer configuration (graphics card, memory) #9

Comments

xjyp commented Apr 9, 2023

chensong1995 commented Apr 9, 2023

xjyp commented Apr 9, 2023

chensong1995 commented Apr 9, 2023

xjyp commented Apr 9, 2023

chensong1995 commented Apr 9, 2023

chensong1995 commented Apr 9, 2023

xjyp commented Apr 9, 2023