-
Notifications
You must be signed in to change notification settings - Fork 246
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Not getting a good accuracy #4
Comments
here is an example, batch size is 64, you can try it! net: "train_val.prototxt" |
@farmingyard I tested on two GPUs, this might cause some problem if the ShuffleChannel layer doesn't support multiple-GPU. I'm not sure tho. I'll try your solver to see. thanks a lot! |
I got 62.8% top1 acc and 84.7% top 5 acc, the result is not good enough with paper's, it still needs tuning... |
mark |
hi @farmingyard i just wonder that how do you write the prototxt? do you code to write? if, can you share it? thanks. |
@zimenglan-sysu-512 |
Hi @farmingyard , Do you finally reach the 65.9% top 1 acc in the paper? I trained with: Yet I only got around 64% acc at the end. I'd be appreciated if you could share with me some tricks in your training process. Thx a lot! |
@LeoLee96 |
Hi @farmingyard ,@LeoLee96 |
hi, @LeoLee96 can you share your pre-trained model. |
hi, @LeoLee96 , when you train shuffle net on two GPUs,you said this might cause some problem beacause the ShuffleChannel layer doesn't support multiple-GPU. how do you solve ? I got "Multi-GPU execution not available - rebuild with USE_NCCL" error, could you give me some advice |
@xiaomr |
@thank you for your advice! I have fixed the problem, it seems that depose layer can support multi gnu, the problem is because I chose the wrong branch of caffe~ |
Hi, @LeoLee96 , do you finally reach the 65.9% val acc ? |
@LeoLee96 Hi, I am a new guy to learn deep learning ,now, I want to use Caffe to train ShuffleNet on my own data ,but just with one .prototxt file I have no idea ,could you give me some direction or advises? |
I can reproduce the paper's accuracy of a 40Mflops shufflenet with tensorflow (https://github.com/tensorpack/tensorpack/tree/master/examples/ImageNetModels#shufflenet). You can use the configuration there as a reference. |
I only get 43% val acc when the epoch is 400000, I use your solver.prototxt and change the deploy.prototxt into train_val.prototxt. Is it not sufficient to train? or the preprocess of data is not true? |
@VectorYYYY |
According to the paper the batch size is 256 on each GPU making a total batch size of 1024. Other settings such as learning rate schedule are also clear so I don't know why would people invent their own settings if the goal is to reproduce the result. |
1080ti can only set batchsize to 64 and I set 4 gpus for training. But I found loss around 2.1 cannot decrease and the model top1 accuracy is around 53% |
According to https://arxiv.org/abs/1706.02677 you can use 1/4 learning rate together with 1/4 batch size and train 4x more steps to get roughly the same results. Besides that, my implementation can actually train a shufflenet 1x with batchsize 128 on a 1080ti, and shufflenet 0.5x with batchsize 256. |
Hi @farmingyard ,
I ran your deploy prototxt on imagenet this weekend yet still got a bad accuracy output. (exactly the same prototxt
I'd be appreciated if you could share your solver file with me to check.
much thanks!
The text was updated successfully, but these errors were encountered: