
Caffe only needs 8 min to train MNIST to 0.99 accuracy; why does MXNet need over 50 min? #1036

Closed
dushoufu opened this issue Dec 23, 2015 · 5 comments

Comments

@dushoufu

I ran Caffe and MXNet separately to train MNIST on the same computer in CPU mode. The result is that Caffe is about three times faster than MXNet.
As far as I can tell, the model and the environment are the same, except that Caffe's input is LMDB. Both cases were run directly from the provided examples. Caffe uses about 700% CPU, while MXNet uses about 500%.
With MXNet, no single core goes above 30% utilization. What can I check to speed MXNet up?
MXNet:
INFO:root:Epoch[19] Time cost=146.922
INFO:root:Epoch[19] Validation-accuracy=0.990100

real 52m37.068s
user 87m37.811s
sys 205m59.149s

Caffe:
I1223 14:27:11.260304 12600 solver.cpp:408] Test net output #0: accuracy = 0.9909
I1223 14:27:11.260419 12600 solver.cpp:408] Test net output #1: loss = 0.0274076 (* 1 = 0.0274076 loss)
I1223 14:27:11.260432 12600 solver.cpp:325] Optimization Done.
I1223 14:27:11.260442 12600 caffe.cpp:215] Optimization Done.

real 8m28.679s
user 23m31.146s
sys 43m21.118s

@dushoufu dushoufu changed the title caffe only need 8min for training mnist with accuracy of 0.99,why mxnet need over 30min for training? caffe only need 8min for training mnist with accuracy of 0.99,why mxnet need over 50min for training? Dec 23, 2015
@dushoufu
Author

I'm sure the MXNet team has done detailed benchmarking against Caffe, so I believe something is wrong or abnormal in my setup. Could someone give me some guidance?

@lukemetz
Contributor

Relevant: #1031

@piiswrong
Contributor

We mostly focus on GPU performance, so the CPU path is not optimized. But there is an easy fix that should improve performance a lot: check mshadow/tensor_cpu-inl.h:139. Currently that loop is not multithreaded. You can add multithreading with OpenMP on the outer loop. This should be easy, and it should cover most ops.
If you can make this work, you are welcome to contribute a PR.
Thanks.
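
For reference, here is a minimal sketch of the kind of change being suggested: a 2-D element-wise map loop parallelized over the outer dimension with an OpenMP pragma. The function name, signature, and types below are illustrative only and are not the actual mshadow code at tensor_cpu-inl.h:139; in mshadow the pragma would go on the existing outer loop of the map routine.

```cpp
// Sketch only: a generic 2-D element-wise map with the outer loop
// parallelized via OpenMP. Names/signatures are hypothetical, not mshadow's.
#include <cstddef>
#include <vector>

// dst[y][x] = op(src[y][x]); rows are contiguous in memory.
template <typename DType, typename Op>
void MapPlan2D(DType *dst, const DType *src,
               std::ptrdiff_t rows, std::ptrdiff_t cols, Op op) {
  // Parallelize the outer loop only: each thread writes whole rows,
  // so there are no write conflicts and the inner loop can still vectorize.
  #pragma omp parallel for schedule(static)
  for (std::ptrdiff_t y = 0; y < rows; ++y) {
    for (std::ptrdiff_t x = 0; x < cols; ++x) {
      dst[y * cols + x] = op(src[y * cols + x]);
    }
  }
}

int main() {
  const std::ptrdiff_t rows = 1024, cols = 1024;
  std::vector<float> src(rows * cols, 1.0f), dst(rows * cols);
  MapPlan2D(dst.data(), src.data(), rows, cols,
            [](float v) { return v * 2.0f + 1.0f; });
  return dst[0] == 3.0f ? 0 : 1;  // trivial sanity check
}
```

Compile with something like `g++ -std=c++11 -O2 -fopenmp sketch.cpp`; without `-fopenmp` the pragma is ignored and the loop runs serially, which matches the current single-threaded behaviour described above.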

@wangzhangup

@piiswrong here is a comparison between MXNet and Caffe. Both use exactly the same model and parameters and were run on the CPU of the same machine, but MXNet's prediction time is about 4 times slower than Caffe's. By the way, Caffe uses a single CPU thread, and both are built with ATLAS.

@dushoufu
Author

I've finished it.
