Add inception-resnet-v2 model #64

lim0606 · 2016-06-09T11:31:17Z

For personal interests, I and my friend Sunghun Kang (shuni@kaist.ac.kr) trained inception-resnet-v2 (http://arxiv.org/abs/1602.07261) from scratch based on torch, esp. facebook's training scripts (https://github.com/facebook/fb.resnet.torch).

This PR might not be a proper one for this repository, but I’d like to share this for someone who may be interested in this model too.

See, https://github.com/lim0606/torch-inception-resnet-v2 for more details about the trained model.

colesbury · 2016-06-10T20:40:28Z

Cool!

colesbury · 2016-06-10T20:44:56Z

Regarding momentum, I think we match the Caffe style momentum by setting the dampening to 0.

i.e. by setting dampening to 0 we compute:
v := momentum*v + lr*g
instead of
v := momentum*v + (1-momentum )*lr*g

lim0606 · 2016-06-11T00:54:28Z

@colesbury

Thank you for comments!

I will check the training result with 0.9 momentum.

… all layers in inception-resnet-v2

superzrx · 2016-08-14T02:16:14Z

@lim0606 why changeCAddTable(true) to CAddTable() ?

lim0606 · 2016-08-14T05:30:38Z

@superzrx

Hi,

Since identity layers pass memory addresses of their input tensors directly to next layer, CAddTable(true) seems to cause a problem, changing the values of inputs in residual layers.

In the case of cls task, the effect was minor; therefore, i didn't notice the problem for a long time... ;(

However, when I tried to apply the model to other types of tasks, having additional layers branching feature output to several paths, the model gave me nan within some iterations.

Sincerely,

Jaehyun Lim

superzrx · 2016-08-14T11:37:54Z

@lim0606
Hi Jaehyun,
As CAddTable(true) save result on its first child, it seems ok to make identity path second child.

lim0606 · 2016-08-15T03:29:20Z

@superzrx

Hi,

Since the first child is the identity layer, which directly refer its input as output, writing values over the first child of CAddTable(true) becomes writing values over the input of the second child.

Furthermore, resnet as well as inception-resnet-v2 consist of the residual layers, having identity path; therefore, CAddTable(true)s at different layers access the same memory addresses and change their values in a single forward path.

Best regards,

Jaehyun

superzrx · 2016-08-24T02:54:07Z

@lim0606
Hi lim,
In my last reply, I mean switch identity path and inception path. So identity becomes second child and CAddTable(true) do not couses problem ( which saves some memory ).
Also I see you use ConcatTable and JoinTable instead of Concat. I tried to use Concat on some networks but it encountered training problem. Is there some problem with concat and shareGradInput that you use ConcatTable and JoinTable to walk around?

lim0606 added 3 commits May 21, 2016 22:58

update for inception v4 (esp. inception-resnet-v2)

3251364

change the names of networks from inceptionv4 to inception-resnet-v2

1d2f028

delete inception-resnet-v2-aux.lua since it is not tested yet

5b02c42

ghost added the CLA Signed label Jun 9, 2016

ghost added the CLA Signed label Jul 12, 2016

lim0606 added 2 commits August 4, 2016 14:38

minor updates on opts.lua

2b448e7

fixed a critical bug, changing from CAddTable(true) to CAddTable() in…

e1e4a1f

… all layers in inception-resnet-v2

ghost added the CLA Signed label Aug 14, 2016

ghost added the CLA Signed label Aug 15, 2016

facebook-github-bot added the CLA Signed label Aug 24, 2016

lim0606 mentioned this pull request Oct 30, 2016

Saved model cluttered with buffers lim0606/torch-inception-resnet-v2#4

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add inception-resnet-v2 model #64

Add inception-resnet-v2 model #64

lim0606 commented Jun 9, 2016

colesbury commented Jun 10, 2016

colesbury commented Jun 10, 2016

lim0606 commented Jun 11, 2016

superzrx commented Aug 14, 2016

lim0606 commented Aug 14, 2016 •

edited

Loading

superzrx commented Aug 14, 2016

lim0606 commented Aug 15, 2016 •

edited

Loading

superzrx commented Aug 24, 2016

Add inception-resnet-v2 model #64

Are you sure you want to change the base?

Add inception-resnet-v2 model #64

Conversation

lim0606 commented Jun 9, 2016

colesbury commented Jun 10, 2016

colesbury commented Jun 10, 2016

lim0606 commented Jun 11, 2016

superzrx commented Aug 14, 2016

lim0606 commented Aug 14, 2016 • edited Loading

superzrx commented Aug 14, 2016

lim0606 commented Aug 15, 2016 • edited Loading

superzrx commented Aug 24, 2016

lim0606 commented Aug 14, 2016 •

edited

Loading

lim0606 commented Aug 15, 2016 •

edited

Loading