
mobilenet structure omits last pooling layer #40

Closed
zhenglaizhang opened this issue Feb 8, 2018 · 5 comments

Comments

@zhenglaizhang

Take a look at the mobilenet structure: most of the weights are in the final FC layer:

pre_fc1_weight parameter size=25690112, shape=(512L, 50176L)

Looking at the code, the last pooling layer is omitted (mobilenet):

conv_13_dw = Conv(conv_12, num_group=512, num_filter=512, kernel=(3, 3), pad=(1, 1), stride=(2, 2), name="conv_13_dw") # 14/7
conv_13 = Conv(conv_13_dw, num_filter=1024, kernel=(1, 1), pad=(0, 0), stride=(1, 1), name="conv_13") # 7/7
conv_14_dw = Conv(conv_13, num_group=1024, num_filter=1024, kernel=(3, 3), pad=(1, 1), stride=(1, 1), name="conv_14_dw") # 7/7
conv_14 = Conv(conv_14_dw, num_filter=1024, kernel=(1, 1), pad=(0, 0), stride=(1, 1), name="conv_14") # 7/7
body = conv_14
fc1 = symbol_utils.get_fc1(body, num_classes, fc_type)

Is this by design? Omitting the last pooling layer leads to a much larger model size :-(
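
A quick back-of-the-envelope calculation (a sketch; the 512-d embedding size and the 7x7x1024 feature map are taken from the numbers above) shows why skipping global pooling inflates the FC layer:

    # Parameter count of pre_fc1 with and without a global pooling layer,
    # given a 7x7x1024 final feature map and a 512-d embedding.
    emb_size = 512
    c, h, w = 1024, 7, 7
    no_pool = emb_size * (c * h * w)  # 512 * 50176 = 25,690,112 weights
    pooled = emb_size * c             # 512 * 1024  =    524,288 weights
    print(no_pool // pooled)          # 49x more parameters without pooling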

@nttstar
Collaborator

nttstar commented Feb 8, 2018

Please refer to the network structure section in our paper.

@zhenglaizhang
Author

@nttstar Thanks for the info, I will read the paper carefully.

@zhenglaizhang
Author

zhenglaizhang commented Feb 9, 2018

@nttstar Hi, you are definitely right; I was using version 'E', which removes the GP (global pooling) layer.

Then I tried mobilenetv1 with 'D' as the version output, whose model size is around 15 MB. I started training 15 hours ago with a batch size of 128, but the test accuracy is still around 0.5.

Have you tried training with such settings? Maybe I need to tune other hyperparameters...
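
For context, here is a minimal sketch of what the 'E' head in symbol_utils.get_fc1 looks like, assuming it is the BN-Dropout-FC-BN block applied directly to the flattened conv map (the exact eps and dropout values here are assumptions, not verbatim from the repo):

    import mxnet as mx

    def fc1_type_e(last_conv, num_classes):
        # BN -> Dropout -> FC -> BN, with no global pooling, so the FC
        # sees the full 7x7x1024 = 50176-d flattened feature map.
        bn1 = mx.sym.BatchNorm(data=last_conv, fix_gamma=False, eps=2e-5, name='bn1')
        drop = mx.sym.Dropout(data=bn1, p=0.4)  # dropout rate assumed
        fc1 = mx.sym.FullyConnected(data=drop, num_hidden=num_classes, name='pre_fc1')
        return mx.sym.BatchNorm(data=fc1, fix_gamma=True, eps=2e-5, name='fc1')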

@nttstar
Collaborator

nttstar commented Feb 11, 2018

@zhenglaizhang We all use 'E' in our recent experiments.

@mmxuan18

mmxuan18 commented Aug 2, 2019

@zhenglaizhang I also have this question. In the code, the last conv connects directly to mx.sym.FullyConnected; does this function flatten internally? (See the sketch below.)

Which part of the paper explains this modification?
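
For what it's worth: yes, MXNet's FullyConnected flattens its input by default (its flatten argument defaults to True), which shape inference confirms:

    import mxnet as mx

    data = mx.sym.Variable('data')
    fc = mx.sym.FullyConnected(data=data, num_hidden=512, name='pre_fc1')
    # flatten=True by default: a (N, 1024, 7, 7) input becomes (N, 50176)
    # internally, so no explicit mx.sym.Flatten is needed before the FC.
    _, out_shapes, _ = fc.infer_shape(data=(1, 1024, 7, 7))
    print(out_shapes)  # [(1, 512)]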
