add svtr large model #10937

zhangyubo0722 · 2023-09-18T11:58:54Z

No description provided.

paddle-bot · 2023-09-18T11:58:58Z

Thanks for your contribution!

tink2123 · 2023-09-18T12:23:59Z

configs/rec/rec_svtrnet_large.yml

+  use_visualdl: false
+  infer_img: doc/imgs_words/ch/word_1.jpg
+  character_dict_path: ppocr/utils/ppocr_keys_v1.txt
+  max_text_length: &max_text_length 25


这里可以修改成40

tink2123 · 2023-09-18T12:24:31Z

configs/rec/rec_svtrnet_large.yml

+  beta2: 0.99
+  epsilon: 1.0e-08
+  weight_decay: 0.05
+  no_weight_decay_name: norm pos_embed char_node_embed pos_node_embed char_pos_embed vis_pos_embed


这里是优化过的吗？

tink2123 · 2023-09-18T12:25:48Z

configs/rec/rec_svtrnet_large.yml

+    out_channels: 512
+    patch_merging: Conv
+    embed_dim: [192, 256, 512]
+    depth: [6, 6, 9]


参数都是调整过的？

tink2123 · 2023-09-18T12:27:45Z

configs/rec/rec_svtrnet_large.yml

+
+Architecture:
+  model_type: rec
+  algorithm: SVTR_LCNet


algorithm：SVTR?
已经没有LCNet了

tink2123 · 2023-09-25T12:07:57Z

ppocr/modeling/heads/rec_multi_head.py

+        self.dec_pos_embed = self.create_parameter(
+                            shape=[1, w, dim], default_initializer=zeros_)
+        self.add_parameter("dec_pos_embed", self.dec_pos_embed)
+                        # self.pos_drop = nn.Dropout(p=drop_rate)


删除多余代码

tink2123 · 2023-09-25T12:11:51Z

ppocr/modeling/heads/rec_multi_head.py

@@ -88,7 +111,9 @@ def __init__(self, in_channels, out_channels_list, **kwargs):
                    '{} is not supported in MultiHead yet'.format(name))

    def forward(self, x, targets=None):
-
+        if self.use_pool:
+            # print(x.shape)


tink2123 · 2023-09-25T12:11:56Z

ppocr/modeling/heads/rec_multi_head.py

@@ -61,8 +78,14 @@ def __init__(self, in_channels, out_channels_list, **kwargs):
                max_text_length = gtc_args.get('max_text_length', 25)
                nrtr_dim = gtc_args.get('nrtr_dim', 256)
                num_decoder_layers = gtc_args.get('num_decoder_layers', 4)
-                self.before_gtc = nn.Sequential(
+                if self.use_pos:
+                    # add_pos = AddPos(nrtr_dim, 60)


cuicheng01 · 2023-09-25T12:33:31Z

ppocr/modeling/backbones/rec_vit.py

+# See the License for the specific language governing permissions and
+# limitations under the License.
+
+from matplotlib.mlab import stride_windows


删除无关代码

cuicheng01 · 2023-09-25T12:34:19Z

ppocr/modeling/backbones/rec_vit.py

+
+    def forward(self, x):
+
+        qkv = paddle.reshape(self.qkv(x), (0, -1, 3, self.num_heads, self.dim //


这么写会不会不能导出inference model，验证过了吗

tink2123

LGTM

trantuankhoi · 2023-11-03T08:14:44Z

configs/rec/PP-OCRv4/ch_PP-OCRv4_rec_svtr_large.yml

+    embed_dim: [192, 256, 512]
+    depth: [6, 6, 9]
+    num_heads: [6, 8, 16]
+    mixer: ['Conv','Conv','Conv','Conv','Conv','Conv','Conv','Conv','Conv','Global','Global','Global','Global','Global','Global','Global','Global','Global','Global','Global','Global']


Hi @zhangyubo0722, can I have a question?
I guess the Permuation column (in the SVTR paper) is the value of mixer, so I set my config is Conv*10 and Global*11, but your config is Conv*9 and Global*11. Can you show me the quotation you used for this config please. Thank you a lot

zhangyubo0722 force-pushed the add_svtr_large branch 2 times, most recently from d6dc304 to 435a928 Compare September 18, 2023 12:08

tink2123 reviewed Sep 18, 2023

View reviewed changes

zhangyubo0722 force-pushed the add_svtr_large branch 2 times, most recently from 57d2780 to c675aa5 Compare September 25, 2023 12:01

tink2123 reviewed Sep 25, 2023

View reviewed changes

zhangyubo0722 force-pushed the add_svtr_large branch 3 times, most recently from b3487df to f8eb3f5 Compare September 25, 2023 12:24

tink2123 previously approved these changes Sep 25, 2023

View reviewed changes

cuicheng01 reviewed Sep 25, 2023

View reviewed changes

zhangyubo0722 dismissed tink2123’s stale review via 40396c4 September 26, 2023 02:22

zhangyubo0722 force-pushed the add_svtr_large branch from f8eb3f5 to 40396c4 Compare September 26, 2023 02:22

zhangyubo0722 added 2 commits September 26, 2023 03:02

add svtr large model

7eb7844

[WIP]add svtr large model

2d8a6ae

zhangyubo0722 force-pushed the add_svtr_large branch from 40396c4 to 2d8a6ae Compare September 26, 2023 03:17

tink2123 approved these changes Sep 26, 2023

View reviewed changes

tink2123 merged commit e49e491 into PaddlePaddle:dygraph Sep 26, 2023

trantuankhoi reviewed Nov 3, 2023

View reviewed changes

trantuankhoi mentioned this pull request Nov 5, 2023

Config of SVTR-CPPD (large) #11198

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add svtr large model #10937

add svtr large model #10937

zhangyubo0722 commented Sep 18, 2023

paddle-bot bot commented Sep 18, 2023

tink2123 Sep 18, 2023

tink2123 Sep 18, 2023

tink2123 Sep 18, 2023

tink2123 Sep 18, 2023

tink2123 Sep 25, 2023

tink2123 Sep 25, 2023

tink2123 Sep 25, 2023

cuicheng01 Sep 25, 2023

cuicheng01 Sep 25, 2023

tink2123 left a comment

trantuankhoi Nov 3, 2023


		def forward(self, x):

		qkv = paddle.reshape(self.qkv(x), (0, -1, 3, self.num_heads, self.dim //

add svtr large model #10937

add svtr large model #10937

Conversation

zhangyubo0722 commented Sep 18, 2023

paddle-bot bot commented Sep 18, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tink2123 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment