Add new recognition method "ParseQ" #10836

ToddBear · 2023-09-06T07:29:09Z

No description provided.

paddle-bot · 2023-09-06T07:29:13Z

Thanks for your contribution!

ToddBear · 2023-09-06T11:36:39Z

Comparison with the Torch model, trained on the Real datasets (COCO-Text, RCTW17, Uber-Text, ArT, LSVT, MLT19, ReCTS, TextOCR and OpenVINO) and tested on IIIT5k, SVT, IC13, IC15, SVTP and CUTE:

ToddBear · 2023-09-06T11:42:18Z

Static graph model exported successfully:

Python deployment inference results verified:

tink2123

Also, please submit new algorithms to the dygraph branch.

tink2123 · 2023-09-07T05:57:27Z

configs/rec/rec_vit_parseq.yml

+  use_visualdl: False
+  infer_img: doc/imgs_words_en/word_10.png
+  # for data or label process
+  character_dict_path: /ssd3/suzejia/newAlgOCR/ppocr/utils/dict/parseq_dict.txt


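Please upload the dictionary and use its path relative to the GitHub repository.

The dictionary has been uploaded, and the path in the configs has been updated accordingly.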
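The request above amounts to replacing a machine-specific absolute path with one relative to the repository root. A minimal sketch of how such paths could be flagged in a loaded config follows. `find_absolute_paths` is a hypothetical helper, not part of PaddleOCR. The `Global` section layout and the relative path `ppocr/utils/dict/parseq_dict.txt` are assumptions inferred from the tail of the absolute path in the diff; the PR does not show the final value.

```python
from pathlib import PurePosixPath


def find_absolute_paths(config: dict, prefix: str = "") -> list:
    """Return dotted keys whose string values look like absolute POSIX paths."""
    hits = []
    for key, value in config.items():
        dotted = f"{prefix}.{key}" if prefix else key
        if isinstance(value, dict):
            # Recurse into nested sections such as Global, Train, Eval.
            hits.extend(find_absolute_paths(value, dotted))
        elif isinstance(value, str) and PurePosixPath(value).is_absolute():
            hits.append(dotted)
    return hits


# Hypothetical excerpts of the config before and after the fix (assumed layout).
before = {
    "Global": {
        "infer_img": "doc/imgs_words_en/word_10.png",
        "character_dict_path": "/ssd3/suzejia/newAlgOCR/ppocr/utils/dict/parseq_dict.txt",
    }
}
after = {
    "Global": {
        "infer_img": "doc/imgs_words_en/word_10.png",
        "character_dict_path": "ppocr/utils/dict/parseq_dict.txt",  # assumed relative path
    }
}

flagged_before = find_absolute_paths(before)
flagged_after = find_absolute_paths(after)
```

With these inputs, only `Global.character_dict_path` in `before` is flagged, and `after` yields an empty list.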

tink2123 · 2023-09-07T06:16:06Z

ppocr/data/imaug/rec_img_aug.py

+        self.image_shape = image_shape
+        self.dst_h, self.dst_w = image_shape[1], image_shape[2]
+
+    def __call__(self, data):


There is nothing special in this operation. Could you check whether the existing resize can be reused?

Now reuses SVTR's resize.

tink2123 · 2023-09-07T06:16:45Z

ppocr/modeling/backbones/rec_vit_parseq.py

+            ones_(m.weight)
+
+    def forward_features(self, x):
+        # B = x.shape[0]


Please delete comments that are not needed.

OK, deleted.

tink2123 · 2023-09-07T06:18:04Z

ppocr/modeling/heads/rec_parseq_head.py

+from typing import Optional
+import copy
+from itertools import permutations
+


If the code is based on another repo, please add a code reference, for example:

This code is refer from:
https://github.com/PaddlePaddle/PaddleClas/blob/release%2F2.5/ppcls/arch/backbone/model_zoo/vision_transformer.py

Code reference added.

tink2123 · 2023-09-07T06:19:18Z

ppocr/modeling/heads/rec_parseq_head.py

+        sz = perm.shape[0]
+        mask = paddle.zeros(shape=(sz, sz))
+        for i in range(sz):
+            query_idx = perm[i].cpu().numpy().tolist()


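Is the conversion to numpy really necessary?

Yes. If it is not converted to a list and the tensor is used directly, the mask indexing below hits a bug.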
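To illustrate why plain Python indices are convenient here, the following is a pure-Python sketch of building an attention mask from one decoding permutation. It is not the PR's Paddle code: the diff shows only the first lines of the loop, so the body of the loop and the helper name `build_perm_mask` are assumptions based on how permutation masks are commonly built (each query position is blocked from attending to positions that come later in the same ordering).

```python
NEG_INF = float("-inf")


def build_perm_mask(perm):
    """Build an additive attention mask for one decoding order.

    perm[i] is the position decoded at step i. A query position may not
    attend to any position that appears later in the same ordering.
    """
    # Plain ints index rows and columns directly; this mirrors the
    # tensor -> list conversion discussed above.
    perm = [int(p) for p in perm]
    sz = len(perm)
    mask = [[0.0] * sz for _ in range(sz)]
    for i in range(sz):
        query_idx = perm[i]
        for key_idx in perm[i + 1:]:
            mask[query_idx][key_idx] = NEG_INF
    return mask


left_to_right = build_perm_mask([0, 1, 2])
right_to_left = build_perm_mask([2, 1, 0])
```

For the identity ordering the result is the usual causal mask (everything above the diagonal is blocked); for the reversed ordering it is the mirror image.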

tink2123 · 2023-09-07T06:22:57Z

tools/infer/predict_rec.py

@@ -348,6 +354,17 @@ def resize_norm_img_svtr(self, img, image_shape):
        resized_image /= 0.5
        return resized_image

+    def resize_norm_img_parseq(self, img, image_shape):


Same as above: can this be reused?

Now reuses SVTR's resize.
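For context, the tail of `resize_norm_img_svtr` visible in the diff (`resized_image /= 0.5`) suggests a normalization that maps pixel values into [-1, 1]. The sketch below shows only that normalization and the HWC to CHW reordering in pure Python. The resize itself is omitted because it needs an image library, and the `/ 255` and `- 0.5` steps are assumptions about what precedes the visible line. `normalize_hwc_to_chw` is a hypothetical name, not a PaddleOCR function.

```python
def normalize_hwc_to_chw(img):
    """Convert a nested list [H][W][C] with values in 0..255
    into [C][H][W] with values scaled to [-1, 1]."""
    h, w, c = len(img), len(img[0]), len(img[0][0])
    out = []
    for ch in range(c):
        plane = []
        for y in range(h):
            # (v / 255 - 0.5) / 0.5 maps 0 -> -1.0 and 255 -> 1.0
            plane.append([(img[y][x][ch] / 255 - 0.5) / 0.5 for x in range(w)])
        out.append(plane)
    return out


# A 1x2 "image" with three channels per pixel.
sample = [[[0, 255, 51], [255, 0, 204]]]
normalized = normalize_hwc_to_chw(sample)
```

If PARSeq needs exactly this preprocessing, a separate `resize_norm_img_parseq` would duplicate the SVTR path, which is consistent with the reviewer's suggestion to reuse it.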

ToddBear · 2023-09-07T07:15:32Z

Also, please submit new algorithms to the dygraph branch.

Changed to target the dygraph branch.

tink2123

LGTM

tink2123 and others added 11 commits August 11, 2023 16:20

Update PP-OCRv4_introduction.md

ee9dc3c

Update PP-OCRv4_introduction.md (PaddlePaddle#10616)

88e2b13

* Update PP-OCRv4_introduction.md * Update PP-OCRv4_introduction.md * Update PP-OCRv4_introduction.md

Update README.md

6859e14

Cherrypicking PaddlePaddleGH-10217 and PaddlePaddleGH-10216 to Paddle…

b17c2f3

…Paddle:Release/2.7 (PaddlePaddle#10655) * Don't break overall processing on a bad image * Add preprocessing common to OCR tasks Add preprocessing to options

Update requirements.txt (PaddlePaddle#10656)

1614e84

added missing pyyaml library

[TIPC]update xpu tipc script (PaddlePaddle#10658)

c0d51f1

fix-typo (PaddlePaddle#10642)

f9dbab8

Co-authored-by: Dennis <dvorst@users.noreply.github.com> Co-authored-by: shiyutang <34859558+shiyutang@users.noreply.github.com>

Fix DSR error caused by data augmentation (PaddlePaddle#10662) (PaddlePaddle#10681)

b318d20

* Fix DSR error caused by data augmentation * Roll back the erroneous change

Update algorithm_overview_en.md (PaddlePaddle#10670)

958abfb

Fixed simple spelling errors.

Implement recoginition method ParseQ

7aa18ee

Document update for new recognition method ParseQ

473cc8b

add prediction for parseq

c37c989

ToddBear force-pushed the parseq branch from 9adbb52 to c37c989 Compare September 6, 2023 10:07

ToddBear added 4 commits September 6, 2023 19:30

Update rec_vit_parseq.yml

57a73c4

Update rec_r31_sar.yml

c7771ad

Update rec_r31_sar.yml

4028421

Update rec_r50_fpn_srn.yml

101a478

tink2123 reviewed Sep 7, 2023

ToddBear added 7 commits September 7, 2023 14:26

Update rec_vit_parseq.py

49f5bbc

Update rec_vit_parseq.yml

5cac1e3

Update rec_parseq_head.py

8c09638

Update rec_img_aug.py

6fe47b7

Update rec_vit_parseq.yml

d026281

Update __init__.py

2498154

Update predict_rec.py

525eaa9

ToddBear changed the base branch from release/2.7 to dygraph September 7, 2023 07:15

ToddBear added 4 commits September 7, 2023 15:41

Update paddleocr.py

88f82fc

Update requirements.txt

b7dde39

Update utility.py

b98e42c

Update utility.py

daa824f

tink2123 approved these changes Sep 7, 2023

tink2123 merged commit 75d1661 into PaddlePaddle:dygraph Sep 7, 2023

shiyutang added the Contributor PR is merged label Sep 7, 2023
