Skip to content

This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

使用return_word_box功能,输出很奇怪 #14428

Closed
3 tasks done
huameinan219 opened this issue Dec 20, 2024 · 0 comments
Closed
3 tasks done

使用return_word_box功能,输出很奇怪 #14428

huameinan219 opened this issue Dec 20, 2024 · 0 comments

Comments

@huameinan219
Copy link

🔎 Search before asking

  • I have searched the PaddleOCR Docs and found no similar bug report.
  • I have searched the PaddleOCR Issues and found no similar bug report.
  • I have searched the PaddleOCR Discussions and found no similar bug report.

🐛 Bug (问题描述)

直接安装paddleocr包,调用PaddleOCR(lang="ch", return_word_box=True),输出的结果类似于:
[[[26.0, 37.0], [304.0, 37.0], [304.0, 73.0], [26.0, 73.0]], ('纯臻营养护发素', 0.9946897625923157, [46.085826210826205, [['纯', '臻', '营', '养', '护', '发', '素']], [[3, 10, 16, 23, 30, 36, 43]], ['cn']])]
后面的 [46.085826210826205, [['纯', '臻', '营', '养', '护', '发', '素']], [[3, 10, 16, 23, 30, 36, 43]] 代表什么呢?如何将其转化为4点检测框呢?我查看了#10377,里面的解释没看懂。
查看源码源码第233行,cal_ocr_word_box函数返回的应该是检测框啊,为啥调用paddleocr包,返回的单字检测结果不是框?该如何将其转化为框呢?

🏃‍♂️ Environment (运行环境)

x86 CPU

🌰 Minimal Reproducible Example (最小可复现问题的Demo)

PaddleOCR(lang="ch", return_word_box=True)

@PaddlePaddle PaddlePaddle locked and limited conversation to collaborators Dec 20, 2024
@GreatV GreatV converted this issue into discussion #14430 Dec 20, 2024

This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

Projects
None yet
Development

No branches or pull requests

1 participant