Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Try to detect the name based on the size of the text #54

Open
Tracked by #77
teolemon opened this issue May 10, 2020 · 1 comment
Open
Tracked by #77

Try to detect the name based on the size of the text #54

teolemon opened this issue May 10, 2020 · 1 comment

Comments

@teolemon
Copy link
Member

A simplistic assumption: the name of the product should be the largest text on the front of the product.
Based on this, we could compute a ratio between the area of the bounding boxes (width by height) and the amount of letters inside it .
Based on this ratio, we could have candidates for the product name

@raphael0202 raphael0202 transferred this issue from openfoodfacts/robotoff Jul 15, 2020
@CloCkWeRX
Copy link

I was thinking a good pipeline also could be:

  • get all text from front image
  • look for brand candidates from barcode group
  • remove labels ("fat free")
  • remove quantity (360g/similar regexp)
  • extract remaining sentences
  • weight "x with y" or "x & y" heavily (perhaps a good list of title patterns)

... and provide these as autocomplete suggestions for the name field.

These steps might help (particularly brands) filter put false positives

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: To discuss and validate
Development

No branches or pull requests

2 participants