Try to detect the name based on the size of the text #54

teolemon · 2020-05-10T19:39:56Z

A simplistic assumption: the name of the product should be the largest text on the front of the product.
Based on this, we could compute the ratio between the area of each bounding box (width × height) and the number of letters inside it.
Based on this ratio, we could rank candidates for the product name.
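
To make the ratio idea concrete, here is a minimal sketch. It assumes the OCR step already yields each text block with its bounding-box width and height; the `TextBox` type and the function names are hypothetical, not part of any existing code. It also counts every non-whitespace character rather than strictly letters, which is an assumption made so that digits do not inflate the score.

```python
from dataclasses import dataclass


@dataclass
class TextBox:
    """One OCR text block with its bounding-box size (hypothetical type)."""
    text: str
    width: float
    height: float


def area_per_character(box: TextBox) -> float:
    """Bounding-box area divided by the number of non-whitespace characters."""
    n_chars = sum(not ch.isspace() for ch in box.text)
    if n_chars == 0:
        return 0.0  # empty block: cannot be a name candidate
    return box.width * box.height / n_chars


def rank_name_candidates(boxes: list[TextBox]) -> list[TextBox]:
    """Sort blocks so the text drawn largest (highest area per character) comes first."""
    return sorted(boxes, key=area_per_character, reverse=True)
```

The first few entries of `rank_name_candidates(...)` would then serve as product name candidates. Dividing by the character count keeps a long line of small print from outranking a short, large title.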

CloCkWeRX · 2020-07-30T10:15:05Z

I was thinking a good pipeline could also be:

- get all text from the front image
- look for brand candidates from the barcode group
- remove labels ("fat free")
- remove the quantity (360g or a similar regexp)
- extract the remaining sentences
- weight "x with y" or "x & y" heavily (perhaps using a good list of title patterns)

... and provide these as autocomplete suggestions for the name field.

These steps (particularly the brand lookup) might help filter out false positives.
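
The pipeline above could be sketched roughly as follows. Everything here is an assumption made for illustration: the `LABELS` list, the quantity regexp, the title pattern, the score weights, and the `suggest_names` function are all hypothetical. The brand candidates are assumed to have been looked up separately from the barcode group, and boosting lines that contain a brand is only one possible way to use them.

```python
import re

# Hypothetical label list; a real one would come from a labels taxonomy.
LABELS = ("fat free", "gluten free", "organic")

# Quantities such as "360g", "1.5 l" or "33 cl" (assumed unit list).
QUANTITY_RE = re.compile(r"\b\d+(?:[.,]\d+)?\s*(?:kg|g|ml|cl|l|oz)\b", re.IGNORECASE)

# Title patterns such as "x with y" or "x & y".
TITLE_PATTERN_RE = re.compile(r"\b\w+\s+(?:with|&)\s+\w+", re.IGNORECASE)


def suggest_names(ocr_lines: list[str], brand_candidates: list[str]) -> list[str]:
    """Return cleaned OCR lines, best name suggestion first."""
    scored = []
    for line in ocr_lines:
        text = line
        # Remove labels, then quantities.
        for label in LABELS:
            text = re.sub(re.escape(label), " ", text, flags=re.IGNORECASE)
        text = QUANTITY_RE.sub(" ", text)
        text = " ".join(text.split())  # collapse leftover whitespace
        if not text:
            continue  # the line was only a label or a quantity
        score = 1.0
        if TITLE_PATTERN_RE.search(text):
            score += 2.0  # weight "x with y" / "x & y" heavily
        if any(brand.lower() in text.lower() for brand in brand_candidates):
            score += 1.0  # assumed use of the brand lookup
        scored.append((score, text))
    # sorted() is stable, so ties keep their order of appearance.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [text for _, text in scored]
```

The returned list could feed the autocomplete suggestions for the name field directly.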

raphael0202 transferred this issue from openfoodfacts/robotoff Jul 15, 2020

teolemon added product_name OCR No ML skills required labels Sep 9, 2021

teolemon mentioned this issue Sep 9, 2021

Non ML tasks you can help with (Tracker) #77


teolemon added this to 🤖 Artificial Intelligence @ Open Food Facts Apr 2, 2022

teolemon moved this to To discuss and validate in 🤖 Artificial Intelligence @ Open Food Facts May 12, 2022
