Skip to content

Logic for parsing recipes and extracting data for use with the web application.

Notifications You must be signed in to change notification settings

Eaterator/recipe_parser

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

#Recipe Parser Package contains helper functions to create parsers to extract data from the recipes, and join the Eaterator pipeline from scraping -> text files (JSON) -> recipe parsing (here) -> insert to models/DB. The current approach uses NLTK and its POS tagging and sentence chunking to detect amounts (CD for numerical values- including word forms), and compound nounds (* or ) types to detech ingredients. Some more logic is required to detech the that represents the measurement amount associated with the value. Also, special consideration may also be required for the case where the for measurement amount does not exist (i.e. '3 lemons').

##Requirements NLTK and Python (version 3.5.2).

##Sources A Ruby recipe parser is available here for inspiration:

https://github.com/iancanderson/ingreedy/blob/master/lib/ingreedy/dictionaries/en.yml

About

Logic for parsing recipes and extracting data for use with the web application.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages