Tools for mining recipes from the internet.
A couple of tools are provided here to automatically fetch recipes and parse them into a common format.
Recipes can be downloaded from a random-recipe endpoint as gzip-compressed (default) HTML into a directory (default: html) using get_random_recipes.py.
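As a rough illustration of the download step, the sketch below fetches one page and stores it gzip-compressed in a directory. It is not the code of get_random_recipes.py: the function names, the `.html.gz` file naming, and the endpoint URL (supplied by the caller) are all assumptions made for this example.

```python
import gzip
from pathlib import Path


def save_html_gz(html: str, out_dir: str, name: str) -> Path:
    """Write one HTML page to <out_dir>/<name>.html.gz and return the path."""
    directory = Path(out_dir)
    directory.mkdir(parents=True, exist_ok=True)
    path = directory / f"{name}.html.gz"  # hypothetical naming scheme
    with gzip.open(path, "wt", encoding="utf-8") as handle:
        handle.write(html)
    return path


def fetch_random_recipe(url: str, timeout: float = 10.0) -> str:
    """Download one page from a random-recipe endpoint chosen by the caller."""
    # Imported lazily so save_html_gz can be used without requests installed.
    import requests

    response = requests.get(url, timeout=timeout)
    response.raise_for_status()  # fail loudly on HTTP errors
    return response.text
```

A call such as `save_html_gz(fetch_random_recipe(url), "html", "recipe_0001")` would then leave one compressed page in the default html directory, where `url` stands for whichever endpoint is used.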
With parse_recipe.py, recipes from the default endpoint can be parsed into a gzip-compressed JSON structure.
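The shape of that conversion, gzip-compressed HTML in and gzip-compressed JSON out, might look like the sketch below. To stay dependency-free it uses the standard library's `html.parser` as a stand-in for BeautifulSoup and extracts only the page title. The record fields (`source`, `title`) and the function name are hypothetical and do not reflect the actual output of parse_recipe.py.

```python
import gzip
import json
from html.parser import HTMLParser
from pathlib import Path


class TitleParser(HTMLParser):
    """Collect the text found inside <title> elements."""

    def __init__(self):
        super().__init__()
        self.in_title = False
        self.title = ""

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self.in_title = True

    def handle_endtag(self, tag):
        if tag == "title":
            self.in_title = False

    def handle_data(self, data):
        if self.in_title:
            self.title += data


def html_gz_to_json_gz(src: str, dst: str) -> dict:
    """Read a gzip-compressed HTML file and write a gzip-compressed JSON record."""
    with gzip.open(src, "rt", encoding="utf-8") as handle:
        html = handle.read()
    parser = TitleParser()
    parser.feed(html)
    parser.close()
    # Hypothetical record layout; a real parser would extract far more.
    record = {"source": Path(src).name, "title": parser.title.strip()}
    with gzip.open(dst, "wt", encoding="utf-8") as handle:
        json.dump(record, handle)
    return record
```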
metadata.py can concatenate the recipes into two types of CSV.
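The two CSV layouts are not described here, so the following is only a sketch of the general idea: gather every gzip-compressed JSON record in a directory and write them as rows of a single flat table. It uses the standard library's `csv` module where the project relies on pandas. It assumes each record is a flat dictionary and that files end in `.json.gz`; the function name is made up for the example.

```python
import csv
import gzip
import json
from pathlib import Path


def json_gz_to_csv(json_dir: str, csv_path: str) -> int:
    """Concatenate all *.json.gz records in json_dir into one CSV file.

    Returns the number of records written.
    """
    records = []
    for path in sorted(Path(json_dir).glob("*.json.gz")):
        with gzip.open(path, "rt", encoding="utf-8") as handle:
            records.append(json.load(handle))
    # The union of all keys, in first-seen order, becomes the CSV header.
    fields = list(dict.fromkeys(key for record in records for key in record))
    with open(csv_path, "w", newline="", encoding="utf-8") as handle:
        # restval fills in columns that a given record does not have.
        writer = csv.DictWriter(handle, fieldnames=fields, restval="")
        writer.writeheader()
        writer.writerows(records)
    return len(records)
```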
Requirements:
- python3
- BeautifulSoup (for parsing)
- requests (for fetching)
- pandas (for creating csvs)