- Food-101 (101 food categories)
- VireoFood-172 (172 food categories & 353 ingredients)
- Recipe1M (1M English recipes & 13M food images)
- ChineseFoodNet (208 food categories)
- Cookpad (Japanese recipes & food images)
- YouCook2 (Instructional video)
- EPIC-Kitchens (Instructional video)
- Multi-Scale Multi-View Deep Feature Aggregation for Food Recognition (TIP, 2020)
- Zero-shot Ingredient Recognition by Multi-Relational Graph Convolutional Network (AAAI, 2020)
- FoodAI: Food Image Recognition via Deep Learning for Smart Food Logging (KDD, 2019)
- Mixed-dish Recognition with Contextual Relation Networks (MM, 2019)
- Wide-Slice Residual Networks for Food Recognition (WACV, 2018)
- Cross-modal Recipe Retrieval with Rich Food Attributes (MM, 2017)
- Deep-based Ingredient Recognition for Cooking Recipe Retrieval (MM, 2016)
- MCEN: Bridging Cross-Modal Gap between Cooking Recipes and Dish Images with Latent Variable Model (CVPR, 2020)
- Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images (CVPR, 2019) [code]
- R2GAN: Cross-modal Recipe Retrieval with Generative Adversarial Network (CVPR, 2019)
- Cross-Modal Retrieval in the Cooking Context: Learning Semantic Text-Image Embeddings (SIGIR, 2018) [code]
- Deep Understanding of Cooking Procedure for Cross-modal Recipe Retrieval (MM, 2018)
- Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images (TPAMI, 2019)
- Learning Cross-modal Embeddings for Cooking Recipes and Food Images (CVPR, 2017) [code]
- Cross-modal Recipe Retrieval with Rich Food Attributes (MM, 2017)
- Cross-modal Recipe Retrieval: How to Cook This Dish? (MMM, 2017)
- CookGAN: Causality based Text-to-Image Synthesis (CVPR, 2020)
- CookGAN: Meal Image Synthesis from Ingredients (WACV, 2020)
- The art of food: Meal image synthesis from ingredients (arXiv, 2019)
- Inverse Cooking: Recipe Generation from Food Images (CVPR, 2019) [code]
- How to make a pizza: Learning a compositional layer-based GAN model (CVPR, 2019)
- Action Modifiers: Learning from Adverbs in Instructional Videos (CVPR, 2020)
- Multi-Modal Domain Adaptation for Fine-Grained Action Recognition (CVPR, 2020)
- DDLSTM: Dual-Domain LSTM for Cross-Dataset Action Recognition (CVPR, 2019)
- Towards Automatic Learning of Procedures from Web Instructional Videos (AAAI, 2018)
- Scaling Egocentric Vision: The EPIC-KITCHENS Dataset (ECCV, 2018)
- Finding “It”: Weakly-Supervised Reference-Aware Visual Grounding in Instructional Videos (CVPR, 2018)
- Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos (CVPR, 2017)
- A survey on food computing (ACM Computing Surveys, 2019)
If you have anything related to FoodAI that you would like to add to this repo, feel free to contact me at [email protected].