Back to AI Research

AI Research

Epicure: Navigating the Emergent Geometry of Food I... | AI Research

Key Takeaways

  • Epicure: Navigating the Emergent Geometry of Food Ingredient Embeddings Epicure is a research project that explores how computers can represent the complex r...
  • We present Epicure, a family of three sibling skip-gram ingredient embeddings retrained from scratch on a multilingual recipe corpus.
  • Epicure: Navigating the Emergent Geometry of Food Ingredient Embeddings Epicure is a research project that explores how computers can represent the complex relationships between food ingredients.
  • Epicure: Navigating the Emergent Geometry of Food Ingredient Embeddings
  • Epicure is a research project that explores how computers can represent the complex relationships between food ingredients.
Paper AbstractExpand

We present Epicure, a family of three sibling skip-gram ingredient embeddings retrained from scratch on a multilingual recipe corpus. We aggregate 4.14M recipes from 11 sources spanning seven languages, English, Chinese, Russian, Vietnamese, Spanish, Turkish, Indonesian, German, and Indian-English, and normalise the raw ingredient strings to 1,790 canonical entries via an LLM-augmented pipeline. A 203,508-edge ingredient-ingredient NPMI graph and an 80,019-edge typed FlavorDB ingredient-compound graph, 2,247 typed compound nodes across 15 categories, seed three Metapath2Vec variants that share architecture and hyperparameters and differ only in the random-walk schema: Cooc walks the co-occurrence graph only, Chem walks the typed compound metapaths only, and Core blends both via injected ingredient-ingredient walks at controlled mixing, placing each model at a distinct point on the chemistry-vs-recipe-context spectrum.

Epicure: Navigating the Emergent Geometry of Food Ingredient Embeddings
Epicure is a research project that explores how computers can represent the complex relationships between food ingredients. By analyzing millions of recipes and chemical data, the researchers created a system that maps ingredients into a 300-dimensional space. This allows the model to understand culinary concepts—such as flavor profiles, nutritional content, and cultural traditions—as mathematical directions, enabling tools that can help chefs discover new ingredient pairings or explore culinary categories.

A Multilingual Foundation

To build a comprehensive model, the researchers aggregated over 4.1 million recipes from 11 different sources across seven languages, including English, Chinese, Russian, Vietnamese, Spanish, Turkish, Indonesian, and German. Because raw recipe data is often messy, they used an AI-augmented pipeline to normalize roughly 200,000 unique ingredient terms into a clean, canonical list of 1,790 ingredients. This ensures the model can recognize that different terms across languages often refer to the same culinary building block.

Three Perspectives on Flavor

The core innovation of Epicure is the creation of three "sibling" models that learn from different types of information. All three share the same architecture but differ in how they "walk" through the data:

  • Cooc: Focuses exclusively on recipe co-occurrence, learning which ingredients are typically used together in the same dish.

  • Chem: Focuses on chemical compounds, learning how ingredients are related based on the flavor molecules they share.

  • Core: A hybrid model that blends both recipe context and chemical data.
    By adjusting these "walk schemas," the researchers created a controllable spectrum that allows them to see how much weight should be given to human culinary habits versus the underlying chemistry of food.

Navigating Culinary Geometry

The researchers found that these models naturally organize ingredients into meaningful clusters without being explicitly told to do so. For example, the embeddings clearly separate ingredients by cuisine (such as East Asian or Mediterranean) and nutritional category.
The models support two primary ways of interacting with this "flavor space":

  • Nearest-Neighbor Lookups: Finding ingredients that are mathematically similar to a starting point.

  • Directional Arithmetic: Using a technique called SLERP, users can "rotate" an ingredient toward a specific goal. For instance, a user could take a "rice" seed and rotate it toward a "South-Asian" pole to discover related ingredients like curry leaves, urad dal, or fenugreek seeds.

Important Considerations

While the Epicure models demonstrate that culinary knowledge can be effectively captured in a dense mathematical space, the researchers have not released the code or the trained models at this time. The study highlights that while the models are highly effective at recovering supervised labels like macronutrients or sensory categories, the "geometry" of the food space changes depending on whether the model prioritizes chemical data or recipe-based context.

Comments (0)

No comments yet

Be the first to share your thoughts!