Back to AI Research

AI Research

RAID: Semantic Graph Diffusion for True Cold-Start... | AI Research

Key Takeaways

  • RAID: Semantic Graph Diffusion for True Cold-Start and Cross-Lingual Forecasting Time-series forecasting models often struggle when they encounter a "cold-st...
  • Time-series foundation models show strong transfer performance when given a non-empty history window.
  • However, true cold-start scenarios, where a new item has no prior observations, violate this assumption.
  • We propose RAID (Retrieval-Augmented Iterative Diffusion) a framework, which replaces history-based correlation learning with metadata-driven semantic retrieval and graph-conditioned diffusion.
  • RAID maps textual metadata into a shared semantic space using a frozen multilingual embedding model and constructs an inductive retrieval graph that extends naturally to unseen items.
Paper AbstractExpand

Time-series foundation models show strong transfer performance when given a non-empty history window. However, true cold-start scenarios, where a new item has no prior observations, violate this assumption. We propose RAID (Retrieval-Augmented Iterative Diffusion) a framework, which replaces history-based correlation learning with metadata-driven semantic retrieval and graph-conditioned diffusion. RAID maps textual metadata into a shared semantic space using a frozen multilingual embedding model and constructs an inductive retrieval graph that extends naturally to unseen items. It first forms a base forecast by aggregating information from semantically related neighbors, then refines this forecast with a gated diffusion module to model residual uncertainty. Under a strict true cold-start protocol, RAID outperforms strong foundation models and competitive baselines on both forecasting accuracy and prediction interval coverage, while reducing inference latency by an order of magnitude through non-autoregressive decoding. The shared semantic space also enables zero-shot cross-lingual transfer, allowing a model trained on English descriptions to generalize to items described in other languages without direct supervision.

RAID: Semantic Graph Diffusion for True Cold-Start and Cross-Lingual Forecasting
Time-series forecasting models often struggle when they encounter a "cold-start" scenario—a situation where a new item, such as a newly launched product, has no historical sales data to analyze. While modern foundation models are powerful, they typically require a history window to make predictions, causing them to fail when that data is missing. This paper introduces RAID (Retrieval-Augmented Iterative Diffusion), a framework designed to solve this by replacing history-based learning with metadata-driven insights. Instead of relying on past observations, RAID uses textual descriptions to understand an item's potential demand, allowing it to make accurate forecasts for items that have never been seen before.

How RAID Works

RAID operates by mapping textual metadata—such as product titles, categories, and descriptions—into a shared "semantic space" using a pre-trained multilingual model. By doing this, the system can identify relationships between items based on their descriptions rather than their sales history. For example, if a new running shoe is introduced, the model identifies similar existing shoes and uses their demand patterns as a guide. It constructs an "inductive retrieval graph" that connects these items, allowing the model to propagate demand signals from established products to new ones.

Generating Forecasts

The forecasting process in RAID is split into two main phases. First, it creates a "base forecast" by aggregating the normalized patterns of semantically similar neighbors. This provides a solid starting point that avoids the common pitfall of overestimating or underestimating demand for niche items. Second, it uses a "Gated Diffusion Transformer" to refine this base prediction. This module models the residual uncertainty—the difference between the base prediction and the actual outcome—to produce a final, probabilistic forecast. By focusing on this residual, the model becomes more efficient, requiring significantly fewer refinement steps than traditional diffusion models.

Efficiency and Performance

A major advantage of the RAID framework is its speed. Because it decouples the memory-intensive retrieval process from the generative diffusion process, it achieves much lower inference latency compared to standard transformer-based models. In testing, RAID outperformed existing foundation models in strict cold-start scenarios, where it was able to provide accurate predictions despite having zero historical data. Furthermore, because the model uses a multilingual embedding space, it demonstrates strong "zero-shot" cross-lingual transfer. This means a model trained on English product descriptions can effectively forecast demand for items described in other languages, such as Spanish or Russian, without needing additional training data in those languages.

Key Takeaways

RAID represents a shift in how we approach forecasting for new items. By grounding predictions in static metadata rather than temporal history, it overcomes the "cold-start" failure mode that plagues many current foundation models. The framework’s ability to maintain high accuracy while reducing computational costs makes it a practical solution for real-world applications where new items are frequently added to a catalog. While traditional models remain strong when historical data is abundant, RAID provides a robust, scalable alternative for scenarios where that data is entirely absent.

Comments (0)

No comments yet

Be the first to share your thoughts!