Back to AI Research

AI Research

Toward Calibrated Mixture-of-Experts Under Distribu... | AI Research

Key Takeaways

  • Toward Calibrated Mixture-of-Experts Under Distribution Shift This paper investigates why Mixture-of-Experts (MoE) models—which combine multiple specialized...
  • Calibration aligns a model's predictive uncertainty with the frequencies of its empirical outcomes and is important for understanding and trusting reported probabilities.
  • In this work, we study how MoE models behave under distribution shift, focusing on how routing mechanisms interact with expert-level calibration.
  • We show that expert calibration is sufficient to ensure calibration of the overall model under a broad class of distribution shifts in hard-routed models, but is insufficient for calibrating soft-routed models.
  • Toward Calibrated Mixture-of-Experts Under Distribution Shift
Paper AbstractExpand

Calibration aligns a model's predictive uncertainty with the frequencies of its empirical outcomes and is important for understanding and trusting reported probabilities. Recent work shows that enforcing calibration at the level of individual predictors can improve ensemble accuracy and calibration, with mixture-of-experts (MoE) models showing strong empirical improvements in particular; however, the conditions under which calibration helps MoE are not well understood. In this work, we study how MoE models behave under distribution shift, focusing on how routing mechanisms interact with expert-level calibration. We show that expert calibration is sufficient to ensure calibration of the overall model under a broad class of distribution shifts in hard-routed models, but is insufficient for calibrating soft-routed models. To address this, we propose an adversarial reweighting that penalizes calibration errors of the routed aggregate under distribution shift, and we demonstrate that it improves the accuracy-calibration tradeoff both on average and on difficult subsets of the data, across model classes, prediction tasks, and distribution shifts.

Toward Calibrated Mixture-of-Experts Under Distribution Shift
This paper investigates why Mixture-of-Experts (MoE) models—which combine multiple specialized sub-models to solve complex tasks—often become unreliable when the data they encounter during deployment differs from their training data. While these models are designed to be efficient and specialized, the authors demonstrate that even when individual experts are perfectly calibrated, the overall model can produce misleading confidence scores under distribution shift. The researchers propose new training objectives that improve the model's ability to maintain accurate, trustworthy probability estimates even when routing patterns change.

The Problem with Soft Routing

MoE models use a "router" to decide which experts to use for a given input. In "hard routing," an input is sent to only one expert, creating a clear, predictable path. In "soft routing," the model blends the outputs of multiple experts. The authors show that soft routing is inherently fragile. Because the model collapses many different combinations of expert predictions and routing weights into a single confidence score, it can mask underlying errors. If the distribution of these combinations shifts at test time, the model’s aggregate confidence may no longer match its actual accuracy, even if every individual expert remains reliable.

Why Hard Routing is More Robust

The study highlights that hard-routed MoEs are significantly more stable under distribution shift. Because the router selects a single expert, the model’s reliability depends only on that specific expert and its confidence level. As long as the relationship between the expert’s prediction and the actual outcome remains consistent, the model stays calibrated. This creates a "bottleneck" that protects the model from shifts in the broader data distribution, provided the label behavior within each expert’s region remains unchanged.

Strengthening Models with Adversarial Training

To fix the calibration issues in soft-routed models, the authors introduce two new training objectives: "Robust MoE" and "Robust Filtered." Since it is difficult to identify exactly which routing configurations are fragile, the researchers use the model’s own loss as a signal. They apply an "adversarial reweighting" technique that forces the model to pay more attention to examples where it is currently struggling (high-loss examples). By using entropy-balanced reweighting, the model learns to be more robust across different data subsets without sacrificing overall accuracy.

Key Takeaways

The research demonstrates that expert-level calibration is not enough to guarantee a reliable MoE model. While hard routing provides a natural defense against distribution shift, soft routing requires more sophisticated training to prevent the aggregate predictor from becoming miscalibrated. The proposed robust training methods successfully improve the accuracy-calibration tradeoff across both text and image benchmarks, offering a way to build more trustworthy AI systems that remain stable even when faced with unexpected changes in their environment.

Comments (0)

No comments yet

Be the first to share your thoughts!