Simulation-based inference for rapid Bayesian parameter estimation in epidemiological models: a comparison with MCMC
Mechanistic epidemiological models are essential tools for forecasting disease spread and informing public health decisions. To remain accurate, these models require frequent calibration to real-world surveillance data, such as ICU occupancy. Traditionally, this is done using Markov chain Monte Carlo (MCMC) methods, which are statistically robust but often too slow for the rapid, near-real-time analysis needed during an outbreak. This paper investigates whether simulation-based inference (SBI)—a machine-learning-based approach—can provide a faster, scalable alternative for calibrating these models without sacrificing accuracy.
How the approach works
The researchers used a mechanistic SECIR model to simulate COVID-19 dynamics in Germany during 2020. While MCMC relies on repeated, computationally expensive likelihood evaluations to estimate parameters, SBI uses neural posterior estimation. This method trains a neural network on a large number of simulated parameter-data pairs to learn a probabilistic mapping. Once this network is trained, it can rapidly estimate the posterior distribution of model parameters for new observed data. The study compared this SBI approach against MCMC across both short-term (31-day) and long-term (201-day) time windows to test its performance in different epidemiological scenarios.
Performance and speed
The results demonstrate that SBI is significantly faster than traditional MCMC. For 31-day inference windows, MCMC required approximately 1,000 seconds to complete, while SBI achieved comparable results in just 60 to 70 seconds on a single GPU. The performance gap widened significantly in the more complex 201-day reconstruction task: MCMC took over 19,000 seconds, whereas SBI completed the task in an average of 157 seconds. Despite this massive reduction in runtime, SBI successfully recovered posterior distributions that were in strong agreement with MCMC and accurately reproduced observed ICU occupancy trajectories.
Evaluating accuracy
To ensure the machine-learning approach was reliable, the researchers used several validation techniques. They compared the posterior distributions generated by both methods using quantitative metrics like Wasserstein distances and Kullback–Leibler divergences. Additionally, they performed posterior predictive checks, where they used the inferred parameters to simulate future ICU occupancy and compared those simulations against actual historical data. These checks confirmed that SBI not only produces results similar to MCMC but also maintains the necessary structure for meaningful epidemiological interpretation and uncertainty quantification.
Considerations for implementation
While SBI offers a powerful, efficient framework for rapid outbreak analysis, the authors note that its success depends on several factors. The quality of the results is influenced by the simulation design, the specification of prior distributions, and the architecture of the neural network. Because different parameter combinations can sometimes produce similar epidemic dynamics, the authors emphasize that careful validation against established methods like MCMC remains essential before adopting SBI as a standard tool in time-sensitive public health settings.
Comments (0)
to join the discussion
No comments yet
Be the first to share your thoughts!