HaorFloodAlert: Deseasonalized ML Ensemble for 72-H...

HaorFloodAlert: Deseasonalized ML Ensemble for 72-Hour Flood Prediction in Bangladesh Haor Wetlands
The haor wetlands of northeast Bangladesh are vast, flat basins that fill rapidly during pre-monsoon rains, often leaving local communities with only hours of warning before flash floods destroy the annual boro rice harvest. Existing flood forecasting systems are designed for deep, channelized rivers like the Brahmaputra and fail to account for the unique "backwater" dynamics of these shallow, bowl-like wetlands. This paper introduces HaorFloodAlert, a machine learning system specifically engineered to provide a 72-hour flood probability forecast for the Sunamganj region, helping to bridge the gap between technical data and the urgent survival needs of 3–4 million people.

Correcting for Seasonal Bias

A significant challenge in flood modeling is the "seasonal cheat code." Because floods in this region naturally occur during warmer months, raw temperature data often acts as a calendar proxy rather than a physical cause of flooding. The researchers found that including raw temperature inflated model accuracy by 6.9 percentage points. To fix this, they replaced raw temperature with a "climatological anomaly"—the difference between observed temperature and the long-term monthly average. This ensures the model learns from genuine weather patterns rather than simply identifying the time of year.

Integrating Satellite and Upstream Data

The system moves beyond traditional gauge-based monitoring by using Sentinel-1 satellite radar (SAR) to detect water movement in real-time. Because the haor basins are fed by water flowing down from the Barak River in Assam, India, the researchers built a "Barak River Sentinel-1 proxy." By monitoring this upstream area, the system gains approximately 36 hours of lead time before water reaches the wetlands. The model combines this satellite data with rainfall forecasts, soil moisture levels, and wind speed to create a comprehensive picture of flood risk.

Operational Alerts and Crop Protection

HaorFloodAlert is designed to be a functional tool for the community, not just a research project. It features a three-tier alert pipeline that sends automated SMS, email, and WhatsApp messages to farmers and local officials when flood risk reaches critical levels. Additionally, the system includes a crop damage estimator calibrated by the Bangladesh Rice Research Institute (BRRI). This tool calculates potential yield loss based on the flood's timing relative to the rice growth cycle, providing farmers and disaster managers with actionable information to help mitigate economic devastation.

Performance and Limitations

The ensemble model, which combines Random Forest and XGBoost algorithms, achieved an 89.6% accuracy rate and an AUC-ROC score of 0.943 on 77 real-world flood events. While these results are strong, the authors note several limitations. The system currently struggles to predict floods driven primarily by extreme rainfall when upstream river levels are low. Furthermore, the crop damage estimates carry a significant margin of uncertainty (±25–40%) and are intended for planning purposes rather than as precise insurance-grade assessments. The team plans to refine these estimates and improve validation in future versions of the system.

HaorFloodAlert: Deseasonalized ML Ensemble for 72-H... | AI Research

Key Takeaways

Correcting for Seasonal Bias

Integrating Satellite and Upstream Data

Operational Alerts and Crop Protection

Performance and Limitations

Comments (0)

No comments yet