An Explainable Machine Learning Framework for Forecasting Lake Water Equivalent Using Satellite Data: A 20-Year Analysis of the Urmia Lake Basin

Sara Habibi*, Saeed Tasouji Hassanpour

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)

Abstract

This study presents an explainable machine learning framework to forecast groundwater storage dynamics, quantified as the Lake Water Equivalent (LWE), in the Urmia Lake Basin from 2003 to 2023. Satellite-based observations (GRACE, GLDAS) and climatic variables were integrated to model LWE variability. An ensemble learning approach was employed, combining Ridge Regression and Random Forest enhanced through feature re-weighting based on XGBoost-derived importance scores. Model interpretability was addressed using SHapley Additive exPlanations (SHAP), offering transparent insights into the contributions of climatic drivers. Results demonstrated that the Random Forest model achieved superior performance (RMSE = 3.27; (Formula presented.) = 0.89), with SHAP analysis highlighting the dominant influence of recent LWE values, temperature, and soil moisture. The proposed framework outperformed baseline models including Persistence, Standard Ridge Regression, and XGBoost in terms of both accuracy and explainability. The objectives of this study are (i) to forecast the LWE in the Urmia Lake Basin using an ensemble-based machine learning framework, (ii) to enhance predictive modeling through XGBoost-guided feature weighting, and (iii) to improve model transparency and interpretation using SHAP-based explainability techniques. By integrating ensemble learning with explainable AI, this work advances the transparent data-driven forecasting essential for sustainable groundwater management under climatic uncertainty.

Original languageEnglish
Article number1431
JournalWater (Switzerland)
Volume17
Issue number10
DOIs
Publication statusPublished - May 2025

Bibliographical note

Publisher Copyright:
© 2025 by the authors.

Keywords

  • Mann–Kendall test
  • Urmia Lake
  • groundwater storage
  • machine learning
  • remote sensing
  • time series analysis

Fingerprint

Dive into the research topics of 'An Explainable Machine Learning Framework for Forecasting Lake Water Equivalent Using Satellite Data: A 20-Year Analysis of the Urmia Lake Basin'. Together they form a unique fingerprint.

Cite this