Assessment of soil salinity using explainable machine learning methods and Landsat 8 images

Samet Aksoy, Elif Sertel*, Ribana Roscher, Aysegul Tanik, Nikou Hamzehpour

*Corresponding author for this work

Research output: Contribution to journal, Article, peer-review

Abstract

This study compares the performance of machine learning (ML) algorithms for modeling soil salinity using field-based electrical conductivity (EC) data and Landsat-8 OLI satellite images with derived environmental covariates. We also interpret and explain the ML models, trained with and without over-sampling, using Shapley additive explanations (SHAP) values, an explainable ML approach that has not yet been applied to soil salinity estimation. We investigate two case study areas in the western and southeastern Lake Urmia Playas (LUP) in northwestern Iran. Our study uses 26 environmental covariates; two ML models, namely extreme gradient boosting (XGBoost) and random forest (RF); and two over-sampling methods, namely the synthetic minority over-sampling technique (SMOTE) and random over-sampling (ROS). Results indicate that XGBoost outperforms RF in terms of both R² and RMSE. Visual interpretation of the soil salinity maps also demonstrated the superiority of XGBoost. SMOTE produced better results than both ROS and the test cases without over-sampling. Finally, SHAP analysis showed that vegetation indices contributed more to the soil salinity prediction in the West LUP, while visible bands contributed more in the Southeast LUP region.
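The SHAP analysis mentioned above builds on Shapley values, which split a single prediction into per-feature contributions that sum to the difference between the prediction and a baseline prediction. The following is a minimal sketch of that idea, not the authors' pipeline: the toy model, its coefficients, and the covariate names (`ndvi`, `blue`, `red`) are hypothetical, and the exact enumeration used here is only practical for a handful of features. Studies with tree ensembles and dozens of covariates would normally rely on faster, model-specific algorithms.

```python
from itertools import combinations
from math import factorial


def toy_model(z):
    """Hypothetical stand-in for a trained salinity model (not from the paper).

    z = [ndvi, blue, red]; the covariate names and coefficients are illustrative only.
    """
    ndvi, blue, red = z
    return 2.0 * ndvi + 1.0 * blue + 0.5 * ndvi * red


def shapley_values(predict, x, baseline):
    """Exact Shapley values for one sample x relative to a baseline sample.

    Features outside a coalition are replaced by their baseline values.
    The cost grows as 2^n in the number of features n.
    """
    n = len(x)

    def value(coalition):
        # Prediction when only the features in `coalition` take their real values.
        z = [x[i] if i in coalition else baseline[i] for i in range(n)]
        return predict(z)

    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for k in range(n):
            # Shapley weight for coalitions of size k that exclude feature i.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                s = set(subset)
                # Marginal contribution of feature i when added to coalition s.
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


sample = [1.0, 2.0, 4.0]      # hypothetical covariate values for one pixel
reference = [0.0, 0.0, 0.0]   # hypothetical baseline sample
contributions = shapley_values(toy_model, sample, reference)
print(contributions)
```

In this toy case the interaction term `0.5 * ndvi * red` is shared equally between `ndvi` and `red`, and the contributions add up to the gap between the prediction for `sample` and the prediction for `reference`. Comparing such per-feature contributions across samples is what allows statements like "vegetation indices contributed more in one region than in another."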

Original language: English
Article number: 103879
Journal: International Journal of Applied Earth Observation and Geoinformation
Volume: 130
Publication status: Published - Jun 2024

Bibliographical note

Publisher Copyright:
© 2024

Keywords

  • Google Earth Engine
  • Landsat-8 OLI
  • Machine learning
  • SHAP
  • Soil salinity
  • XAI
