Analyzing Susceptibility to Tornado-Induced Injuries Using Hybrid Tree-Based Classifiers and Advanced Resampling Techniques

Research output: Contribution to journalArticlepeer-review

Abstract

Tornadoes are among the most catastrophic atmospheric phenomena, causing extensive damage, injuries, and fatalities. This study aims to predict tornado-induced injuries in Texas of U.S. by employing machine learning (ML) approaches, including random forest (RF), adaptive boosting (AdaBoost), extreme gradient boosting (XGBoost), and support vector machines (SVM) in conjunction with advanced resampling methods to address the class imbalance. Accordingly, the present research utilized a total of 2986 property location data points from tornadoes recorded from 2000 to 2023, and took meteorological, socioeconomic, and geographical features into account to perdict the tornado-induced injuries. Three undersampling techniques, i.e., random undersampling methods (RUS), condensed nearest neighbor (CNN), and instance hardness threshold (IHT), and three oversampling ones, i.e., synthetic minority oversampling technique (SMOTE), random oversampling methods (ROS), and adaptive synthetic sampling (ADASYN), were tested to enhance models’ predictive accuracy. The validation of these hybrid models demonstrated that the RF-IHT outperformed its counterparts, achieving satisfactory outcomes in identifying tornado-induced injuries with an F1-score of 0.965, precision of 0.967, and recall of 0.967. The interpretability of the estimations was enhanced through the local interpretable model-agnostic explanations (LIME), which further identified tornado intensity, land use, and socioeconomic attributes as the most critical predictors. The susceptibility map developed in this study serves as a crucial decision-support mechanism for disaster risk management strategies, aiming to mitigate tornado-related injuries, and thereby enhancing community resilience to extreme weather phenomena.

Original languageEnglish
Pages (from-to)1489-1509
Number of pages21
JournalJournal of Meteorological Research
Volume39
Issue number6
DOIs
Publication statusPublished - Dec 2025

Bibliographical note

Publisher Copyright:
© The Chinese Meteorological Society 2025.

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 11 - Sustainable Cities and Communities
    SDG 11 Sustainable Cities and Communities

Keywords

  • disaster
  • extreme weather phenomenon
  • interpretable artificial intelligence
  • machine learning
  • resampling technique
  • tornado susceptibility

Fingerprint

Dive into the research topics of 'Analyzing Susceptibility to Tornado-Induced Injuries Using Hybrid Tree-Based Classifiers and Advanced Resampling Techniques'. Together they form a unique fingerprint.

Cite this