Analyzing Susceptibility to Tornado-Induced Injuries Using Hybrid Tree-Based Classifiers and Advanced Resampling Techniques

PDF

  • Tornadoes are among the most catastrophic atmospheric phenomena, causing extensive damage, injuries, and fatalities. This study aims to predict tornado-induced injuries in Texas of U.S. by employing machine learning (ML) approaches, including random forest (RF), adaptive boosting (AdaBoost), extreme gradient boosting (XGBoost), and support vector machines (SVM) in conjunction with advanced resampling methods to address the class imbalance. Accordingly, the present research utilized a total of 2986 property location data points from tornadoes recorded from 2000 to 2023, and took meteorological, socioeconomic, and geographical features into account to perdict the tornado-induced injuries. Three undersampling techniques, i.e., random undersampling methods (RUS), condensed nearest neighbor (CNN), and instance hardness threshold (IHT), and three oversampling ones, i.e., synthetic minority oversampling technique (SMOTE), random oversampling methods (ROS), and adaptive synthetic sampling (ADASYN), were tested to enhance models’ predictive accuracy. The validation of these hybrid models demonstrated that the RF-IHT outperformed its counterparts, achieving satisfactory outcomes in identifying tornado-induced injuries with an F1-score of 0.965, precision of 0.967, and recall of 0.967. The interpretability of the estimations was enhanced through the local interpretable model-agnostic explanations (LIME), which further identified tornado intensity, land use, and socioeconomic attributes as the most critical predictors. The susceptibility map developed in this study serves as a crucial decision-support mechanism for disaster risk management strategies, aiming to mitigate tornado-related injuries, and thereby enhancing community resilience to extreme weather phenomena.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return