Abstract
With the widespread adoption of machine learning in recent times, numerous practical and theoretical studies have been conducted to enable machines to learn rare events. Making successful predictions for the minority class in imbalanced datasets has become increasingly crucial. We propose a new method, Genetic Dual Borderline SMOTE, to improve prediction accuracy for imbalanced datasets. The steps of the newly developed SMOTE method, along with its performance, have been compared with frequently used SMOTE, Borderline SMOTE, and KMeans SMOTE methods across eight datasets and four different machine learning algorithms. We used F-1 score of the minority class as the metric for performance evaluation and comparison. Various parameter combinations have been tested for each machine learning model and SMOTE method, and the parameters yielding the best F1 score for each model and SMOTE pair have been used. Our results show that the Genetic Dual Borderline SMOTE method outperforms other SMOTE methods, providing more successful outcomes.
Original language | English |
---|---|
Title of host publication | Intelligent and Fuzzy Systems - Intelligent Industrial Informatics and Efficient Networks Proceedings of the INFUS 2024 Conference |
Editors | Cengiz Kahraman, Sezi Cevik Onar, Basar Oztaysi, Irem Ucal Sari, Selcuk Cebi, A. Cagri Tolga |
Publisher | Springer Science and Business Media Deutschland GmbH |
Pages | 347-354 |
Number of pages | 8 |
ISBN (Print) | 9783031671944 |
DOIs | |
Publication status | Published - 2024 |
Event | International Conference on Intelligent and Fuzzy Systems, INFUS 2024 - Canakkale, Turkey Duration: 16 Jul 2024 → 18 Jul 2024 |
Publication series
Name | Lecture Notes in Networks and Systems |
---|---|
Volume | 1089 LNNS |
ISSN (Print) | 2367-3370 |
ISSN (Electronic) | 2367-3389 |
Conference
Conference | International Conference on Intelligent and Fuzzy Systems, INFUS 2024 |
---|---|
Country/Territory | Turkey |
City | Canakkale |
Period | 16/07/24 → 18/07/24 |
Bibliographical note
Publisher Copyright:© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.
Keywords
- Imbalanced Dataset
- SMOTE