Abstract
Cardiovascular diseases are one of the most common causes of death in the world. At this point, early diagnosis of heart diseases is critically important. The aim of this study is to predict the heart disease using feature selection, classification and optimization algorithms. Firstly, principal component analysis (PCA) is used to create the feature selection model and to determine the effective attributes. Then, Extreme Gradient Boosting (XGBoost) classification model is proposed to predict the heart disease. Finally, genetic algorithm (GA) is applied to optimize the parameters of XGBoost to improve the classification accuracy. The developed hybrid PCA-XGBoost-GA approach is compared with XGBoost, PCA-XGBoost, XGBoost-GA, artificial neural network (ANN) and support vector machine (SVM). The effectiveness of these approaches is illustrated with a case study with the actual data taken from a university hospital in Turkey. The numerical results show that the proposed PCA-XGBoost-GA model outperforms the other classification models in terms of accuracy rate, recall, precision and F-measure. Moreover, feature selection and parameter optimization improve the classification performance of the XGBoost model.
Original language | English |
---|---|
Article number | 2340009 |
Journal | International Journal on Artificial Intelligence Tools |
Volume | 32 |
Issue number | 2 |
DOIs | |
Publication status | Published - 1 Mar 2023 |
Bibliographical note
Publisher Copyright:© 2023 World Scientific Publishing Company.
Keywords
- Heart disease diagnosis
- classification
- extreme gradient boosting
- genetic algorithm
- principal component analysis