Comparative Effectiveness of Classification Algorithms in Predicting Diabetes

Fares A. Dael, D. Mareyev, Ibraheem Shayea, S. Kulniyazova Korlan, Gulnara Abitova

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Diabetes mellitus poses a significant global health challenge, with increasing prevalence, particularly in low socioeconomic regions. Accurate and early diagnosis is crucial to prevent the severe long-term complications associated with diabetes. This study conducts a comprehensive comparison of six prominent machine learning algorithms-K-Nearest Neighbors (K-NN), Naive Bayes, Support Vector Machine (SVM), Decision Trees, Random Forest, and Logistic Regression-in predicting diabetes using a dataset of 768 individuals with diverse diabetic indicators from Kaggle. Each algorithm is rigorously evaluated based on precision, recall, and F1-score to determine the most effective method for diabetes diagnosis. The results indicate that Logistic Regression outperforms the other algorithms, achieving an accuracy of 81%. This superior performance is attributed to Logistic Regression's ability to effectively delineate linear separations, which is crucial for distinguishing between diabetic and non-diabetic individuals. The study underscores the importance of feature selection and model tuning in enhancing predictive performance. The findings suggest that integrating Logistic Regression into clinical settings can significantly improve the accuracy and timeliness of diabetes diagnosis, potentially leading to better patient outcomes and reduced healthcare costs.

Original languageEnglish
Title of host publicationProceedings - 2024 IEEE 16th International Conference on Communication Systems and Network Technologies, CICN 2024
EditorsGeetam Singh Tomar
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1371-1378
Number of pages8
ISBN (Electronic)9798331505264
DOIs
Publication statusPublished - 2024
Event16th IEEE International Conference on Computational Intelligence and Communication Networks, CICN 2024 - Indore, India
Duration: 22 Dec 202423 Dec 2024

Publication series

NameProceedings - 2024 IEEE 16th International Conference on Communication Systems and Network Technologies, CICN 2024

Conference

Conference16th IEEE International Conference on Computational Intelligence and Communication Networks, CICN 2024
Country/TerritoryIndia
CityIndore
Period22/12/2423/12/24

Bibliographical note

Publisher Copyright:
© 2024 IEEE.

Keywords

  • Decision Trees
  • Diabetes Diagnosis
  • K-Nearest Neighbors
  • Logistic Regression
  • Machine Learning
  • Naive Bayes
  • Random Forest
  • Support Vector Machine

Fingerprint

Dive into the research topics of 'Comparative Effectiveness of Classification Algorithms in Predicting Diabetes'. Together they form a unique fingerprint.

Cite this