Skip to main navigation Skip to search Skip to main content

Audio Data Analysis and Music Genre Classification with Various Machine Learning Techniques

  • Arda Deniz*
  • , Bilal Saoud
  • , Ibraheem Shayea
  • , Shambulov Ulykbek
  • , Abilkair Imanberdi
  • , Fuad Abdulgaleel Abdoh Ghaleb
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Music genre classification presents several challenges, including high-dimensional audio data, overlapping genre characteristics, and subjective labeling. Although machine learning has shown promise in addressing these challenges, many studies either focus on a narrow range of models or lack comparative performance insights. In this study, we perform a comprehensive evaluation of ten machine learning algorithms for music genre classification using the widely adopted GTZAN dataset. The models include Naïve Bayes, Support Vector Machines, K-Nearest Neighbors (KNN), Random Forest, Neural Networks, and Extreme Gradient Boosting (XGBoost), among others. Our results demonstrate that XGBoost achieved the highest accuracy at 90.09%, outperforming Random Forest (81.42%) and KNN (80.58%) by a significant margin. Feature importance analysis highlights that percussive, harmonic, and spectral features contribute most substantially to genre discrimination. Notably, XGBoost's superior performance suggests its ability to effectively capture non-linear patterns in musical features, offering a strong alternative to traditional classifiers. This work contributes a broad comparative analysis and emphasizes the effectiveness of ensemble-based approaches in music genre classification, providing valuable insights for future research and practical applications in music information retrieval.

Original languageEnglish
Title of host publicationProceedings - 29th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, SNPD 2025-Summer
EditorsHyun Yoe, Ha Jin Hwang, Meonghun Lee, Rackwoo Kim, Ryugap Lim, Sungtaek Lee, Seaeul Kim, Simon Xu, Miguel Garcia-Ruiz, Wenyin Feng, A B M Bodrul Alam, Randy Lin, Ajmery Sultana, Faria Khandaker, Mahreen Nasir, Ken Higuchi, Shinichiro Mori, Teruhisa Hochin
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages219-224
Number of pages6
ISBN (Electronic)9798331512583
DOIs
Publication statusPublished - 2025
Event29th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, SNPD 2025-Summer - Busa, Korea, Republic of
Duration: 25 Jun 202527 Jun 2025

Publication series

NameProceedings - 29th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, SNPD 2025-Summer

Conference

Conference29th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, SNPD 2025-Summer
Country/TerritoryKorea, Republic of
CityBusa
Period25/06/2527/06/25

Bibliographical note

Publisher Copyright:
© 2025 IEEE.

Keywords

  • Feature Importance
  • GTZAN Dataset
  • Machine Learning
  • Music Genre Classification
  • Signal Processing

Fingerprint

Dive into the research topics of 'Audio Data Analysis and Music Genre Classification with Various Machine Learning Techniques'. Together they form a unique fingerprint.

Cite this