Abstract
Music genre classification presents several challenges, including high-dimensional audio data, overlapping genre characteristics, and subjective labeling. Although machine learning has shown promise in addressing these challenges, many studies either focus on a narrow range of models or lack comparative performance insights. In this study, we perform a comprehensive evaluation of ten machine learning algorithms for music genre classification using the widely adopted GTZAN dataset. The models include Naïve Bayes, Support Vector Machines, K-Nearest Neighbors (KNN), Random Forest, Neural Networks, and Extreme Gradient Boosting (XGBoost), among others. Our results demonstrate that XGBoost achieved the highest accuracy at 90.09%, outperforming Random Forest (81.42%) and KNN (80.58%) by a significant margin. Feature importance analysis highlights that percussive, harmonic, and spectral features contribute most substantially to genre discrimination. Notably, XGBoost's superior performance suggests its ability to effectively capture non-linear patterns in musical features, offering a strong alternative to traditional classifiers. This work contributes a broad comparative analysis and emphasizes the effectiveness of ensemble-based approaches in music genre classification, providing valuable insights for future research and practical applications in music information retrieval.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 29th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, SNPD 2025-Summer |
| Editors | Hyun Yoe, Ha Jin Hwang, Meonghun Lee, Rackwoo Kim, Ryugap Lim, Sungtaek Lee, Seaeul Kim, Simon Xu, Miguel Garcia-Ruiz, Wenyin Feng, A B M Bodrul Alam, Randy Lin, Ajmery Sultana, Faria Khandaker, Mahreen Nasir, Ken Higuchi, Shinichiro Mori, Teruhisa Hochin |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 219-224 |
| Number of pages | 6 |
| ISBN (Electronic) | 9798331512583 |
| DOIs | |
| Publication status | Published - 2025 |
| Event | 29th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, SNPD 2025-Summer - Busa, Korea, Republic of Duration: 25 Jun 2025 → 27 Jun 2025 |
Publication series
| Name | Proceedings - 29th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, SNPD 2025-Summer |
|---|
Conference
| Conference | 29th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, SNPD 2025-Summer |
|---|---|
| Country/Territory | Korea, Republic of |
| City | Busa |
| Period | 25/06/25 → 27/06/25 |
Bibliographical note
Publisher Copyright:© 2025 IEEE.
Keywords
- Feature Importance
- GTZAN Dataset
- Machine Learning
- Music Genre Classification
- Signal Processing
Fingerprint
Dive into the research topics of 'Audio Data Analysis and Music Genre Classification with Various Machine Learning Techniques'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver