Ana gezinime geç Aramaya geç Ana içeriğe geç

Audio Data Analysis and Music Genre Classification with Various Machine Learning Techniques

  • Arda Deniz*
  • , Bilal Saoud
  • , Ibraheem Shayea
  • , Shambulov Ulykbek
  • , Abilkair Imanberdi
  • , Fuad Abdulgaleel Abdoh Ghaleb
  • *Bu çalışma için yazışmadan sorumlu yazar

Araştırma sonucu: Kitap/Rapor/Konferans Bildirisinde BölümKonferans katkısıbilirkişi

Özet

Music genre classification presents several challenges, including high-dimensional audio data, overlapping genre characteristics, and subjective labeling. Although machine learning has shown promise in addressing these challenges, many studies either focus on a narrow range of models or lack comparative performance insights. In this study, we perform a comprehensive evaluation of ten machine learning algorithms for music genre classification using the widely adopted GTZAN dataset. The models include Naïve Bayes, Support Vector Machines, K-Nearest Neighbors (KNN), Random Forest, Neural Networks, and Extreme Gradient Boosting (XGBoost), among others. Our results demonstrate that XGBoost achieved the highest accuracy at 90.09%, outperforming Random Forest (81.42%) and KNN (80.58%) by a significant margin. Feature importance analysis highlights that percussive, harmonic, and spectral features contribute most substantially to genre discrimination. Notably, XGBoost's superior performance suggests its ability to effectively capture non-linear patterns in musical features, offering a strong alternative to traditional classifiers. This work contributes a broad comparative analysis and emphasizes the effectiveness of ensemble-based approaches in music genre classification, providing valuable insights for future research and practical applications in music information retrieval.

Orijinal dilİngilizce
Ana bilgisayar yayını başlığıProceedings - 29th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, SNPD 2025-Summer
EditörlerHyun Yoe, Ha Jin Hwang, Meonghun Lee, Rackwoo Kim, Ryugap Lim, Sungtaek Lee, Seaeul Kim, Simon Xu, Miguel Garcia-Ruiz, Wenyin Feng, A B M Bodrul Alam, Randy Lin, Ajmery Sultana, Faria Khandaker, Mahreen Nasir, Ken Higuchi, Shinichiro Mori, Teruhisa Hochin
YayınlayanInstitute of Electrical and Electronics Engineers Inc.
Sayfalar219-224
Sayfa sayısı6
ISBN (Elektronik)9798331512583
DOI'lar
Yayın durumuYayınlandı - 2025
Etkinlik29th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, SNPD 2025-Summer - Busa, Korea, Republic of
Süre: 25 Haz 202527 Haz 2025

Yayın serisi

AdıProceedings - 29th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, SNPD 2025-Summer

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???29th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, SNPD 2025-Summer
Ülke/BölgeKorea, Republic of
ŞehirBusa
Periyot25/06/2527/06/25

Bibliyografik not

Publisher Copyright:
© 2025 IEEE.

Parmak izi

Audio Data Analysis and Music Genre Classification with Various Machine Learning Techniques' araştırma başlıklarına git. Birlikte benzersiz bir parmak izi oluştururlar.

Alıntı Yap