End-to-end Automatic Music Transcription of Polyphonic Music Using Convolutional Neural Networks

Emin Germen*, Can Karadoǧan

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper presents an automatic music transcription model based on Convolutional Neural Networks (CNNs) that mimics the 'trained ear' in music recognition. The approach pushes forward the fields of signal processing and music technology, with a focus on multi-instrument transcription featuring traditional Turkish instruments like the Qanun and Oud, known for their distinct timbral qualities and early decay characteristics. The study involves creating multipitch datasets from very basic combinations, training the CNN on this data, and achieving high transcription accuracy considering the F1 scores for two-part compositions. The training process equips the model to understand the fundamental traits of individual instruments, enabling it to identify and separate complex patterns in mixed audio. The aim is to enhance the model's ability to distinguish and analyze specific musical elements, supporting applications in music production, audio engineering, and music education.

Original languageEnglish
Title of host publication8th International Symposium on Innovative Approaches in Smart Technologies, ISAS 2024 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331540104
DOIs
Publication statusPublished - 2024
Event8th International Symposium on Innovative Approaches in Smart Technologies, ISAS 2024 - Istanbul, Turkey
Duration: 6 Dec 20247 Dec 2024

Publication series

Name8th International Symposium on Innovative Approaches in Smart Technologies, ISAS 2024 - Proceedings

Conference

Conference8th International Symposium on Innovative Approaches in Smart Technologies, ISAS 2024
Country/TerritoryTurkey
CityIstanbul
Period6/12/247/12/24

Bibliographical note

Publisher Copyright:
© 2024 IEEE.

Keywords

  • constant Q transform
  • convolutional neural network
  • Music transcription
  • signal processing

Fingerprint

Dive into the research topics of 'End-to-end Automatic Music Transcription of Polyphonic Music Using Convolutional Neural Networks'. Together they form a unique fingerprint.

Cite this