Perceptually weighted Non-negative Matrix Factorization for blind single-channel music source separation

S. Kirbiz*, B. Gunsel

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Citations (Scopus)

Abstract

We propose a blind single-channel musical source separation method that improves perceptual quality of the separated sources. It uses the advantages of subspace learning based on Non-negative Matrix Factor 2-D Deconvolution (NMF2D). To improve the perceptual quality of separation, we propose a weighted divergence type cost function for the optimization that adopts the auditory model defined in ITU-R BS.1387 into the source separation. It is shown that the proposed perceptually weighted NMF2D scheme efficiently clusters the bases of subspace representation corresponding to notes generated by single instruments. Source separation performance has been reported on musical mixtures resulting an improvement in perceptual quality measures.

Original languageEnglish
Title of host publicationICPR 2012 - 21st International Conference on Pattern Recognition
Pages226-229
Number of pages4
Publication statusPublished - 2012
Event21st International Conference on Pattern Recognition, ICPR 2012 - Tsukuba, Japan
Duration: 11 Nov 201215 Nov 2012

Publication series

NameProceedings - International Conference on Pattern Recognition
ISSN (Print)1051-4651

Conference

Conference21st International Conference on Pattern Recognition, ICPR 2012
Country/TerritoryJapan
CityTsukuba
Period11/11/1215/11/12

Fingerprint

Dive into the research topics of 'Perceptually weighted Non-negative Matrix Factorization for blind single-channel music source separation'. Together they form a unique fingerprint.

Cite this