A perceptually enhanced blind single-channel audio source separation by non-negative matrix factorization

S. Kirbiz*, B. Günsel

*Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

4 Citations (Scopus)

Abstract

This paper proposes a 2D Non-negative Matrix Factorization (NMF) based single-channel source separation algorithm that emphasizes perceptually important components of audio. Unlike the existing methods, the proposed scheme performs a psychoacoustic pre-processing on the mixture spectrogram in order to supress audio components that are not critical to human hearing sensation while amplifying the perceptually important ones. This yields the auditory spectrogram referred as sonogram of the observed audio mixture and the individual sources are then extracted by 2D NMF. Test results reported in terms of Signal-to-Distortion-Ratio (SDR), Signalto- Inference-Ratio (SIR) and Signal-to-Artifact-Ratio (SAR) show that the proposed perceptually enhanced separation improves the quality of decomposed audio sources by 1.5-6.5 dB with a reduced computational complexity.

Original languageEnglish
Pages (from-to)731-735
Number of pages5
JournalEuropean Signal Processing Conference
Publication statusPublished - 2010
Event18th European Signal Processing Conference, EUSIPCO 2010 - Aalborg, Denmark
Duration: 23 Aug 201027 Aug 2010

Fingerprint

Dive into the research topics of 'A perceptually enhanced blind single-channel audio source separation by non-negative matrix factorization'. Together they form a unique fingerprint.

Cite this