TY - GEN
T1 - NTF ile gözü kapali kaynak ayriştirma başariminin algisal ses kalite kriterlerine göre analizi
AU - Keyder, M. Altuǧ
AU - Günsel, Bilge
PY - 2008
Y1 - 2008
N2 - In this paper, the audio blind source separation (BSS) using three dimensional Nonnegative Tensor Factorization (3DNTF), is realized. The audio source separation is modeled as an optimization problem and the ß-divergence cost function is iteratively optimized by alternating multiplicative update rules. The traditional measures which are used to evaluate the decomposition performance are known to be not informative about perceptual quality of the audio signals. Therefore performance of the designed system is evaluated not only with the well known Amari index, but also with perceptual audio quality criterions which are defined in the recommendation report, ITU-R BS.1387 of International Telecommunication Union (ITU). In this study, it has been shown that source decompositon performance of the NTF modelling on audio data mixed under different conditions, is superior to the Nonnegative Matrix Factorization (NMF). Furthermore, it has been observed that some of the decomposed sources are acceptable according to Amari index while thay are not with respect to the perceptual quality citeria thus it can be concluded that the perceptual criteria is more suitable to objective quality evaluation of audio.
AB - In this paper, the audio blind source separation (BSS) using three dimensional Nonnegative Tensor Factorization (3DNTF), is realized. The audio source separation is modeled as an optimization problem and the ß-divergence cost function is iteratively optimized by alternating multiplicative update rules. The traditional measures which are used to evaluate the decomposition performance are known to be not informative about perceptual quality of the audio signals. Therefore performance of the designed system is evaluated not only with the well known Amari index, but also with perceptual audio quality criterions which are defined in the recommendation report, ITU-R BS.1387 of International Telecommunication Union (ITU). In this study, it has been shown that source decompositon performance of the NTF modelling on audio data mixed under different conditions, is superior to the Nonnegative Matrix Factorization (NMF). Furthermore, it has been observed that some of the decomposed sources are acceptable according to Amari index while thay are not with respect to the perceptual quality citeria thus it can be concluded that the perceptual criteria is more suitable to objective quality evaluation of audio.
UR - http://www.scopus.com/inward/record.url?scp=56449109364&partnerID=8YFLogxK
U2 - 10.1109/SIU.2008.4632692
DO - 10.1109/SIU.2008.4632692
M3 - Konferans katkısı
AN - SCOPUS:56449109364
SN - 9781424419999
T3 - 2008 IEEE 16th Signal Processing, Communication and Applications Conference, SIU
BT - 2008 IEEE 16th Signal Processing, Communication and Applications Conference, SIU
T2 - 2008 IEEE 16th Signal Processing, Communication and Applications Conference, SIU
Y2 - 20 April 2008 through 22 April 2008
ER -