Speaker emotional state classification by DPM models with annealed SMC samplers

Bilge Gunsel, Ozgun Cirakman, Jarek Krajewski

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

We propose a speaker emotional state classification method that employs inference-based Bayesian networks to learn posterior density of emotional speech sequentially. We aim to alleviate difficulty in detecting medium-term states where the required monitoring time is longer compared to short-term emotional states that makes temporal content representation harder. Our inference algorithm takes advantage of the Sequential Monte Carlo (SMC) sampling and recursively approximates the Dirichlet Process Mixtures (DPM) model of the speaker state class density with unknown number of components. After learning the target posterior, classification of speaker states has been performed by a simple minimum distance classifier. Test results obtained on two different datasets demonstrate the proposed method highly reduces the training data length while providing comparable accuracy compared to the existing state-of-the-art techniques.

Original languageEnglish
Title of host publication2015 23rd European Signal Processing Conference, EUSIPCO 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages120-124
Number of pages5
ISBN (Electronic)9780992862633
DOIs
Publication statusPublished - 22 Dec 2015
Event23rd European Signal Processing Conference, EUSIPCO 2015 - Nice, France
Duration: 31 Aug 20154 Sept 2015

Publication series

Name2015 23rd European Signal Processing Conference, EUSIPCO 2015

Conference

Conference23rd European Signal Processing Conference, EUSIPCO 2015
Country/TerritoryFrance
CityNice
Period31/08/154/09/15

Bibliographical note

Publisher Copyright:
© 2015 EURASIP.

Keywords

  • Dirichlet Process Mixtures model
  • emotion classification
  • Graphical models
  • HCI
  • perceptual audio features

Fingerprint

Dive into the research topics of 'Speaker emotional state classification by DPM models with annealed SMC samplers'. Together they form a unique fingerprint.

Cite this