Robot audition for dynamic environments

Kazuhiro Nakadai*, Gokhan Ince, Keisuke Nakamura, Hirofumi Nakajima

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

28 Citations (Scopus)

Abstract

This paper addresses robot audition for dynamic environments, where speakers and/or a robot is moving within a dynamically-changing acoustic environment. Robot Audition studied so far assumed only stationary human-robot interaction scenes, and thus they have difficulties in coping with such dynamic environments. We recently developed new techniques for a robot to listen to several things simultaneously using its own ears even in dynamic environments; MUltiple SIgnal Classification based on Generalized Eigen-Value Decomposition (GEVD-MUSIC), Geometrically constrained High-order Decorrelation based Source Separation with Adaptive Step-size control (GHDSS-AS), Histogram-based Recursive Level Estimation (HRLE), and Template-based Ego Noise Suppression (TENS). GEVD-MUSIC provides noise-robust sound source localization. GHDSS-AS is a new sound source separation method which quickly adapts its sound source separation parameters to dynamic changes. HRLE is a practical post-filtering method with a small number of parameters. ENS estimates the motor noise of the robot by using templates recorded in advance and eliminates it. These methods are implemented as modules for our open-source robot audition software HARK to be easily integrated. We show that each of these methods and their combinations are effective to cope with dynamic environments through off-line experiments and on-line real-time demonstrations.

Original languageEnglish
Title of host publication2012 IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2012
Pages125-130
Number of pages6
DOIs
Publication statusPublished - 2012
Externally publishedYes
Event2012 2nd IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2012 - Hong Kong, China
Duration: 12 Aug 201215 Aug 2012

Publication series

Name2012 IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2012

Conference

Conference2012 2nd IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2012
Country/TerritoryChina
CityHong Kong
Period12/08/1215/08/12

Keywords

  • Dynamic environment
  • Ego noise suppression
  • Microphone array
  • Robot audition
  • Sound source localization
  • Sound source separation

Fingerprint

Dive into the research topics of 'Robot audition for dynamic environments'. Together they form a unique fingerprint.

Cite this