Ego noise cancellation of a robot using missing feature masks

Gökhan Ince*, Kazuhiro Nakadai, Tobias Rodemann, Hiroshi Tsujino, Jun Ichi Imura

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

4 Citations (Scopus)

Abstract

We describe an architecture that gives a robot the capability to recognize speech by cancelling ego noise, even while the robot is moving. The system consists of three blocks: (1) a multi-channel noise reduction block, comprising consequent stages of microphone-array-based sound localization, geometric source separation and post-filtering; (2) a single-channel noise reduction block utilizing template subtraction; and (3) an automatic speech recognition block. In this work, we specifically investigate a missing feature theory-based automatic speech recognition (MFT-ASR) approach in block (3). This approach makes use of spectro-temporal elements derived from (1) and (2) to measure the reliability of the acoustic features, and generates masks to filter unreliable acoustic features. We then evaluated this system on a robot using word correct rates. Furthermore, we present a detailed analysis of recognition accuracy to determine optimal parameters. Implementation of the proposed MFT-ASR approach resulted in significantly higher recognition performance than single or multi-channel noise reduction methods.

Original languageEnglish
Pages (from-to)360-371
Number of pages12
JournalApplied Intelligence
Volume34
Issue number3
DOIs
Publication statusPublished - Jun 2011
Externally publishedYes

Funding

FundersFunder number
Japan Society for the Promotion of Science22700165

    Keywords

    • Automatic speech recognition
    • Ego noise
    • Microphone array
    • Missing feature theory
    • Noise reduction
    • Robot audition

    Fingerprint

    Dive into the research topics of 'Ego noise cancellation of a robot using missing feature masks'. Together they form a unique fingerprint.

    Cite this