
Intelligent sound source localization and its application to multimodal human tracking

  • Keisuke Nakamura*
  • Kazuhiro Nakadai
  • Futoshi Asano
  • Gökhan Ince
  • *Corresponding author for this work
  • Honda Motor Co., Ltd.

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

60 Citations (Scopus)

Abstract

We have assessed robust tracking of humans based on intelligent Sound Source Localization (SSL) for a robot in a real environment. SSL is fundamental for robot audition, but has three issues in a real environment: robustness against noise with high power, lack of a general framework for selective listening to sound sources, and tracking of inactive and/or noisy sound sources. To address the first issue, we extended Multiple SIgnal Classification (MUSIC) by incorporating Generalized EigenValue Decomposition (GEVD-MUSIC) so that it can deal with high-power noise and can select target sound sources. To address the second issue, we proposed Sound Source Identification (SSI) based on hierarchical Gaussian mixture models and integrated it with GEVD-MUSIC to realize a selective listening function. To address the third issue, we integrated audio-visual human tracking using particle filtering. Integration of these three techniques into an intelligent human tracking system showed: 1) GEVD-MUSIC improved the noise-robustness of SSL by 5-6 dB in signal-to-noise ratio; 2) SSI achieved an F-measure above 70% even in a noisy environment; and 3) audio-visual integration improved the average tracking error by approximately 50%.
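As an illustration of the third point, the following is a minimal sketch of how a particle filter can fuse an audio bearing with a visual bearing, and keep tracking from vision alone while the sound source is inactive. It is not the authors' implementation. The one-dimensional azimuth state, the random-walk motion model, the Gaussian likelihoods, the function names, and all noise parameters are assumptions chosen for illustration, and angle wrap-around is ignored.

```python
import math
import random


def gaussian_likelihood(observed, predicted, sigma):
    """Unnormalized Gaussian likelihood of an observation given a prediction."""
    return math.exp(-0.5 * ((observed - predicted) / sigma) ** 2)


def particle_filter_step(particles, audio_obs, visual_obs, rng,
                         motion_sigma=2.0, audio_sigma=10.0, visual_sigma=3.0):
    """One predict/weight/resample step over azimuth particles (degrees).

    audio_obs or visual_obs may be None when that modality has no
    detection, e.g. when the person is silent (hypothetical convention).
    """
    # Predict: assumed random-walk motion model.
    predicted = [p + rng.gauss(0.0, motion_sigma) for p in particles]

    # Weight: multiply the likelihoods of whichever modalities are available.
    weights = []
    for p in predicted:
        w = 1.0
        if audio_obs is not None:
            w *= gaussian_likelihood(audio_obs, p, audio_sigma)
        if visual_obs is not None:
            w *= gaussian_likelihood(visual_obs, p, visual_sigma)
        weights.append(w)

    # Fall back to uniform weights if every likelihood underflowed to zero.
    if sum(weights) == 0.0:
        weights = [1.0] * len(predicted)

    # Resample with replacement in proportion to the weights.
    resampled = rng.choices(predicted, weights=weights, k=len(predicted))

    # Point estimate: mean azimuth of the resampled particle set.
    estimate = sum(resampled) / len(resampled)
    return resampled, estimate


if __name__ == "__main__":
    rng = random.Random(0)
    particles = [rng.uniform(-90.0, 90.0) for _ in range(500)]
    # Both modalities available, then audio drops out (silent person).
    for audio, visual in [(32.0, 30.0)] * 20 + [(None, 30.0)] * 10:
        particles, estimate = particle_filter_step(particles, audio, visual, rng)
    print(f"estimated azimuth: {estimate:.1f} deg")
```

Multiplying the per-modality likelihoods treats the audio and visual observations as conditionally independent given the person's position, which is a simplifying assumption of this sketch.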

Original language: English
Title of host publication: IROS'11 - 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems
Subtitle of host publication: Celebrating 50 Years of Robotics
Pages: 143-148
Number of pages: 6
DOIs
Publication status: Published - 2011
Externally published: Yes
Event: 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems: Celebrating 50 Years of Robotics, IROS'11 - San Francisco, CA, United States
Duration: 25 Sep 2011 to 30 Sep 2011

Publication series

Name: IEEE International Conference on Intelligent Robots and Systems

Conference

Conference: 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems: Celebrating 50 Years of Robotics, IROS'11
Country/Region: United States
City: San Francisco, CA
Period: 25/09/11 to 30/09/11
