Ana gezinime geç Aramaya geç Ana içeriğe geç

An audio-visual particle filter for speaker tracking on the CLEAR'06 evaluation dataset

  • Kai Nickel*
  • , Tobias Gehrig
  • , Hazim K. Ekenel
  • , John McDonough
  • , Rainer Stiefelhagen
  • *Bu çalışma için yazışmadan sorumlu yazar

Araştırma sonucu: Kitap/Rapor/Konferans Bildirisinde BölümKonferans katkısıbilirkişi

6 Atıf (Scopus)

Özet

We present an approach for tracking a lecturer during the course of his speech. We use features from multiple cameras and microphones, and process them in a joint particle filter framework. The filter performs sampled projections of 3D location hypotheses and scores them using features from both audio and video. On the video side, the features are based on foreground segmentation, multi-view face detection and upper body detection. On the audio side, the time delays of arrival between pairs of microphones are estimated with a generalized cross correlation function. In the CLEAR'06 evaluation, the system yielded a tracking accuracy (MOTA) of 71% for video-only, 55% for audio-only and 90% for combined audio-visual tracking.

Orijinal dilİngilizce
Ana bilgisayar yayını başlığıMultimodal Technologies for Perception of Humans - First International Evaluation Workshop on Classification of Events, Activities and Relationships, CLEAR 2006 Revised Selected Papers
YayınlayanSpringer Verlag
Sayfalar69-80
Sayfa sayısı12
ISBN (Basılı)9783540695677
DOI'lar
Yayın durumuYayınlandı - 2007
Harici olarak yayınlandıEvet
Etkinlik1st International Evaluation Workshop on Classification of Events, Activities and Relationships, CLEAR 2006 - Southhampton, United Kingdom
Süre: 6 Nis 20067 Nis 2006

Yayın serisi

AdıLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Hacim4122 LNCS
ISSN (Basılı)0302-9743
ISSN (Elektronik)1611-3349

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???1st International Evaluation Workshop on Classification of Events, Activities and Relationships, CLEAR 2006
Ülke/BölgeUnited Kingdom
ŞehirSouthhampton
Periyot6/04/067/04/06

Parmak izi

An audio-visual particle filter for speaker tracking on the CLEAR'06 evaluation dataset' araştırma başlıklarına git. Birlikte benzersiz bir parmak izi oluştururlar.

Alıntı Yap