Özet
In this paper we describe the system designed by the ITU MSPR Group for content based video fingerprinting as applied to the TRECVID 2010 Content Based Copy Detection (CBCD) benchmark. This year focus of the system was on integration of audio and video fingerprinting to improve the robustness to attacks. The proposed system consists of three main modules: Audio/video fingerprint extraction, audio/video search and retrieval, and audiovisual decision fusion. We propose a video feature extraction scheme based on the Nonnegative Matrix Factorization (NMF) which is an efficient dimension reduction technique in video processing. Video fingerprint generation module takes the factorization matrices generated by NMF as its input and converts them to binary hashes by differencial coding [1, 2]. For audio data we perform an audio fingerprinting method that is similar to the one proposed in [3]. Extracted audio and video hashes are indexed into a database. Searching module first applies a hash matching procedure to locate potential matching points both in audio and video. This is followed by decision fusion that eliminates false alarms and finalizes the matching and retrieval.
Orijinal dil | İngilizce |
---|---|
Yayın durumu | Yayınlandı - 2010 |
Etkinlik | TREC Video Retrieval Evaluation, TRECVID 2010 - Gaithersburg, MD, United States Süre: 15 Kas 2010 → 17 Kas 2010 |
???event.eventtypes.event.conference???
???event.eventtypes.event.conference??? | TREC Video Retrieval Evaluation, TRECVID 2010 |
---|---|
Ülke/Bölge | United States |
Şehir | Gaithersburg, MD |
Periyot | 15/11/10 → 17/11/10 |