ITU MSPR TRECVID 2010 video copy detection system

Sezer Kutluk, Bilge Gunsel

Research output: Contribution to conferencePaperpeer-review

1 Citation (Scopus)

Abstract

In this paper we describe the system designed by the ITU MSPR Group for content based video fingerprinting as applied to the TRECVID 2010 Content Based Copy Detection (CBCD) benchmark. This year focus of the system was on integration of audio and video fingerprinting to improve the robustness to attacks. The proposed system consists of three main modules: Audio/video fingerprint extraction, audio/video search and retrieval, and audiovisual decision fusion. We propose a video feature extraction scheme based on the Nonnegative Matrix Factorization (NMF) which is an efficient dimension reduction technique in video processing. Video fingerprint generation module takes the factorization matrices generated by NMF as its input and converts them to binary hashes by differencial coding [1, 2]. For audio data we perform an audio fingerprinting method that is similar to the one proposed in [3]. Extracted audio and video hashes are indexed into a database. Searching module first applies a hash matching procedure to locate potential matching points both in audio and video. This is followed by decision fusion that eliminates false alarms and finalizes the matching and retrieval.

Original languageEnglish
Publication statusPublished - 2010
EventTREC Video Retrieval Evaluation, TRECVID 2010 - Gaithersburg, MD, United States
Duration: 15 Nov 201017 Nov 2010

Conference

ConferenceTREC Video Retrieval Evaluation, TRECVID 2010
Country/TerritoryUnited States
CityGaithersburg, MD
Period15/11/1017/11/10

Fingerprint

Dive into the research topics of 'ITU MSPR TRECVID 2010 video copy detection system'. Together they form a unique fingerprint.

Cite this