Object-based indexing of MPEG-4 compressed video

Ahmet M. Ferman, Bilge Günsel, A. Murat Tekalp

Research output: Contribution to journalConference articlepeer-review

23 Citations (Scopus)

Abstract

The MPEG-4 object-based coding standard, designed as a common platform for all multimedia applications, is inherently well-suited for video indexing applications. To fully exploit the advantages offered by MPEG-4, however, a reconsideration of existing indexing strategies is required. This paper proposes a new object-based framework for video indexing and retrieval that treats as the basic indexing unit the object itself, where changes in content are detected through observations made on the objects in the video sequence. We present a temporal segmentation algorithm that is designed to automatically extract key frames for each video object in an MPEG-4 compressed sequence based on the prediction model chosen by the encoder for individual macroblocks. An extension to the existing MPEG-4 syntax is presented for conducting and facilitating vast database searches. The data presented in the proposed 'indexing field' are: the birth and death frames of individual objects, global motion characteristics/ camera operations observed in the scene, representative key frames that capture the major transformations each object undergoes, and the dominant motion characteristics of each object throughout its lifetime. We present the validity of the proposed scheme by results obtained on several MPEG-4 test sequences.

Original languageEnglish
Pages (from-to)953-963
Number of pages11
JournalProceedings of SPIE - The International Society for Optical Engineering
Volume3024
DOIs
Publication statusPublished - 1997
Externally publishedYes
EventVisual Communications and Image Processing '97 - San Jose, CA, United States
Duration: 12 Feb 199712 Feb 1997

Keywords

  • Content-based access
  • MPEG-4 coding standard
  • Object-based representation
  • Video indexing

Fingerprint

Dive into the research topics of 'Object-based indexing of MPEG-4 compressed video'. Together they form a unique fingerprint.

Cite this