Abstract
The MPEG-4 object-based coding standard, designed as a common platform for all multimedia applications, is inherently well-suited for video indexing applications. To fully exploit the advantages offered by MPEG-4, however, a reconsideration of existing indexing strategies is required. This paper proposes a new object-based framework for video indexing and retrieval that treats as the basic indexing unit the object itself, where changes in content are detected through observations made on the objects in the video sequence. We present a temporal segmentation algorithm that is designed to automatically extract key frames for each video object in an MPEG-4 compressed sequence based on the prediction model chosen by the encoder for individual macroblocks. An extension to the existing MPEG-4 syntax is presented for conducting and facilitating vast database searches. The data presented in the proposed 'indexing field' are: the birth and death frames of individual objects, global motion characteristics/ camera operations observed in the scene, representative key frames that capture the major transformations each object undergoes, and the dominant motion characteristics of each object throughout its lifetime. We present the validity of the proposed scheme by results obtained on several MPEG-4 test sequences.
Original language | English |
---|---|
Pages (from-to) | 953-963 |
Number of pages | 11 |
Journal | Proceedings of SPIE - The International Society for Optical Engineering |
Volume | 3024 |
DOIs | |
Publication status | Published - 1997 |
Externally published | Yes |
Event | Visual Communications and Image Processing '97 - San Jose, CA, United States Duration: 12 Feb 1997 → 12 Feb 1997 |
Keywords
- Content-based access
- MPEG-4 coding standard
- Object-based representation
- Video indexing