Temporal video segmentation using unsupervised clustering and semantic object tracking

Bilge Günsel*, A. Müfit Ferman, A. Murat Tekalp

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

87 Citations (Scopus)


This paper proposes a content-based temporal video segmentation system that integrates syntactic (domain-independent) and semantic (domain-dependent) features for automatic management of video data. Temporal video segmentation includes scene change detection and shot classification. The proposed scene change detection method consists of two steps: detection and tracking of semantic objects of interest specified by the user, and an unsupervised method for detection of cuts, and edit effects. Object detection and tracking is achieved using a region matching scheme, where the region of interest is defined by the boundary of the object. A new unsupervised scene change detection method based on two-class clustering is introduced to eliminate the data dependency of threshold selection. The proposed shot classification approach relies on semantic image features and exploits domain-dependent visual properties such as shape, color, and spatial configuration of tracked semantic objects. The system has been applied to segmentation and classification of TV programs collected from different channels. Although the paper focuses on news programs, the method can easily be applied to other TV programs with distinct semantic structure.

Original languageEnglish
Pages (from-to)592-604
Number of pages13
JournalJournal of Electronic Imaging
Issue number3
Publication statusPublished - Jul 1998


Dive into the research topics of 'Temporal video segmentation using unsupervised clustering and semantic object tracking'. Together they form a unique fingerprint.

Cite this