Video content representation by incremental non-negative matrix factorization

Serhat S. Bucak, Bilge Gunsel

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

33 Citations (Scopus)

Abstract

Nonnegative Matrix Factorization (NMF) is a powerful decomposition tool that has recently been used in several content representation applications. However, implementing NMF in on-line video applications poses some difficulties. This paper introduces an incremental NMF (INMF) that does not deviate from the main objective of conventional NMF, which is to minimize the reconstruction error. The proposed algorithm is capable of modeling the dynamic content of the video and thus properly controls the contribution of subsequent observations to the NMF representation. It is shown that the INMF preserves the additive, parts-based representation capability of NMF with a low computational load while offering dimension reduction. Experimental results are given to compare the reconstruction performance of the conventional and incremental NMF. In addition, video scene change detection and dynamic video content representation by INMF are investigated. Test results demonstrate that the INMF can be used as a powerful on-line factorization tool.
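
For readers unfamiliar with the objective the abstract refers to, the following is a minimal sketch of conventional NMF using the standard multiplicative updates for the squared reconstruction error ||V - WH||^2. It is not the INMF algorithm of the paper, whose update rules are not given in this record. The function names `nmf` and `encode_new_frame`, the rank `r`, and the iteration counts are illustrative assumptions; `encode_new_frame` only shows how a new observation could be represented with a fixed basis, which is one simple on-line strategy and not necessarily the one the authors use.

```python
import numpy as np

def nmf(V, r, n_iter=200, eps=1e-9, seed=0):
    """Conventional NMF sketch: V (n x m, non-negative) ~ W (n x r) @ H (r x m).

    Uses multiplicative updates for the squared Frobenius reconstruction
    error. Hypothetical helper for illustration only.
    """
    rng = np.random.default_rng(seed)
    n, m = V.shape
    W = rng.random((n, r)) + eps
    H = rng.random((r, m)) + eps
    errors = []
    for _ in range(n_iter):
        # Multiplying by non-negative ratios keeps W and H non-negative.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
        errors.append(np.linalg.norm(V - W @ H) ** 2)
    return W, H, errors

def encode_new_frame(W, v, n_iter=100, eps=1e-9):
    """Assumed on-line step: encode one new column v with the basis W held fixed."""
    h = np.full(W.shape[1], 1.0 / W.shape[1])
    WtW = W.T @ W
    Wtv = W.T @ v
    for _ in range(n_iter):
        h *= Wtv / (WtW @ h + eps)
    return h
```

In a video setting, each column of V would typically hold one vectorized frame, so the columns of W act as additive parts and each column of H as the low-dimensional representation of a frame.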

Original language: English
Title of host publication: 2007 IEEE International Conference on Image Processing, ICIP 2007 Proceedings
Pages: II113-II116
Publication status: Published - 2007
Event: 14th IEEE International Conference on Image Processing, ICIP 2007 - San Antonio, TX, United States
Duration: 16 Sept 2007 to 19 Sept 2007

Publication series

Name: Proceedings - International Conference on Image Processing, ICIP
Volume: 2
ISSN (Print): 1522-4880

Conference

Conference: 14th IEEE International Conference on Image Processing, ICIP 2007
Country/Territory: United States
City: San Antonio, TX
Period: 16/09/07 to 19/09/07

Keywords

  • Incremental algorithms
  • Non-negative matrix factorization
  • Video content representation
