Automatic 2D-to-3D video conversion by monocular depth cues fusion and utilizing human face landmarks

Mani B. Fard, Ulug Bayazit

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

In this paper, we propose a hybrid 2D-to-3D video conversion system to recover the 3D structure of the scene. Depending on the scene characteristics, geometric or height depth information is adopted to form the initial depth map. This depth map is fused with color-based depth cues to construct the nal depth map of the scene background. The depths of the foreground objects are estimated after their classi cation into human and non-human regions. Speci cally, the depth of a non-human foreground object is directly calculated from the depth of the region behind it in the background. To acquire more accurate depth for the regions containing a human, the estimation of the distance between face landmarks is also taken into account. Finally, the computed depth information of the foreground regions is superimposed on the background depth map to generate the complete depth map of the scene which is the main goal in the process of converting 2D video to 3D.

Original languageEnglish
Title of host publicationSixth International Conference on Machine Vision, ICMV 2013
PublisherSPIE
ISBN (Print)9780819499967
DOIs
Publication statusPublished - 2013
Event6th International Conference on Machine Vision, ICMV 2013 - London, United Kingdom
Duration: 16 Nov 201317 Nov 2013

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume9067
ISSN (Print)0277-786X
ISSN (Electronic)1996-756X

Conference

Conference6th International Conference on Machine Vision, ICMV 2013
Country/TerritoryUnited Kingdom
CityLondon
Period16/11/1317/11/13

Keywords

  • 2D-to-3D video conversion
  • Anthropometric-based cue
  • Color-based depth cue
  • Geometric depth cues
  • Height depth cue

Fingerprint

Dive into the research topics of 'Automatic 2D-to-3D video conversion by monocular depth cues fusion and utilizing human face landmarks'. Together they form a unique fingerprint.

Cite this