Abstract
Intuitively, integrating information from multiple visual cues, such as texture, stereo disparity, and image motion, should improve performance on perceptual tasks, such as object detection. On the other hand, the additional effort required to extract and represent information from additional cues may increase computational complexity. In this work, we show that using biologically inspired integrated representation of texture and stereo disparity information for a multi-view facial detection task leads to not only improved detection performance, but also reduced computational complexity. Disparity information enables us to filter out 90% of image locations as being less likely to contain faces. Performance is improved because the filtering rejects 32% of the false detections made by a similar monocular detector at the same recall rate. Despite the additional computation required to compute disparity information, our binocular detector takes only 42 ms to process a pair of 640×480 images, 35% of the time required by the monocular detector. We also show that this integrated detector is computationally more efficient than a detector with similar performance where texture and stereo information is processed separately.
Original language | English |
---|---|
Pages (from-to) | 1100-1113 |
Number of pages | 14 |
Journal | Signal Processing: Image Communication |
Volume | 28 |
Issue number | 9 |
DOIs | |
Publication status | Published - Oct 2013 |
Funding
This work was supported by the Germany/Hong Kong Joint Research Scheme sponsored by the Research Grants Council of Hong Kong and the German Academic Exchange Service (Reference No. G HK014/09), by the Concept for the Future of Karlsruhe Institute of Technology within the framework of the German Excellence Initiative, by the General Research Fund sponsored by the Research Grants Council of Hong Kong (Reference No. 619111), and by the Istanbul Technical University Research Fund (Reference No. 36123).
Funders | Funder number |
---|---|
Concept for the Future of Karlsruhe Institute of Technology | 619111 |
Research Grants Council of Hong Kong | |
Deutscher Akademischer Austauschdienst | HK014/09 |
Istanbul Teknik Üniversitesi | 36123 |
Keywords
- Disparity energy model
- Gabor filter
- Multi-view face detection
- Stereo vision