Özet
In the field of video content analysis, object detection is a crucial task. The High Efficient Video Coding (H.265, HEVC) standard's coding structures are strongly correlated with the video content, creating an opportunity to utilize these structures for video object detection in a computationally efficient way. To address this, we present a video object detection method that partitions frames into macroblocks based on the H.265 structure. Blocks with spatially high-frequency content go through a dynamic-layer approach that subjects them to deeper analysis with more layers, while blocks with spatially low-frequency content undergo fewer layers to enable a lower computational load. Results on ImageNet-Vid Dataset indicate that our approach has the potential to save significant computational resources while maintaining accurate object detection performance.
Orijinal dil | İngilizce |
---|---|
Ana bilgisayar yayını başlığı | ICASSPW 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing Workshops, Proceedings |
Yayınlayan | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Elektronik) | 9798350302615 |
DOI'lar | |
Yayın durumu | Yayınlandı - 2023 |
Etkinlik | 2023 IEEE International Conference on Acoustics, Speech and Signal Processing Workshops, ICASSPW 2023 - Rhodes Island, Greece Süre: 4 Haz 2023 → 10 Haz 2023 |
Yayın serisi
Adı | ICASSPW 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing Workshops, Proceedings |
---|
???event.eventtypes.event.conference???
???event.eventtypes.event.conference??? | 2023 IEEE International Conference on Acoustics, Speech and Signal Processing Workshops, ICASSPW 2023 |
---|---|
Ülke/Bölge | Greece |
Şehir | Rhodes Island |
Periyot | 4/06/23 → 10/06/23 |
Bibliyografik not
Publisher Copyright:© 2023 IEEE.