Sievenet: An Efficient Model Utilizing H.265 Codec Structure for Video Object Detection

Onur Can Koyun*, Behcet Ugur Toreyin

*Bu çalışma için yazışmadan sorumlu yazar

Araştırma sonucu: Kitap/Rapor/Konferans Bildirisinde BölümKonferans katkısıbilirkişi

Özet

In the field of video content analysis, object detection is a crucial task. The High Efficient Video Coding (H.265, HEVC) standard's coding structures are strongly correlated with the video content, creating an opportunity to utilize these structures for video object detection in a computationally efficient way. To address this, we present a video object detection method that partitions frames into macroblocks based on the H.265 structure. Blocks with spatially high-frequency content go through a dynamic-layer approach that subjects them to deeper analysis with more layers, while blocks with spatially low-frequency content undergo fewer layers to enable a lower computational load. Results on ImageNet-Vid Dataset indicate that our approach has the potential to save significant computational resources while maintaining accurate object detection performance.

Orijinal dilİngilizce
Ana bilgisayar yayını başlığıICASSPW 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing Workshops, Proceedings
YayınlayanInstitute of Electrical and Electronics Engineers Inc.
ISBN (Elektronik)9798350302615
DOI'lar
Yayın durumuYayınlandı - 2023
Etkinlik2023 IEEE International Conference on Acoustics, Speech and Signal Processing Workshops, ICASSPW 2023 - Rhodes Island, Greece
Süre: 4 Haz 202310 Haz 2023

Yayın serisi

AdıICASSPW 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing Workshops, Proceedings

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???2023 IEEE International Conference on Acoustics, Speech and Signal Processing Workshops, ICASSPW 2023
Ülke/BölgeGreece
ŞehirRhodes Island
Periyot4/06/2310/06/23

Bibliyografik not

Publisher Copyright:
© 2023 IEEE.

Parmak izi

Sievenet: An Efficient Model Utilizing H.265 Codec Structure for Video Object Detection' araştırma başlıklarına git. Birlikte benzersiz bir parmak izi oluştururlar.

Alıntı Yap