
Cross-domain One-shot Video Object Detection

  • Istanbul Technical University
  • Istanbul Medeniyet University

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

One-shot object detection (OSOD) aims to detect novel object classes using a single example of an unseen class. Cross-domain OSOD (CD-OSOD) is a more challenging problem, since the seen and unseen objects are sampled from entirely disjoint datasets. The majority of existing CD-OSOD methods focus on image datasets, while the video domain remains largely unaddressed. To tackle this problem, we introduce a one-shot cross-domain video object detection (CD-OSVOD) model that enables adaptation from still images to video. Specifically, the novel target object is designated as the query shot, and a target-driven cross-domain fine-tuning (FT) scheme is integrated with a baseline object detector. To address the requirements of long-term video object detection, the FT scheme is augmented with a novel Online Target Update (OTU) mechanism, enabling the detector to handle challenges such as appearance changes and occlusions. The OTU is controlled by a temporal aggregation module (TAM), which leverages temporal information in the video and triggers an update of the one-shot query when temporal consistency is disrupted. The proposed CD-OSVOD utilizes base models trained on the COCO and VOC still-image datasets and successfully adapts to the video domain for novel object classes. Performance evaluations on the challenging VOT-LT benchmark video dataset demonstrate significant improvements in AP50 and mAP scores, highlighting the effectiveness of the proposed domain adaptation approach.
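The abstract does not specify how the TAM measures temporal consistency or when exactly it fires the OTU. As a minimal illustration of the general idea only, the sketch below assumes consistency is the mean IoU between consecutive detected boxes over a short window, and signals a query update when that mean drops below a threshold. The class name `TemporalConsistencyMonitor`, the `window` and `threshold` parameters, and the IoU-based criterion are all hypothetical and are not taken from the paper.

```python
from collections import deque


def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


class TemporalConsistencyMonitor:
    """Hypothetical stand-in for a temporal aggregation module (assumption)."""

    def __init__(self, window=5, threshold=0.5):
        # Sliding window of IoUs between consecutive detections.
        self.ious = deque(maxlen=window)
        self.threshold = threshold
        self.prev_box = None

    def step(self, box):
        """Feed the current detection; return True if a query update should fire."""
        if self.prev_box is not None:
            self.ious.append(iou(self.prev_box, box))
        self.prev_box = box
        # Wait until the window is full before judging consistency.
        if len(self.ious) < self.ious.maxlen:
            return False
        return sum(self.ious) / len(self.ious) < self.threshold
```

In a detection loop, a `True` return from `step` would be the point at which the one-shot query is refreshed (for example, re-cropped from a recent confident detection); how the refreshed query is chosen is left open here because the abstract does not describe it.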

Original language: English
Title of host publication: 2025 33rd European Signal Processing Conference, EUSIPCO 2025 - Proceedings
Publisher: European Signal Processing Conference, EUSIPCO
Pages: 641-645
Number of pages: 5
ISBN (Electronic): 9789464593624
Publication status: Published - 2025
Event: 33rd European Signal Processing Conference, EUSIPCO 2025 - Palermo, Italy
Duration: 8 Sep 2025 → 12 Sep 2025

Publication series

Name: European Signal Processing Conference
ISSN (Print): 2219-5491

Conference

Conference: 33rd European Signal Processing Conference, EUSIPCO 2025
Country/Region: Italy
City: Palermo
Period: 8/09/25 → 12/09/25

Bibliographical note

Publisher Copyright:
© 2025 European Signal Processing Conference, EUSIPCO. All rights reserved.
