Özet
One-shot-object detection (OSOD) aims to detect novel object classes using a single example of an unseen class. Cross-domain OSOD is a more challenging problem since the seen and unseen objects are sampled from the entirely disjoint datasets. The majority of the existing CD-OSOD methods focus on image datasets where the video domain remains largely unaddressed. To tackle this problem, we introduce a one-shot cross-domain video object detection (CD-OSVOD) model enabling adaptation from the still image to the video. Specifically the novel target object is designated as the query shot and a target driven cross-domain finetuning (FT) scheme is integrated with a baseline object detector. To address the requirements of the long term video object detection, the FT scheme is augmented with a novel Online Target Update (OTU) mechanism, enabling the detector to handle challenges such as appearance changes and occlusions. The OTU is controlled by a temporal aggregation module (TAM) which leverages temporal information in video and triggers update of the one-shot query when the temporal consistency is disrupted. The proposed CD-OSVOD utilizes base models trained on COCO and VOC still image datasets and successfully adapts to the video domain for novel object classes. Performance evaluations on challenging VOT-LT benchmarking video dataset demonstrate significant improvement in AP50 and mAP scores, highlighting the effectiveness of the proposed domain adaptation approach.
| Orijinal dil | İngilizce |
|---|---|
| Ana bilgisayar yayını başlığı | 2025 33rd European Signal Processing Conference, EUSIPCO 2025 - Proceedings |
| Yayınlayan | European Signal Processing Conference, EUSIPCO |
| Sayfalar | 641-645 |
| Sayfa sayısı | 5 |
| ISBN (Elektronik) | 9789464593624 |
| DOI'lar | |
| Yayın durumu | Yayınlandı - 2025 |
| Etkinlik | 33rd European Signal Processing Conference, EUSIPCO 2025 - Palermo, Italy Süre: 8 Eyl 2025 → 12 Eyl 2025 |
Yayın serisi
| Adı | European Signal Processing Conference |
|---|---|
| ISSN (Basılı) | 2219-5491 |
???event.eventtypes.event.conference???
| ???event.eventtypes.event.conference??? | 33rd European Signal Processing Conference, EUSIPCO 2025 |
|---|---|
| Ülke/Bölge | Italy |
| Şehir | Palermo |
| Periyot | 8/09/25 → 12/09/25 |
Bibliyografik not
Publisher Copyright:© 2025 European Signal Processing Conference, EUSIPCO. All rights reserved.
Parmak izi
Cross-domain One-shot Video Object Detection' araştırma başlıklarına git. Birlikte benzersiz bir parmak izi oluştururlar.Alıntı Yap
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver