Zero-Shot Object Detection and Segmentation: A Focus on Street View Imagery

Sahra Tilki, Ahmet Kaplan, Aydin Tarik Zengin

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

With the advancement of data collection technologies, the importance of new data types like street view images, in addition to satellite and aerial images, has increased. Street view images (SVI) stand out by containing more comprehensive and real-time information compared to other types of images, thus offering a rich research field for object detection and segmentation processes. Interpreting and analyzing complex street view images requires accurate and effective processing of these data types. The use of SAM (Segment-Anything Model) and Grounding DINO models, which are less emphasized in the literature on street view images, forms the focus of this study. The application of these two models provides the opportunity to successfully perform segmentation and detection processes together on street images. This approach marks a significant advancement in the analysis of street images within the field of visual data processing, enhancing efficiency.

Original languageEnglish
Title of host publication2024 IEEE 3rd International Conference on Computing and Machine Intelligence, ICMI 2024 - Proceedings
EditorsAhmed Abdelgawad, Akhtar Jamil, Alaa Ali Hameed
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350372977
DOIs
Publication statusPublished - 2024
Event3rd IEEE International Conference on Computing and Machine Intelligence, ICMI 2024 - Mt. Pleasant, United States
Duration: 13 Apr 202414 Apr 2024

Publication series

Name2024 IEEE 3rd International Conference on Computing and Machine Intelligence, ICMI 2024 - Proceedings

Conference

Conference3rd IEEE International Conference on Computing and Machine Intelligence, ICMI 2024
Country/TerritoryUnited States
CityMt. Pleasant
Period13/04/2414/04/24

Bibliographical note

Publisher Copyright:
© 2024 IEEE.

Keywords

  • grounding DINO
  • image segmentation
  • segment anything model (SAM)
  • street view images (SVI)
  • zero shot object detection

Fingerprint

Dive into the research topics of 'Zero-Shot Object Detection and Segmentation: A Focus on Street View Imagery'. Together they form a unique fingerprint.

Cite this