Abstract
With the advancement of data collection technologies, the importance of new data types like street view images, in addition to satellite and aerial images, has increased. Street view images (SVI) stand out by containing more comprehensive and real-time information compared to other types of images, thus offering a rich research field for object detection and segmentation processes. Interpreting and analyzing complex street view images requires accurate and effective processing of these data types. The use of SAM (Segment-Anything Model) and Grounding DINO models, which are less emphasized in the literature on street view images, forms the focus of this study. The application of these two models provides the opportunity to successfully perform segmentation and detection processes together on street images. This approach marks a significant advancement in the analysis of street images within the field of visual data processing, enhancing efficiency.
Original language | English |
---|---|
Title of host publication | 2024 IEEE 3rd International Conference on Computing and Machine Intelligence, ICMI 2024 - Proceedings |
Editors | Ahmed Abdelgawad, Akhtar Jamil, Alaa Ali Hameed |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9798350372977 |
DOIs | |
Publication status | Published - 2024 |
Event | 3rd IEEE International Conference on Computing and Machine Intelligence, ICMI 2024 - Mt. Pleasant, United States Duration: 13 Apr 2024 → 14 Apr 2024 |
Publication series
Name | 2024 IEEE 3rd International Conference on Computing and Machine Intelligence, ICMI 2024 - Proceedings |
---|
Conference
Conference | 3rd IEEE International Conference on Computing and Machine Intelligence, ICMI 2024 |
---|---|
Country/Territory | United States |
City | Mt. Pleasant |
Period | 13/04/24 → 14/04/24 |
Bibliographical note
Publisher Copyright:© 2024 IEEE.
Keywords
- grounding DINO
- image segmentation
- segment anything model (SAM)
- street view images (SVI)
- zero shot object detection