Abstract
Image segmentation at the pixel level is a time-consuming and difficult task in computer vision and image processing. Aerial (satellite/drone) photo segmentation is considered in this paper. Data from high-resolution remote sensing has enabled new applications such as more detailed per-pixel object classification. U-Net with VGG16 has made segmentation and categorization of images much more efficient and intelligent. U-Net models with pre-trained VGG16 backbones perform best across all tested scenarios. Adding the near-infrared band improves prediction results slightly compared with using RGB image bands alone. The ability to transfer images between sensors, especially between satellites and aerial images, could be improved through train-time enhancement and contrast enhancement. Further improving performance could be achieved by adding noisy training data from free online resources.
Original language | English |
---|---|
Title of host publication | Intelligent Systems - Proceedings of 4th International Conference on Machine Learning, IoT and Big Data ICMIB 2024 |
Editors | Siba Kumar Udgata, Srinivas Sethi, George Ghinea, Sanjay Kumar Kuanar |
Publisher | Springer Science and Business Media Deutschland GmbH |
Pages | 349-362 |
Number of pages | 14 |
ISBN (Print) | 9789819781591 |
DOIs | |
Publication status | Published - 2025 |
Event | 4th International Conference on Machine Learning, Internet of Things and Big Data, ICMIB 2024 - Gunupur, India Duration: 8 Mar 2024 → 10 Mar 2024 |
Publication series
Name | Lecture Notes in Networks and Systems |
---|---|
Volume | 1149 |
ISSN (Print) | 2367-3370 |
ISSN (Electronic) | 2367-3389 |
Conference
Conference | 4th International Conference on Machine Learning, Internet of Things and Big Data, ICMIB 2024 |
---|---|
Country/Territory | India |
City | Gunupur |
Period | 8/03/24 → 10/03/24 |
Bibliographical note
Publisher Copyright:© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
Keywords
- Deep learning
- Drone images
- U-Net