Comparative analysis of deep learning based building extraction methods with the new VHR Istanbul dataset

Tolga Bakirman*, Irem Komurcu, Elif Sertel

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

12 Citations (Scopus)

Abstract

Automatic building segmentation from satellite images is an important task for various applications such as urban mapping, disaster management and regional planning. With the broader availability of very high-resolution satellite images, deep learning-based techniques have been broadly used for remote sensing image-related tasks. In this study, we generated a new building dataset, the Istanbul dataset, for the building segmentation task. 150 Pléiades image tiles of 1500 × 1500 pixels covering an area of 85 km2 area of Istanbul city were used and approximately 40,000 buildings were labelled, representing different building structures and spatial distribution. We extensively investigated the ideal architecture, encoder and hyperparameter settings for building segmentation tasks using the new Istanbul dataset. More than 60 experiments were conducted by applying state-of-the-art architectures such as U-Net, Unet++, DeepLabv3+, FPN and PSPNet with different pre-trained encoders and hyperparameters. Our experiments showed that Unet++ architecture using SE-ResNeXt101 encoder pre-trained with ImageNet provides the best results with 93.8% IoU on the Istanbul dataset. In order to prove our solution's generalizability, the ideal network has also been trained separately on Inria and Massachusetts building segmentation datasets. The networks have produced IoU values of 75.39% and 92.53% on the Inria and Massachusetts datasets, respectively. The results indicate that our ideal network solution settings outperform other methods in terms of building segmentation even without any specific architectural modification. The weights files and inference notebook is available on: https://github.com/TolgaBkm/Istanbul_Dataset.

Original languageEnglish
Article number117346
JournalExpert Systems with Applications
Volume202
DOIs
Publication statusPublished - 15 Sept 2022

Bibliographical note

Publisher Copyright:
© 2022 Elsevier Ltd

Funding

This work was supported by Istanbul Technical University, Scientific Research Office [MAB-2020-42332]. The authors would like to thank Istanbul Technical University, Center for Satellite Communications and Remote Sensing and its researchers for providing Pléides imagery and labelling buildings. This work was supported by Istanbul Technical University, Scientific Research Office [MAB-2020-42332]. The data that support the findings of this study are available on request from the corresponding author. The data are not publicly available due to research partner restrictions.

FundersFunder number
Istanbul Technical University, Center for Satellite Communications and Remote Sensing
Istanbul Technical University, Scientific Research OfficeMAB-2020-42332

    Keywords

    • Building extraction
    • Deep learning
    • Pléiades
    • Urban

    Fingerprint

    Dive into the research topics of 'Comparative analysis of deep learning based building extraction methods with the new VHR Istanbul dataset'. Together they form a unique fingerprint.

    Cite this