Enhancing Object Detection in Aerial Images Using Transformer-Based Super-Resolution

Aslan Ahmet Haykir, Ilkay Öksuz

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Enhancing the resolution of aerial images is an important task in military and scientific applications. Sufficient image quality is essential for robust object detection. In this paper, we propose using transformer-based super-resolution techniques to increase object detection accuracy in aerial im-agery, specifically focusing on the Dataset for Object Detection in Aerial Images (DOTA) dataset. We utilize the Hybrid Attention Transformer for Image Restoration (HAT-L) architecture as a transformer model for super-resolution and analyze its influence on object detection performance, especially using the HAT- L model pre-trained on the ImageNet dataset. We integrate the YOLOv8 (You Only Look Once) OBB model, pre-trained on the DOTA dataset, to assess the effectiveness of our approach in enhancing object detection capabilities. Our results highlight the benefits of combining the HAT-L archi-tecture for super-resolution with the YOLOv8 OBB model for object detection tasks. We achieve a Peak Signal-to-Noise Ratio (PSNR) of 37.847 and a Structural Similarity Index (SSIM) value of 0.903 for super-resolved images on the DOTA validation set using the HAT- L architecture. Additionally, our integrated approach yields a mean Average Precision (mAP) of 0.809 at IOU 0.5 and 0.656 at IOU 0.5-0.95 using the YOLOv8x OBB model on the super-resolved images. Our findings contribute valuable insights into the effectiveness of transformer models for image enhancement and object detection tasks in remote sensing applications.

Original languageEnglish
Title of host publicationUBMK 2024 - Proceedings
Subtitle of host publication9th International Conference on Computer Science and Engineering
EditorsEsref Adali
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages966-971
Number of pages6
ISBN (Electronic)9798350365887
DOIs
Publication statusPublished - 2024
Event9th International Conference on Computer Science and Engineering, UBMK 2024 - Antalya, Turkey
Duration: 26 Oct 202428 Oct 2024

Publication series

NameUBMK 2024 - Proceedings: 9th International Conference on Computer Science and Engineering

Conference

Conference9th International Conference on Computer Science and Engineering, UBMK 2024
Country/TerritoryTurkey
CityAntalya
Period26/10/2428/10/24

Bibliographical note

Publisher Copyright:
© 2024 IEEE.

Keywords

  • Aerial Images
  • DOTA
  • HAT-L
  • Object Detection
  • Super Resolution
  • Transformers
  • YOLOv8

Fingerprint

Dive into the research topics of 'Enhancing Object Detection in Aerial Images Using Transformer-Based Super-Resolution'. Together they form a unique fingerprint.

Cite this