Abstract
Detecting small objects, those with 32 × 32 pixels or smaller, remains a challenging task in aerial imagery due to limited resolution and contextual information. In this study, we propose enlarging bounding box annotations for small objects, thereby integrating additional shape and spatial context into the training process. With this approach, we seek to reinforce both the contextual information and leverage object shape characteristics during the training process. Experiments conducted on the VisDrone dataset demonstrate that using expanded annotations yields an absolute improvement of approximately 10% in mean average precision. Notably, detection accuracy for small object classes, such as pedestrians and motors, improves by over 20% in average precision, highlighting the benefit of including surrounding contextual information for small object detection from aerial images.
| Original language | English |
|---|---|
| Title of host publication | 2025 25th International Conference on Digital Signal Processing, DSP 2025 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Electronic) | 9798331512132 |
| DOIs | |
| Publication status | Published - 2025 |
| Event | 25th International Conference on Digital Signal Processing, DSP 2025 - Pylos, Greece Duration: 25 Jun 2025 → 27 Jun 2025 |
Publication series
| Name | International Conference on Digital Signal Processing, DSP |
|---|---|
| ISSN (Print) | 1546-1874 |
| ISSN (Electronic) | 2165-3577 |
Conference
| Conference | 25th International Conference on Digital Signal Processing, DSP 2025 |
|---|---|
| Country/Territory | Greece |
| City | Pylos |
| Period | 25/06/25 → 27/06/25 |
Bibliographical note
Publisher Copyright:© 2025 IEEE.