Detecting Objects in Aerial Photographs using Neural Network Techniques

Warakorn Luangluewut
Kittakorn Viriyasatr
Wichai Pawgasame
Pantape Kaewmongkol
Sanya Mitaim

Abstract

This article examines the analysis of aerial images obtained from Unmanned Aerial Vehicles (UAVs) using artificial neural networks for image recognition. The authors review the literature on image analysis for object detection and assess which models suit object detection in aerial images, selecting the YOLO, RetinaNet, and Fast R-CNN detection models for comparison. The experiments indicate that, when detecting objects in aerial images captured by UAVs, it is crucial to choose a detection model that matches the equipment used for image capture. In the experiments, the YOLO model achieved a mean Average Precision (mAP) of up to 58.5% and a processing speed of 158.13 frames per second, outperforming the other models tested on UAV-captured aerial images in both accuracy and speed.
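
For readers unfamiliar with the mAP metric reported above, the following is a minimal sketch of how it is commonly computed. It is not the authors' evaluation code: the 0.5 IoU threshold, the greedy matching rule, the all-point interpolation, and every function name are assumptions chosen for illustration.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def average_precision(
    detections: List[Tuple[float, Box]],  # (confidence score, predicted box)
    truths: List[Box],
    iou_threshold: float = 0.5,  # assumed threshold, not taken from the article
) -> float:
    """AP for one class: area under the interpolated precision-recall curve."""
    if not truths:
        return 0.0
    matched = [False] * len(truths)
    tp_flags = []
    # Visit detections from most to least confident.
    for _, box in sorted(detections, key=lambda d: d[0], reverse=True):
        best_iou, best_idx = 0.0, -1
        for i, truth in enumerate(truths):
            if matched[i]:
                continue  # each ground-truth box can be matched only once
            overlap = iou(box, truth)
            if overlap > best_iou:
                best_iou, best_idx = overlap, i
        is_tp = best_idx >= 0 and best_iou >= iou_threshold
        if is_tp:
            matched[best_idx] = True
        tp_flags.append(is_tp)

    # Precision and recall after each ranked detection.
    precisions, recalls, tp = [], [], 0
    for rank, flag in enumerate(tp_flags, start=1):
        tp += int(flag)
        precisions.append(tp / rank)
        recalls.append(tp / len(truths))

    # Make precision non-increasing from right to left (all-point interpolation).
    for i in range(len(precisions) - 2, -1, -1):
        precisions[i] = max(precisions[i], precisions[i + 1])

    # Sum the rectangles under the interpolated curve.
    ap, prev_recall = 0.0, 0.0
    for p, r in zip(precisions, recalls):
        ap += (r - prev_recall) * p
        prev_recall = r
    return ap


def mean_average_precision(per_class_ap: List[float]) -> float:
    """mAP is the unweighted mean of the per-class AP values."""
    return sum(per_class_ap) / len(per_class_ap) if per_class_ap else 0.0
```

Under these assumptions, a detector that ranks a false positive between two correct detections is penalized only on the recall range that the false positive delays, which is why AP rewards well-calibrated confidence scores as well as accurate boxes.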

Article Details

How to Cite
[1]
W. Luangluewut, K. Viriyasatr, W. Pawgasame, P. Kaewmongkol, and S. Mitaim, “Detecting Objects in Aerial Photographs using Neural Network Techniques”, DTAJ, vol. 5, no. 12, pp. 4–11, Nov. 2023.
Section
Academic Articles

References

Q. M. Chung, T. D. Le, T. V. Dang, N. D. Vo, T. V. Nguyen, and K. Nguyen, "Data Augmentation Analysis in Vehicle Detection from Aerial Videos," in 2020 RIVF International Conference on Computing and Communication Technologies (RIVF), Ho Chi Minh City, Vietnam, 2020, pp. 1-3, doi: 10.1109/RIVF48685.2020.9140740.

S. Ali, A. Siddique, H. F. Ates, and B. K. Güntürk, "Improved YOLOv4 for Aerial Object Detection," in 2021 29th Signal Processing and Communications Applications Conference (SIU), Istanbul, Turkey, 2021, pp. 1-4, doi: 10.1109/SIU53274.2021.9478027.

Y. Zuo, J. Yang, Z. Zhu, R. Li, Y. Zhou, and Y. Zheng, "Real-Time Semantic Segmentation of Aerial Videos Based on Bilateral Segmentation Network," in 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, Brussels, Belgium, 2021, pp. 2763-2766, doi: 10.1109/IGARSS47720.2021.9554952.

I. Yurchuk, V. Kovdrya, and L. Bilyanska, "Segmentation of Digital Images of Aerial Photography," in 2019 IEEE 5th International Conference Actual Problems of Unmanned Aerial Vehicles Developments (APUAVD), Kiev, Ukraine, 2019, pp. 258-261, doi: 10.1109/APUAVD47061.2019.8943841.

G.-S. Xia et al., "DOTA: A Large-scale Dataset for Object Detection in Aerial Images," in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018, pp. 3974-3983.

J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You Only Look Once: Unified, Real-Time Object Detection," in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 2016, pp. 779-788, doi: 10.1109/CVPR.2016.91.

T.-Y. Lin, P. Goyal, R. Girshick, K. He, and P. Dollár, "Focal Loss for Dense Object Detection," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 42, no. 2, pp. 318-327, 1 Feb. 2020, doi: 10.1109/TPAMI.2018.2858826.

S. Ren, K. He, R. Girshick, and J. Sun, "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks," in Advances in Neural Information Processing Systems 28 (NIPS 2015), C. Cortes, N. Lawrence, D. Lee, M. Sugiyama, and R. Garnett, Eds. 2015, pp. 1-9.

K. O’Shea and R. Nash, "An Introduction to Convolutional Neural Networks," ArXiv e-prints, pp. 1-11, 2015, doi: 10.48550/arXiv.1511.08458.

J. Sun, B. Li, Y. Jiang, and C.-Y. Wen, "A Camera-Based Target Detection and Positioning UAV System for Search and Rescue (SAR) Purposes," Sensors, vol. 16, no. 11, p. 1778, 2016, doi: 10.3390/s16111778.