Convolutional transform learning based fusion framework for scale invariant long term target detection and tracking in unmanned aerial vehicles.

Journal: Scientific Reports
Published Date:

Abstract

Unmanned aerial vehicles (UAVs) are increasingly available and widely used as environmental monitoring systems. Thanks to their high mobility, UAVs support many important computer vision (CV) applications and offer greater effectiveness and accessibility than surveillance cameras with a fixed view, angle, and scale. Nevertheless, owing to camera motion and complex environments, UAV-based detection remains difficult; conventional models frequently miss detections and raise false alarms. Drone-mounted cameras observe objects from changing altitudes, which leads to substantial scale variation. A detection model that uses real-time data and machine learning (ML) methods can increase target accuracy and reduce false positives, and its applications range from military operations to urban planning and wildlife monitoring. Therefore, this study develops a novel long-term target detection and tracking model for unmanned aerial vehicles using deep fusion-based convolutional transform learning (LTTDT-UAVDFCTL). The LTTDT-UAVDFCTL model aims to improve the robustness and accuracy of target detection and tracking in scale-variant environments. First, the LTTDT-UAVDFCTL technique pre-processes images with the median-enhanced Wiener filter (MEWF) to improve clarity and reduce noise. For object detection (OD), the highly accurate YOLOv8 technique is used, followed by feature extraction through a deep fusion-based convolutional transform learning backbone that combines VGG16, CapsNet, and EfficientNetB7 to capture both spatial and hierarchical features across varying scales. Moreover, the graph convolutional neural network (GCN) technique is employed for long-term target detection and tracking. Finally, the hybrid nonlinear whale optimization algorithm with sine cosine (SCWOA) is applied to select the optimal hyperparameters of the GCN technique. The LTTDT-UAVDFCTL approach is evaluated experimentally on the VisDrone dataset, where it achieved a superior mAP value of 80.13% compared with existing models.
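
The abstract does not give the exact formulation of the MEWF pre-processing step. As a rough illustration only, the sketch below assumes it can be approximated by a median filter (to suppress impulse noise) followed by a standard adaptive Wiener filter (to attenuate residual noise in flat regions). The function names `median_filter`, `wiener_filter`, and `mewf_sketch`, the 3 x 3 window, and the noise-variance estimate are all assumptions made for this example and are not taken from the paper.

```python
from statistics import mean, median


def _window(img, r, c, k):
    """Collect the k x k neighbourhood around (r, c), clamping at the borders."""
    h, w = len(img), len(img[0])
    half = k // 2
    return [
        img[min(max(r + dr, 0), h - 1)][min(max(c + dc, 0), w - 1)]
        for dr in range(-half, half + 1)
        for dc in range(-half, half + 1)
    ]


def median_filter(img, k=3):
    """Replace each pixel by the median of its k x k neighbourhood."""
    return [
        [median(_window(img, r, c, k)) for c in range(len(img[0]))]
        for r in range(len(img))
    ]


def wiener_filter(img, k=3, noise_var=None):
    """Adaptive Wiener filter driven by local mean and local variance.

    If noise_var is None, it is estimated as the average local variance
    (an assumption of this sketch, not a value from the paper).
    """
    h, w = len(img), len(img[0])
    local_mean = [[0.0] * w for _ in range(h)]
    local_var = [[0.0] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            win = _window(img, r, c, k)
            m = mean(win)
            local_mean[r][c] = m
            local_var[r][c] = mean([(x - m) ** 2 for x in win])
    if noise_var is None:
        noise_var = mean([v for row in local_var for v in row])
    out = [[0.0] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            m, v = local_mean[r][c], local_var[r][c]
            denom = max(v, noise_var)
            # Gain is near 0 in flat regions (strong smoothing)
            # and near 1 at edges (detail preserved).
            gain = max(v - noise_var, 0.0) / denom if denom > 0 else 0.0
            out[r][c] = m + gain * (img[r][c] - m)
    return out


def mewf_sketch(img, k=3):
    """Hypothetical stand-in for MEWF: median filtering, then Wiener filtering."""
    return wiener_filter(median_filter(img, k), k)
```

Applied to a flat grey image containing a single bright outlier pixel, the median stage removes the outlier and the Wiener stage leaves the now-uniform image unchanged. A production pipeline would more likely rely on an optimized image-processing library than on nested Python lists.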

Authors

  • Fatma S Alrayes
    Department of Information Systems, College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University, P.O. Box 84428, Riyadh, 11671, Saudi Arabia.
  • Nazir Ahmad
    Department of Information Systems, Community College, King Khalid University, Abha, Saudi Arabia.
  • Asma Alshuhail
    Department of Information Systems, College of Computer Sciences and Information Technology, King Faisal University, Hofuf, Saudi Arabia.
  • Menwa Alshammeri
    Department of Computer Science, College of Computer and Information Sciences, Jouf University, Sakakah, Kingdom of Saudi Arabia.
  • Ali Alqazzaz
    Faculty of Computing and Information Technology, University of Bisha, Bisha, Saudi Arabia.
  • Hassan Alkhiri
    Department of Computer Science, Faculty of Computing and Information Technology, Al-Baha University, Al-Baha, Saudi Arabia.
  • Jehad Saad Alqurni
    Department of Educational Technologies, College of Education, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, 31441, Dammam, Saudi Arabia.
  • Yahia Said
    Laboratory of Electronics and Microelectronics (EμE), Faculty of Sciences of Monastir, University of Monastir, Monastir, Tunisia.
