Multi-camera spatiotemporal deep learning framework for real-time abnormal behavior detection in dense urban environments.

Journal: Scientific reports
Published Date:

Abstract

The emerging density in today's urban environments requires a strong multi-camera architecture for real-time abnormality detection and behavior analysis. Most of the existing methods tend to fail in detecting unusual behaviors due to occlusion, dynamic scene changes and high computational inefficiency. These failures often result in high rates of false positives and poor generalization for unseen anomalies. Both traditional graphs based and even the current CNN-RNN systems fail to capture complex social interactions and spatiotemporal dependencies; therefore, much is limited in such scenarios where people crowd. To address those drawbacks, this research proposes a deep learning framework for abnormal behavior detection with multiple cameras using spatiotemporal information, integrating several new methodologies. Multi Scale Graph Attention Networks (MS-GAT) are used to achieve interaction-aware anomaly detection, which has resulted in up to 30% reduction in false positives. RL-DCAT or the Reinforcement Learning Based Dynamic Camera Attention Transformer works very efficiently for optimizing surveillance focus, which helps reduce 40% of the computational overhead and increases recall by 15%. Given this, STICL-Spatiotemporal Inverse Contrastive Learning, which uses an inverse contrastive anomaly memory, increases the generalization to unseen rare anomalies by 25% improved recall. Neuromorphic event-based encoding captures speed action analysis through spiking neural networks, lowering detection latency by 60%. Finally, the BGS-MFA synthesizes new abnormal behaviors using generative behavior synthesis and meta-learned few-shot adaptation to generalize anomaly detection by 35%. Evaluation on the UCF-Crime, ShanghaiTech and Avenue Datasets showed 40% better false alarm reduction, 50% computational demands lower and an impressive 98% real-time efficiency of this multi-faceted framework. This total framework will enable multi-camera crowd surveillance with adaptive scalability and resource provisioning for real-time dynamic behavioral anomaly detection in real-world settings.

Authors

  • Sai Babu Veesam
    School of Computer Science and Engineering, VIT-AP University, Amaravathi, 522241, India. saibabuv@gmail.com.
  • B Tarakeswara Rao
    Department of C.S. E, KKR & KSR Institute of Technology and Sciences, Guntur, Andhra Pradesh, India.
  • Zarina Begum
    School of Computer Science and Engineering, VIT-AP University, Amaravathi, 522241, India.
  • R S M Lakshmi Patibandla
    Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur, Andhra Pradesh, India.
  • Arvin Arun Dcosta
    Department of CSE, School of Computer Science and Engineering, VIT-AP University, 522241, Amaravathi, India.
  • Shonak Bansal
    Department of Electronics and Communication Engineering, Chandigarh University, Gharuan, Punjab, India. shonakk@gmail.com.
  • Krishna Prakash
    Department of Electronics and Communication Engineering, NRI Institute of Technology, Agripalli, Eluru, AP, 521212, India. k_krishna2k7@yahoo.co.in.
  • Mohammad Rashed Iqbal Faruque
    Space Science Centre (ANGKASA), Institute of Climate Change (IPI), Universiti Kebangsaan Malaysia (UKM), 43600, Bangi, Selangor D. E., Malaysia. rashed@ukm.edu.my.
  • K S Al-Mugren
    Physics Department, Science College, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

Keywords

No keywords available for this article.