Learning spatio-temporal context for basketball action pose estimation with a multi-stream network.

Journal: Scientific reports

Published Date: Aug 9, 2025

Abstract

Accurate athlete pose estimation in basketball is crucial for game analysis, player training, and tactical decision-making. However, existing pose estimation methods struggle to effectively address common challenges in basketball, such as motion blur, occlusions, and complex backgrounds. To tackle these issues, this paper proposes a basketball action pose estimation framework, which first leverages a multi-dimensional data stream network to extract spatial, temporal, and contextual information separately. Specifically, the spatial stream branch aims to extract multi-scale features and captures the spatial pose information of players in single-frame images through feature fusion and spatial attention mechanisms. The temporal stream branch merges feature maps with adjacent frames, effectively capturing player motion information across consecutive frames. The context stream branch generates a global context feature vector that encodes the entire image, offering a holistic perspective for pose estimation. Subsequently, we designed a feature fusion module that integrates early fusion, late fusion, and hybrid fusion strategies to fully utilize multi-modal information. Finally, we introduced a stage-wise streaming training module that progressively enhances the model's accuracy and generalization ability through three stages. Experimental results demonstrate that the proposed framework significantly improves the accuracy and robustness of basketball action pose estimation, particularly excelling in scenarios with high dynamics and complex backgrounds.

Authors

Zhihao Zhang

Department of Radiology, Affiliated Hospital of Youjiang Medical University for Nationalities, 533000, Baise, China (Q.W., C.H., J.Z., Z.Z., X.Z.); School of Laboratory Medicine, Youjiang Medical University for Nationalities, 533000, Baise, China (Q.W., J.Z., Z.Z.).
Wenyue Liu

Faculty of Education, Universiti Kebangsaan Malaysia, 43600, Bangi, Selangor, Malaysia.
Yuan Zheng

School of Finance, Anhui University of Finance and Economics, Bengbu, Anhui 233030, China.
Linkang Du

Xi'an Jiaotong University, Xi'an, 710049, Shaanxi, China.
Lezhong Sun

Shandong Vocational University of Foreign Affairs, Rushan, 264504, Shandong, China. sunlezhong2034@163.com.

Keywords

Algorithms Basketball Humans Posture Spatio-Temporal Analysis

External Resources

View on PubMed Access via DOI PubMed (40783613)

Learning spatio-temporal context for basketball action pose estimation with a multi-stream network.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals

Learning spatio-temporal context for basketball action pose estimation with a multi-stream network.

Abstract

Authors

Keywords

External Resources

Stay Ahead of Medical AI

Popular Topics

Recent Journals