Accurate multi-category student performance forecasting at early stages of online education using neural networks.

Journal: Scientific reports
PMID:

Abstract

The ability to accurately predict and analyze student performance in online education, both at the outset and throughout the semester, is vital. Most of the published studies focus on binary classification (Fail or Pass) but there is still a significant research shortcoming in predicting performance of students across multiple categories. This study introduces a novel neural network-based approach capable of accurately predicting student performance and identifying vulnerable students at early stages of the online courses. The open university learning analytics (OULA) dataset is employed to develop and test the proposed model, which predicts outcomes in Distinction, Fail, Pass, and Withdrawn categories. The OULA dataset is preprocessed to extract features from demographic data, assessment data, and clickstream interactions within a virtual learning environment (VLE). Novel features engineering has been utilized to predict students' performance across multiple categories at early stages of courses. Specially, students' VLE interactions are aggregated by total clicks to represent daily engagement and assess online activity. Comparative simulations indicate that the proposed model significantly outperforms existing baseline models including artificial neural network long short-term memory (ANN-LSTM), random forest (RF) 'gini', RF 'entropy' and deep feed forward neural network (DFFNN) in terms of accuracy, precision, recall, and F1-score. The results indicate that the prediction accuracy of the proposed method is about [Formula: see text] more than the existing state-of-the-art methods. Furthermore, compared to existing methodologies, the model demonstrates superior predictive capability across temporal course progression, achieving superior accuracy even at the initial [Formula: see text] phase of course completion.

Authors

  • Naveed Ur Rehman Junejo
    School of Physics and Electronic Engineering, Hanshan Normal University, Chaozhou, 521041, China.
  • Muhammad Wasim Nawaz
    Department of Computer Engineering, The University of Lahore, Lahore, Punjab, 54000, Pakistan.
  • Qingsheng Huang
    School of Mathematics and Statistics, Hanshan Normal University, Chaozhou, 521041, China. huangqs@hstc.edu.cn.
  • Xiaoqing Dong
    School of Physics and Electronic Engineering, Hanshan Normal University, Chaozhou, 521041, China.
  • Chang Wang
    Key Laboratory of the plateau of environmental damage control, Lanzhou General Hospital of Lanzhou Military Command, Lanzhou, China.
  • Gengzhong Zheng
    Department of Computer Science and Engineering, Hanshan Normal University, Chaozhou, 521041, China. zhenggz@hstc.edu.cn.