Research on social bot identification through behavioral feature analysis.

Journal: PloS one
Published Date:

Abstract

Accurately identifying social bot accounts is the key to preventing the use of artificial intelligence technology to forge social accounts, which can interfere with public opinion and thus cause public opinion crises. However, at present, relying only on manual identification of bot accounts has the challenges of low efficiency, high cost, and low accuracy, while existing research on batch identification of social bots lacks research on the system of behavioural characteristics of social bots, and thus lacks the construction of a model for the analysis of the behavioural characteristics of social bots. In this paper, we propose a diverse set of behavioural features for social robots based on the differences between the behavioural features of social robot accounts and normal users. The feature selection method based on OOB estimation is chosen for excluding redundant features in the constructed feature set; meanwhile, Random Forest, as a combined classification method, overcomes the problem of limitations of decision boundaries when classifying with a single decision tree, and has the characteristics of high accuracy, fast speed and stable performance. Through experiments, this paper applies it to the construction of social robot recognition model for detecting robot accounts in social platforms. The experiments prove that the effective indicators screened by the feature selection method based on OOB estimation can help improve the stability of the model. Specifically, the filtered features contribute about 20% more to the model accuracy and F1 score than other features. The social robot recognition model constructed based on random forest has higher accuracy and stability compared to the decision tree model and neural network model. Specifically, the accuracy rate is about 5% higher than other models, and other indicators are also better than other models. The experimental results show that the feature selection method based on OOB estimation and the random forest model show excellent performance in the experiments of social robot recognition, which can meet the requirements of the actual social robot recognition research and can be applied to the practical scenarios of robot account detection on social platforms.

Authors

  • Peng Zhang
    Key Laboratory of Macromolecular Science of Shaanxi Province, School of Chemistry & Chemical Engineering, Shaanxi Normal University, Xi'an, Shaanxi 710062, China.
  • Yinghui Du
    Network Public Opinion Research Center of China People's Police University, Langfang, China.
  • Qilei Wang
    Research Centre for Modern Police Technology and Equipment of China People's Police University, Langfang, China.
  • Jiyang Zhang
    Network Public Opinion Research Center of China People's Police University, Langfang, China.
  • Ruiqing Qin
    Network Public Opinion Research Center of China People's Police University, Langfang, China.