The compatibility of theoretical frameworks with machine learning analyses in psychological research.

Journal: Current opinion in psychology
Published Date:

Abstract

Supervised machine learning has been increasingly used in psychology and psychiatry research. Machine learning offers an important advantage over traditional statistical analyses: statistical model training in example data to enhance predictions in external test data. Additional advantages include advanced, improved statistical algorithms, and empirical methods to select a smaller set of predictor variables. Yet machine learning researchers often use large numbers of predictor variables, without using theory to guide variable selection. Such approach leads to Type I error, spurious findings, and decreased generalizability. We discuss the importance of theory to the psychology field. We offer suggestions for using theory to drive variable selection and data analyses using machine learning in psychological research, including an example from the cyberpsychology field.

Authors

  • Jon D Elhai
    Academy of Psychology and Behavior, Tianjin Normal University, No. 57-1 Wujiayao Street, Hexi District, Tianjin 300074, China; Department of Psychology, University of Toledo, 2801 West Bancroft Street, Toledo, OH 43606, USA; Department of Psychiatry, University of Toledo, 3000 Arlington Avenue, Toledo, OH 43614, USA. Electronic address: contact@jon-elhai.com.
  • Christian Montag
    Department of Molecular Psychology, Institute of Psychology and Education, Ulm University, Ulm, Germany.