Multimodal fusion for anticipating human decision performance.

Journal: Scientific reports
Published Date:

Abstract

Anticipating human decisions while performing complex tasks remains a formidable challenge. This study proposes a multimodal machine-learning approach that leverages image features and electroencephalography (EEG) data to predict human response correctness in a demanding visual searching task. Notably, we extract a novel set of image features pertaining to object relationships using the Segment Anything Model (SAM), which enhances prediction accuracy compared to traditional features. Additionally, our approach effectively utilizes a combination of EEG signals and image features to streamline the feature set required for the Random Forest Classifier (RFC) while maintaining high accuracy. The findings of this research hold substantial potential for developing advanced fault alert systems, particularly in critical decision-making environments such as the medical and defence sectors.

Authors

  • Xuan-The Tran
    GrapheneX-UTS HAI Centre, Australian AI Institute, Faculty of Engineering and Information Technology (FEIT), University of Technology Sydney (UTS), Sydney, NSW, 2007, Australia.
  • Thomas Do
    GrapheneX-UTS HAI Centre, Australian AI Institute, Faculty of Engineering and Information Technology (FEIT), University of Technology Sydney (UTS), Sydney, NSW, 2007, Australia.
  • Nikhil R Pal
  • Tzyy-Ping Jung
    Swartz Center for Computational Neuroscience, Institute for Neural Computation, University of California San Diego, La Jolla, CA, USA.
  • Chin-Teng Lin