Multi-loss, feature fusion and improved top-two-voting ensemble for facial expression recognition in the wild.

Journal: Neural Networks: The Official Journal of the International Neural Network Society

PMID: 39615451

Abstract

Facial expression recognition (FER) in the wild is a challenging pattern recognition task that is degraded by low image quality, and it has attracted broad interest in computer vision. Existing FER methods have not achieved sufficient accuracy to support practical applications, especially in scenarios with low fault tolerance, which limits the adaptability of FER. To explore whether the accuracy of FER in the wild can be improved further, this paper proposes a novel single model named R18+FAML and an ensemble model named R18+FAML-FGA-T2V, which together apply intra-feature fusion within a single network, feature fusion among multiple networks, and an ensemble decision strategy. Built on a ResNet18 (R18) backbone, R18+FAML combines internal feature fusion with three attention blocks and uses multiple loss functions (FAML) to improve the diversity of feature extraction. To effectively integrate feature extractors from multiple networks, we propose feature fusion among networks based on the genetic algorithm (FGA). To consider and use more classification information, we propose an ensemble strategy, the improved top-two-voting (T2V) of multiple networks with the same structure. Combining these strategies, R18+FAML-FGA-T2V can focus on the main expression-aware areas by integrating the interest areas of multiple networks. In experiments on three challenging in-the-wild FER datasets, RAF-DB, AffectNet-8 and AffectNet-7, our single model R18+FAML achieves accuracies of 90.32%, 62.17% and 65.83%, and our ensemble model R18+FAML-FGA-T2V achieves 91.59%, 63.27% and 66.63%, respectively, both achieving state-of-the-art results.
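The abstract does not spell out how the improved top-two-voting works, so the following is only a minimal sketch of the general idea of letting each network's second choice count toward the ensemble decision. The function name `top_two_vote`, the weights `first_weight` and `second_weight`, the tie-breaking rule, and the example probabilities are all illustrative assumptions, not details taken from the paper.

```python
from collections import defaultdict


def top_two_vote(prob_rows, first_weight=1.0, second_weight=0.5):
    """Combine per-model class probabilities for ONE image with a top-two vote.

    prob_rows: list of per-model probability lists (same class order in each).
    The weights are hypothetical defaults, not values from the paper.
    """
    scores = defaultdict(float)
    for probs in prob_rows:
        # Rank class indices by this model's probability, highest first.
        ranked = sorted(range(len(probs)), key=lambda c: probs[c], reverse=True)
        scores[ranked[0]] += first_weight
        if len(ranked) > 1:
            scores[ranked[1]] += second_weight

    # Assumed tie-break: prefer the class with the larger summed probability.
    def tie_key(c):
        return (scores[c], sum(p[c] for p in prob_rows))

    return max(scores, key=tie_key)


# Three hypothetical networks scoring one image over three classes.
models = [
    [0.50, 0.45, 0.05],
    [0.10, 0.60, 0.30],
    [0.48, 0.47, 0.05],
]
prediction = top_two_vote(models)
```

In this toy input, plain majority voting would pick class 0 (two of three networks rank it first, by narrow margins), while the top-two vote selects class 1 because every network ranks it first or second. This illustrates why using second-choice information can change the ensemble decision.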

Authors

Guangyao Zhou
School of Computing and Artificial Intelligence, Southwest Jiaotong University, China. Electronic address: guangyao_zhou@my.swtju.edu.cn.

Yuanlun Xie
School of Information and Software Engineering, University of Electronic Science and Technology of China, China.

Yiqin Fu
China National Offshore Oil Corporation, China.

Zhaokun Wang
School of Mechanical Engineering, Nanjing University of Science and Technology, 210094 Nanjing, China.

Keywords

Algorithms; Automated Facial Recognition; Facial Expression; Humans; Neural Networks, Computer; Pattern Recognition, Automated; Voting
