Behind the Screens: Uncovering Bias in AI-Driven Video Interview Assessments Using Counterfactuals
Journal:
arXiv
Published Date:
May 17, 2025
Abstract
AI-enhanced personality assessments are increasingly shaping hiring
decisions, using affective computing to predict traits from the Big Five
(OCEAN) model. However, integrating AI into these assessments raises ethical
concerns, especially around bias amplification rooted in training data. These
biases can lead to discriminatory outcomes based on protected attributes like
gender, ethnicity, and age. To address this, we introduce a
counterfactual-based framework to systematically evaluate and quantify bias in
AI-driven personality assessments. Our approach employs generative adversarial
networks (GANs) to generate counterfactual representations of job applicants by
altering protected attributes, enabling fairness analysis without access to the
underlying model. Unlike traditional bias assessments that focus on unimodal or
static data, our method supports multimodal evaluation-spanning visual, audio,
and textual features. This comprehensive approach is particularly important in
high-stakes applications like hiring, where third-party vendors often provide
AI systems as black boxes. Applied to a state-of-the-art personality prediction
model, our method reveals significant disparities across demographic groups. We
also validate our framework using a protected attribute classifier to confirm
the effectiveness of our counterfactual generation. This work provides a
scalable tool for fairness auditing of commercial AI hiring platforms,
especially in black-box settings where training data and model internals are
inaccessible. Our results highlight the importance of counterfactual approaches
in improving ethical transparency in affective computing.