Automated Survey Collection with LLM-based Conversational Agents
Journal:
arXiv
Published Date:
Apr 2, 2025
Abstract
Objective: Traditional phone-based surveys are among the most accessible and
widely used methods to collect biomedical and healthcare data, however, they
are often costly, labor intensive, and difficult to scale effectively. To
overcome these limitations, we propose an end-to-end survey collection
framework driven by conversational Large Language Models (LLMs).
Materials and Methods: Our framework consists of a researcher responsible for
designing the survey and recruiting participants, a conversational phone agent
powered by an LLM that calls participants and administers the survey, a second
LLM (GPT-4o) that analyzes the conversation transcripts generated during the
surveys, and a database for storing and organizing the results. To test our
framework, we recruited 8 participants consisting of 5 native and 3 non-native
english speakers and administered 40 surveys. We evaluated the correctness of
LLM-generated conversation transcripts, accuracy of survey responses inferred
by GPT-4o and overall participant experience.
Results: Survey responses were successfully extracted by GPT-4o from
conversation transcripts with an average accuracy of 98% despite transcripts
exhibiting an average per-line word error rate of 7.7%. While participants
noted occasional errors made by the conversational LLM agent, they reported
that the agent effectively conveyed the purpose of the survey, demonstrated
good comprehension, and maintained an engaging interaction.
Conclusions: Our study highlights the potential of LLM agents in conducting
and analyzing phone surveys for healthcare applications. By reducing the
workload on human interviewers and offering a scalable solution, this approach
paves the way for real-world, end-to-end AI-powered phone survey collection
systems.