VITAL: A New Dataset for Benchmarking Pluralistic Alignment in Healthcare
Journal:
arXiv
Published Date:
Feb 19, 2025
Abstract
Alignment techniques have become central to ensuring that Large Language
Models (LLMs) generate outputs consistent with human values. However, existing
alignment paradigms often model an averaged or monolithic preference, failing
to account for the diversity of perspectives across cultures, demographics, and
communities. This limitation is particularly critical in health-related
scenarios, where plurality is essential due to the influence of culture,
religion, personal values, and conflicting opinions. Despite progress in
pluralistic alignment, no prior work has focused on health, likely due to the
unavailability of publicly available datasets. To address this gap, we
introduce VITAL, a new benchmark dataset comprising 13.1K value-laden
situations and 5.4K multiple-choice questions focused on health, designed to
assess and benchmark pluralistic alignment methodologies. Through extensive
evaluation of eight LLMs of varying sizes, we demonstrate that existing
pluralistic alignment techniques fall short in effectively accommodating
diverse healthcare beliefs, underscoring the need for tailored AI alignment in
specific domains. This work highlights the limitations of current approaches
and lays the groundwork for developing health-specific alignment solutions.