A framework to create, evaluate and select synthetic datasets for survival prediction in oncology.
Journal:
Computers in biology and medicine
Published Date:
Apr 23, 2025
Abstract
BACKGROUND AND PURPOSE: Data-driven decision-making in radiation oncology (RO) relies on integrating real-world data effectively. Synthetic data (SD), generated through machine learning, offers a solution by mimicking real-world data without compromising privacy. This paper presents a general framework for generating, evaluating, and selecting high-quality tabular SD for clinical use, focusing on survival datasets in RO.