Generative models for synthetic data generation: application to pharmacokinetic/pharmacodynamic data.

Journal: Journal of pharmacokinetics and pharmacodynamics
Published Date:

Abstract

The generation of synthetic patient data that reflect the statistical properties of real data plays a fundamental role in today's world because of its potential to (i) be enable proprietary data access for statistical and research purposes and (ii) increase available data (e.g., in low-density regions-i.e., for patients with under-represented characteristics). Generative methods employ a family of solutions for generating synthetic data. The objective of this research is to benchmark numerous state-of-the-art deep-learning generative methods across different scenarios and clinical datasets comprising patient covariates and several pharmacokinetic/pharmacodynamic endpoints. We did this by implementing various probabilistic models aimed at generating synthetic data, such as the Multi-layer Perceptron Conditioning Generative Adversarial Neural Network (MLP cGAN), Time-series Generative Adversarial Networks (TimeGAN), and a more traditional approach like Probabilistic Autoregressive (PAR). We evaluated their performance by calculating discriminative and predictive scores. Furthermore, we conducted comparisons between the distributions of real and synthetic data using Kolmogorov-Smirnov and Chi-square statistical tests, focusing respectively on covariate and output variables of the models. Lastly, we employed pharmacometrics-related metric to enhance interpretation of our results specific to our investigated scenarios. Results indicate that multi-layer perceptron-based conditional generative adversarial networks (MLP cGAN) exhibit the best overall performance for most of the considered metrics. This work highlights the opportunities to employ synthetic data generation in the field of clinical pharmacology for augmentation and sharing of proprietary data across institutions.

Authors

  • Yulun Jiang
    School of Computer and Communication Science, Ecole Polytechnique Federale de Lausanne (EPFL), Lausanne, Switzerland.
  • Alberto García-Durán
    Merck Quantitative Pharmacology, Ares Trading SA (an affiliate of Merck KGaA, Darmstadt, Germany), Lausanne, Switzerland.
  • Idris Bachali Losada
    Quantitative Pharmacology, Ares Trading S.A. (An Affiliate of Merck KGaA, Darmstadt, Germany), Lausanne, Switzerland.
  • Pascal Girard
    Merck Institute for Pharmacometrics, Merck Serono S.A., Switzerland, a Subsidiary of Merck KGaA, Darmstadt, Germany.
  • Nadia Terranova
    Merck Institute for Pharmacometrics, Merck Serono S.A., Switzerland, a Subsidiary of Merck KGaA, Darmstadt, Germany. nadia.terranova@merckgroup.com.