A formulation dataset of poly(lactide-co-glycolide) nanoparticles for small molecule delivery.
Journal:
Scientific data
Published Date:
Jul 10, 2025
Abstract
Poly(lactide-co-glycolide) (PLGA) nanoparticles are promising drug delivery systems, widely recognized for their ability to overcome various limitations associated with conventional formulations. However, designing and optimizing such formulations is a complex and non-trivial process that heavily relies on a lengthy, iterative approach, often involving trial and error. To address the limitations of traditional approaches, formulation scientists are increasingly incorporating artificial intelligence, particularly machine learning, to rationalize and accelerate the process. Despite decades of intensive research into PLGA nanoparticles, a notable shortage remains in the availability of comprehensive open-source datasets essential for driving this accelerated development process forward. Here, we present a literature-curated dataset of 433 PLGA nanoparticle formulations encompassing 65 small molecules. The dataset aims to bridge existing data gaps and provide a comprehensive resource for research on nanoparticle formulations.