"Computational Prediction of Mutagenicity Through Comprehensive Cell Painting Analysis".
Journal:
Mutagenesis
Published Date:
Aug 4, 2025
Abstract
The mutagenicity of chemical compounds is a key consideration in toxicology, drug development, and environmental safety. Traditional methods such as the Ames test, while reliable, are time-intensive and costly. With advances in imaging and machine learning, high-content assays like Cell Painting offer new opportunities for predictive toxicology. Cell Painting captures extensive morphological features of cells, which can correlate with chemical bioactivity. In this study, we leveraged Cell Painting data to develop machine learning models for predicting mutagenicity and compared their performance with structure-based models. We used two datasets: a Broad Institute dataset containing profiles of over 30,000 molecules and a US-EPA dataset with images of 1,200 chemicals tested at multiple concentrations. By integrating these datasets, we aimed to improve the robustness of our models. Among three algorithms tested - Random Forest, Support Vector Machine, and Extreme Gradient Boosting - the third showed the best performance for both datasets. Notably, selecting the most relevant concentration per compound, the Phenotypic Altering Concentration, significantly improved prediction accuracy. Our models outperformed traditional QSAR tools such as VEGA and the CompTox Dashboard for the majority of compounds, demonstrating the utility of Cell Painting features. The Cell Painting-based models revealed morphological changes related to DNA/RNA and ER perturbation, especially in mitochondria and nuclei, aligning with mutagenicity mechanisms. Despite this, certain compounds remained challenging to predict due to inherent dataset limitations and inter-laboratory variability in Cell Painting technology. The findings highlight the potential of Cell Painting in mutagenicity prediction, offering a complementary perspective to chemical structure-based models. Future work could involve harmonizing Cell Painting methodologies across datasets and exploring deep learning techniques to enhance predictive accuracy. Ultimately, integrating Cell Painting data with QSAR descriptors in hybrid models may unlock novel insights into chemical mutagenicity.
Authors
Keywords
No keywords available for this article.