"Computational Prediction of Mutagenicity Through Comprehensive Cell Painting Analysis".

Journal: Mutagenesis
Published Date:

Abstract

The mutagenicity of chemical compounds is a key consideration in toxicology, drug development, and environmental safety. Traditional methods such as the Ames test, while reliable, are time-intensive and costly. With advances in imaging and machine learning, high-content assays like Cell Painting offer new opportunities for predictive toxicology. Cell Painting captures extensive morphological features of cells, which can correlate with chemical bioactivity. In this study, we leveraged Cell Painting data to develop machine learning models for predicting mutagenicity and compared their performance with structure-based models. We used two datasets: a Broad Institute dataset containing profiles of over 30,000 molecules and a US-EPA dataset with images of 1,200 chemicals tested at multiple concentrations. By integrating these datasets, we aimed to improve the robustness of our models. Among three algorithms tested - Random Forest, Support Vector Machine, and Extreme Gradient Boosting - the third showed the best performance for both datasets. Notably, selecting the most relevant concentration per compound, the Phenotypic Altering Concentration, significantly improved prediction accuracy. Our models outperformed traditional QSAR tools such as VEGA and the CompTox Dashboard for the majority of compounds, demonstrating the utility of Cell Painting features. The Cell Painting-based models revealed morphological changes related to DNA/RNA and ER perturbation, especially in mitochondria and nuclei, aligning with mutagenicity mechanisms. Despite this, certain compounds remained challenging to predict due to inherent dataset limitations and inter-laboratory variability in Cell Painting technology. The findings highlight the potential of Cell Painting in mutagenicity prediction, offering a complementary perspective to chemical structure-based models. Future work could involve harmonizing Cell Painting methodologies across datasets and exploring deep learning techniques to enhance predictive accuracy. Ultimately, integrating Cell Painting data with QSAR descriptors in hybrid models may unlock novel insights into chemical mutagenicity.

Authors

  • Natacha Cerisier
    Université Paris Cité, INSERM U1133, CNRS UMR 8251, 75013, Paris, France.
  • Emily Truong
    Université Paris Cité, INSERM U1133, CNRS UMR 8251, 75013, Paris, France.
  • Taku Watanabe
    Division of Respiratory Medicine, Department of Internal Medicine, Hokkaido University Graduate School of Medicine, Sapporo, Japan.
  • Taro Oshiro
    Scientific Product Assessment Center, Japan Tobacco Inc, 6-2, Umegaoka, Aoba-Ku, Yokohama, Kanagawa, 227-8512, Japan.
  • Tomohiro Takahashi
    Scientific Product Assessment Center, Japan Tobacco Inc, 6-2, Umegaoka, Aoba-Ku, Yokohama, Kanagawa, 227-8512, Japan.
  • Shigeaki Ito
    Scientific Product Assessment Center, R&D Group, Japan Tobacco Inc., 6-2 Umegaoka, Aoba-ku, Yokohama, Kanagawa, 227-8512, Japan.
  • Olivier Taboureau
    INSERM U1133, CNRS UMR 8251, Unit of functional and adaptive biology, Université de Paris, Paris 75013, France. Olivier.taboureau@univ-paris-diderot.fr.

Keywords

No keywords available for this article.