High-content image generation for drug discovery using generative adversarial networks.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

Immense amount of high-content image data generated in drug discovery screening requires computationally driven automated analysis. Emergence of advanced machine learning algorithms, like deep learning models, has transformed the interpretation and analysis of imaging data. However, deep learning methods generally require large number of high-quality data samples, which could be limited during preclinical investigations. To address this issue, we propose a generative modeling based computational framework to synthesize images, which can be used for phenotypic profiling of perturbations induced by drug compounds. We investigated the use of three variants of Generative Adversarial Network (GAN) in our framework, viz., a basic Vanilla GAN, Deep Convolutional GAN (DCGAN) and Progressive GAN (ProGAN), and found DCGAN to be most efficient in generating realistic synthetic images. A pre-trained convolutional neural network (CNN) was used to extract features of both real and synthetic images, followed by a classification model trained on real and synthetic images. The quality of synthesized images was evaluated by comparing their feature distributions with that of real images. The DCGAN-based framework was applied to high-content image data from a drug screen to synthesize high-quality cellular images, which were used to augment the real image data. The augmented dataset was shown to yield better classification performance compared with that obtained using only real images. We also demonstrated the application of proposed method on the generation of bacterial images and computed feature distributions for bacterial images specific to different drug treatments. In summary, our results showed that the proposed DCGAN-based framework can be utilized to generate realistic synthetic high-content images, thus enabling the study of drug-induced effects on cells and bacteria.

Authors

  • Shaista Hussain
    School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore 639798.
  • Ayesha Anees
    Institute of High Performance Computing, A∗STAR, Singapore.
  • Ankit Das
    Institute of High Performance Computing, A*STAR, 138673, Singapore.
  • Binh P Nguyen
    School of Mathematics and Statistics, Victoria University of Wellington, Kelburn Parade, Wellington 6140, New Zealand.
  • Mardiana Marzuki
    Singapore Immunology Network, A*STAR, 138648, Singapore.
  • Shuping Lin
    Skin Research Institute of Singapore, A*STAR, 138648, Singapore.
  • Graham Wright
  • Amit Singhal
    Singapore Immunology Network, A*STAR, 138648, Singapore. Electronic address: Amit_Singhal@immunol.a-star.edu.sg.