Scientific discovery in the age of artificial intelligence.

Journal: Nature
Published Date:

Abstract

Artificial intelligence (AI) is being increasingly integrated into scientific discovery to augment and accelerate research, helping scientists to generate hypotheses, design experiments, collect and interpret large datasets, and gain insights that might not have been possible using traditional scientific methods alone. Here we examine breakthroughs over the past decade that include self-supervised learning, which allows models to be trained on vast amounts of unlabelled data, and geometric deep learning, which leverages knowledge about the structure of scientific data to enhance model accuracy and efficiency. Generative AI methods can create designs, such as small-molecule drugs and proteins, by analysing diverse data modalities, including images and sequences. We discuss how these methods can help scientists throughout the scientific process and the central issues that remain despite such advances. Both developers and users of AI toolsneed a better understanding of when such approaches need improvement, and challenges posed by poor data quality and stewardship remain. These issues cut across scientific disciplines and require developing foundational algorithmic approaches that can contribute to scientific understanding or acquire it autonomously, making them critical areas of focus for AI innovation.

Authors

  • Hanchen Wang
    Key Laboratory of Novel Functional Textile Fibers and Materials, Minjiang University, Fuzhou 350108, China; College of Material Engineering, Fujian Agriculture and Forestry University, Fuzhou 350002, China.
  • Tianfan Fu
    Department of Computer Science, Nanjing University, Nanjing, Jiangsu, China.
  • Yuanqi Du
    Department of Computer Science, George Mason University, Fairfax, VA 22030, USA.
  • Wenhao Gao
    Department of Chemical Engineering, MIT, Cambridge, Massachusetts 02139, United States.
  • Kexin Huang
    School of Life Science and Technology, Xidian University, Xi'an, Shaanxi 710071, PR China.
  • Ziming Liu
    Mechanical, Aerospace and Biomedical Engineering, University of Tennessee, Knoxville, USA.
  • Payal Chandak
    Harvard-MIT Program in Health Sciences and Technology, Cambridge, MA, USA.
  • Shengchao Liu
    Department of Computer Sciences , University of Wisconsin-Madison , Madison , Wisconsin 53706 , United States.
  • Peter Van Katwyk
    Department of Earth, Environmental and Planetary Sciences, Brown University, Providence, RI, USA.
  • Andreea Deac
    Mila - Quebec AI Institute, Montreal, Quebec, Canada.
  • Anima Anandkumar
    Department of Computing and Mathematical Science, California Institute of Technology, Pasadena, California; NVIDIA Corporation, Santa Clara, California.
  • Karianne Bergen
    Department of Earth, Environmental and Planetary Sciences, Brown University, Providence, RI, USA.
  • Carla P Gomes
    Department of Computer Science, Cornell University , Ithaca, New York 14850, United States.
  • Shirley Ho
    Center for Computational Astrophysics, Flatiron Institute, New York, NY, USA.
  • Pushmeet Kohli
    DeepMind, London, UK.
  • Joan Lasenby
    Department of Engineering, University of Cambridge, Cambridge, UK.
  • Jure Leskovec
    Department of Computer Science, Stanford University.
  • Tie-Yan Liu
    Microsoft Research Asia, Beijing 100080, China.
  • Arjun Manrai
    Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
  • Debora Marks
    Department of Systems Biology, Harvard Medical School, Boston, MA, USA.
  • Bharath Ramsundar
    Department of Computer Science , Stanford University , Stanford , CA 94305 , USA.
  • Le Song
    Department of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, Georgia, USA.
  • Jimeng Sun
    College of Computing Georgia Institute of Technology Atlanta, GA, USA.
  • Jian Tang
    Department of Decision Sciences HEC, Université de Montréal, Montreal, Québec, Canada.
  • Petar Veličković
    Computer Laboratory, University of Cambridge, Cambridge, Cambs, England, United Kingdom.
  • Max Welling
    Informatics Institute at the University of Amsterdam, Amsterdam 1098 XH, the Netherlands; AMLab, Amsterdam, 1098 XH, the Netherlands.
  • Linfeng Zhang
    Department of Mathematics and Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ 08544, USA.
  • Connor W Coley
    Department of Chemical Engineering, Massachusetts Institute of Technology 77 Massachusetts Avenue Cambridge MA 02139 USA whgreen@mit.edu kfjensen@mit.edu.
  • Yoshua Bengio
    Université de Montréal, Montréal QC H3T 1N8, Canada.
  • Marinka Zitnik
    Department of Computer Science, Stanford University.