Materials Data toward Machine Learning: Advances and Challenges.

Journal: The journal of physical chemistry letters
Published Date:

Abstract

Machine learning (ML) is believed to have enabled a paradigm shift in materials research, and in practice, ML has demonstrated its power in speeding up the cost-efficient discovery of new materials and autonomizing materials laboratories. In this Perspective, current research progress in materials data which are the backbones of ML are reviewed, focusing on high-throughput data generation, standardized data storage, and data representation. More importantly, the challenging issues in materials data that should be overcome to unlock the full potential of ML in materials research and development, including classic 5V (volume, velocity, variety, veracity, and value) issues, 3M (multicomponent, multiscale, and multistage) challenges, co-mining of experimental and computational data, and materials data toward transferable/explainable ML or causal ML, are discussed.

Authors

  • Linggang Zhu
    School of Materials Science and Engineering, Beihang University, Beijing 100191, China.
  • Jian Zhou
    CTIQ, Canon Medical Research USA, Inc., Vernon Hills, 60061, USA.
  • Zhimei Sun
    School of Materials Science and Engineering, Beihang University, Beijing 100191, China.