Neural network training method for materials science based on multi-source databases.

Journal: Scientific reports
PMID:

Abstract

The fourth paradigm of science has achieved great success in material discovery and it highlights the sharing and interoperability of data. However, most material data are scattered among various research institutions, and a big data transmission will consume significant bandwidth and tremendous time. At the meanwhile, some data owners prefer to protect the data and keep their initiative in the cooperation. This dilemma gradually leads to the "data island" problem, especially in material science. To attack the problem and make full use of the material data, we propose a new strategy of neural network training based on multi-source databases. In the whole training process, only model parameters are exchanged and no any external access or connection to the local databases. We demonstrate its validity by training a model characterizing material structure and its corresponding formation energy, based on two and four local databases, respectively. The results show that the obtained model accuracy trained by this method is almost the same to that obtained from a single database combining all the local ones. Moreover, different communication frequencies between the client and server are also studied to improve the model training efficiency, and an optimal frequency is recommended.

Authors

  • Jialong Guo
    Computer Network Information Center, Chinese Academy of Sciences, Beijing, 100083, China.
  • Ziyi Chen
    Department of Neurology, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, Guangdong Province, China.
  • Zhiwei Liu
    Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education, Huazhong Agricultural University, Wuhan, Hubei Province, 430070 China.
  • Xianwei Li
    China Petroleum Pipeline Engineering Co., Ltd., International, Langfang, 065000, Hebei, China.
  • Zhiyuan Xie
    Department of Physics, Renmin University of China, Beijing, 100872, China.
  • Zongguo Wang
    Computer Network Information Center, Chinese Academy of Sciences, Beijing, 100083, China. wangzg@cnic.cn.
  • Yangang Wang
    Computer Network Information Center, Chinese Academy of Sciences, Beijing, 100083, China.