Retention time prediction in hydrophilic interaction liquid chromatography with graph neural network and transfer learning.

Journal: Journal of chromatography. A
Published Date:

Abstract

The combination of retention time (RT), accurate mass and tandem mass spectra can improve the structural annotation in untargeted metabolomics. However, the incorporation of RT for metabolite identification has received less attention because of the limitation of available RT data, especially for hydrophilic interaction liquid chromatography (HILIC). Here, the Graph Neural Network-based Transfer Learning (GNN-TL) is proposed to train a model for HILIC RTs prediction. The graph neural network was pre-trained using an in silico HILIC RT dataset (pseudo-labeling dataset) with ∼306 K molecules. Then, the weights of dense layers in the pre-trained GNN (pre-GNN) model were fine-tuned by transfer learning using a small number of experimental HILIC RTs from the target chromatographic system. The GNN-TL outperformed the methods in Retip, including the Random Forest (RF), Bayesian-regularized neural network (BRNN), XGBoost, light gradient-boosting machine (LightGBM), and Keras. It achieved the lowest mean absolute error (MAE) of 38.6 s on the test set and 33.4 s on an additional test set. It has the best ability to generalize with a small performance difference between training, test, and additional test sets. Furthermore, the predicted RTs can filter out nearly 60% false positive candidates on average, which is valuable for the identification of compounds complementary to mass spectrometry.

Authors

  • Qiong Yang
    Institute of Modern Physics, Chinese Academy of Science, Lanzhou 730000, China.
  • Hongchao Ji
    College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, PR China.
  • Xiaqiong Fan
    College of Chemistry and Chemical Engineering, Central South University, Changsha, 410083, P. R. China.
  • Zhimin Zhang
    School of Control Science and Engineering, Shandong University, Jinan, People's Republic of China. School of Information Technology and Electrical Engineering, University of Queensland, Queensland, Australia.
  • Hongmei Lu
    College of Chemistry and Chemical Engineering, Central South University, Changsha, People's Republic of China.