MambaPhase: deep learning for liquid-liquid phase separation protein classification.

Journal: Briefings in bioinformatics
Published Date:

Abstract

Liquid-liquid phase separation plays a critical role in cellular processes, including protein aggregation and RNA metabolism, by forming membraneless subcellular structures. Accurate identification of phase-separated proteins is essential for understanding and controlling these processes. Traditional identification methods are effective but often costly and time-consuming. The recent machine learning methods have reduced these costs, but most models are restricted to classifying scaffold and client proteins with limited experimental conditions. To address this limitation, we developed a Mamba-based encoder using contrastive learning that incorporates separation probability, protein type, and experimental conditions. Our model achieved 95.2% accuracy in predicting phase-separated proteins and an ROCAUC score of 0.87 in classifying scaffold and client proteins. Further validation in the DgHBP-2 drug delivery system demonstrated its potential for condition modulation in drug development. This study provides an effective framework for the accurate identification and control of phase separation, facilitating advancements in biomedical research and therapeutic applications.

Authors

  • Jianwei Huang
    Department of Radiology, The Third Affiliated Hospital of Guangzhou Medical University, Guangzhou, 510000, People's Republic of China.
  • Youli Zhang
    National Institute of Diagnostics and Vaccine Development in Infectious Diseases, School of Public Health, Xiamen University, No. 422 Siming South Rd, 361005, Xiamen, Fujian, China.
  • Shulin Ren
    State Key Laboratory of Vaccines for Infectious Diseases, Xiang An Biomedicine Laboratory, School of Public Health, Xiamen University, No. 422 Siming South Rd, 361005, Xiamen, Fujian, China.
  • Ziyang Wang
  • Xiaocheng Jin
    National Institute of Diagnostics and Vaccine Development in Infectious Diseases, XiaMen University, No. 422, Siming South Road, XiaMen, 361005, Fujian, China; State Key Laboratory of Molecular Vaccinology and Molecular Diagnostics, XiaMen University, No. 422, Siming South Road, XiaMen, 361005, Fujian, China; School of Public Health, XiaMen University, No. 422, Siming South Road, XiaMen, 361005, Fujian, China.
  • Xiaoli Lu
    Information and Networking Center, XiaMen University, No. 422, Siming South Road, XiaMen, 361005, Fujian, China.
  • Yu Zhang
    College of Marine Electrical Engineering, Dalian Maritime University, Dalian, China.
  • Xiaoping Min
  • Shengxiang Ge
    National Institute of Diagnostics and Vaccine Development in Infectious Diseases, School of Life Sciences, Xiamen University, Xiamen 361102, Fujian, China.
  • Jun Zhang
    First School of Clinical Medicine, Shandong University of Traditional Chinese Medicine, Jinan, China.
  • Ningshao Xia