GKNnet: an relational graph convolutional network-based method with knowledge-augmented activation layer for microbial structural variation detection.

Journal: Briefings in bioinformatics
PMID:

Abstract

Structural variants (SVs) in microbial genomes play a critical role in phenotypic changes, environmental adaptation, and species evolution, with deletion variations particularly closely linked to phenotypic traits. Therefore, accurate and comprehensive identification of deletion variations is essential. Although long-read sequencing technology can detect more SVs, its high error rate introduces substantial noise, leading to high false-positive and low recall rates in existing SV detection algorithms. This paper presents an SV detection method based on graph convolutional networks (GCNs). The model first represents node features through a heterogeneous graph, leveraging the GCN to precisely identify variant regions. Additionally, a knowledge-augmented activation layer (KANLayer) with a learnable activation function is introduced to reduce noise around variant regions, thereby improving model precision and reducing false positives. A clustering algorithm then aggregates multiple overlapping regions near the variant center into a single accurate SV interval, further enhancing recall. Validation on both simulated and real datasets demonstrates that our method achieves superior F1 scores compared to benchmark methods (cuteSV, Sniffles, Svim, and Pbsv), highlighting its advantage and robustness in SV detection and offering an innovative solution for microbial genome structural variation research.

Authors

  • Fengyi Guo
    School of Artificial Intelligence and Computer Science, Jiangnan University, 1800 Lihu Avenue, Binhu District, Wuxi, Jiangsu 214122, China.
  • Yuanbo Li
    State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing 100193, PR China. Electronic address: liyuanbo@caas.cn.
  • Hongyuan Zhao
    National Engineering Research Center for Cereal Fermentation and Food Biomanufacturing, State Key Laboratory of Food Science and Technology, School of Food Science and technology, Jiangnan University, Wuxi, China.
  • Xiaogang Liu
    Sichuan Academy of Medical Sciences & Sichuan Provincial People's Hospital, Chengdu, China. gary.samsph@gmail.com.
  • Jian Mao
    State Key Laboratory of Heavy Oil Processing and College of Chemistry and Chemical Engineering, China University of Petroleum (East China), Qingdao 266580, China.
  • Dongna Ma
    National Engineering Research Center of Cereal Fermentation and Food Biomanufacturing, State Key Laboratory of Food Science and Technology, School of Food Science and Technology, Jiangnan University, Wuxi, Jiangsu 214122, China.
  • Shuangping Liu
    Center for Bio-inspired Energy Science, Northwestern University, 2145 Sheridan Road, Evanston, IL 60208, USA.