Microbes have a profound impact on human health. Identifying disease-associated microbes would provide helpful guidance for drug development and disease treatment. Through enormous experimental effort, only a limited number of disease-associated microbes have been...
Circulating cell-free DNA (cfDNA) is a powerful diagnostic tool that is widely studied in the context of liquid biopsy in oncology and other fields. In obstetrics, maternal plasma cfDNA has already proven its utility, enabling noninvasive prenatal ...
The assay for transposase-accessible chromatin with sequencing (ATAC-seq) identifies chromatin accessibility across the genome, which is crucial for regulating gene expression. However, bulk ATAC-seq obscures cellular heterogeneity, while single-cell ATAC-seq...
Blood-brain barrier peptides (BBBP) could significantly improve the delivery of drugs to the brain, paving the way for new treatments for central nervous system (CNS) disorders. The primary challenge in treating CNS disorders lies in the difficulty p...
Deep machine learning demonstrates a capacity to uncover evolutionary relationships directly from protein sequences, in effect internalising notions inherent to classical phylogenetic tree inference. We connect these two paradigms by assessing the ca...
Understanding the genetic basis of phenotypic variation is fundamental to biology. Here we introduce GAP, a novel machine learning framework for predicting binary phenotypes from gaps in multi-species sequence alignments. GAP employs a neural network...
Modelling biological systems depends on the availability of data and components of the system at hand. As our understanding of these systems evolves, the ability to gradually refine models by adding new components of different formalisms covering sto...
Existing algorithms for assessing tumor purity are limited to a single type of omics data, such as gene expression, somatic copy number variations, somatic mutations, or DNA methylation. Here we proposed the machine learning Multi-omics Tumor Purity predict...
The interactions between long noncoding RNA (lncRNA) and microRNA (miRNA) play critical roles in life processes, highlighting the need to improve the performance of state-of-the-art models. Here, we introduced TEC-LncMir, a novel approach for pr...
Deciphering the cellular abundance in spatial transcriptomics (ST) is crucial for revealing the spatial architecture of cellular heterogeneity within tissues. However, some current spatial sequencing technologies have low resolution, leadin...