Tape measure (TM) proteins are essential for the formation of long-tailed phages. TM protein assembly into tails requires the action of tail assembly chaperones (TACs). TACs (e.g. gpG and gpT of E. coli phage lambda) are usually produced in a short (...
The PSIPRED Workbench is a web server that has offered a range of predictive methods to the bioscience community for 20 years. Here, we present the work we have completed to update the PSIPRED Protein Analysis Workbench and make it ready for the next 20 year...
Automated function prediction (AFP) of proteins is of great significance in biology. AFP can be regarded as a large-scale multi-label classification problem in which a protein can be associated with multiple gene ontology terms as its labels. Bas...
CRISPR-Cas systems are adaptive bacterial and archaeal immune systems that have been harnessed for the development of powerful genome editing and engineering tools. In the incessant host-parasite arms race, viruses evolved multiple anti-defense mechani...
Accurate prediction of protein secondary structure (alpha-helix, beta-strand and coil) is a crucial step for protein inter-residue contact prediction and ab initio tertiary structure prediction. In a previous study, we developed a deep belief network...
The identification of thermostable and alkaline xylanases from different fungal and bacterial species has attracted researchers' interest because of their biotechnological relevance in many industries, such as pulp, paper, and bioethanol. In this s...
Sesquiterpene synthases (STSs) catalyze the formation of a large class of plant volatiles called sesquiterpenes. While thousands of putative STS sequences from diverse plant species are available, only a small number of them have been functionally ch...
Proceedings of the National Academy of Sciences of the United States of America
33876751
In the field of artificial intelligence, a combination of scale in data and model capacity enabled by unsupervised learning has led to major advances in representation learning and statistical generation. In the life sciences, the anticipated growth ...
Machine learning has been increasingly used for protein engineering. However, because the general sequence contexts captured by existing machine learning algorithms are not specific to the protein being engineered, their accuracy is rather limited....
In computational biology, protein remote homology detection (PRHD) is of undeniable significance. It is especially important for identifying the structure and function of a protein sequence. The previous years have seen a challenge that la...