Generating new protein sequences by using dense network and attention mechanism.

Journal: Mathematical biosciences and engineering : MBE
Published Date:

Abstract

Protein engineering uses de novo protein design technology to change the protein gene sequence, and then improve the physical and chemical properties of proteins. These newly generated proteins will meet the needs of research better in properties and functions. The Dense-AutoGAN model is based on GAN, which is combined with an Attention mechanism to generate protein sequences. In this GAN architecture, the Attention mechanism and Encoder-decoder can improve the similarity of generated sequences and obtain variations in a smaller range on the original basis. Meanwhile, a new convolutional neural network is constructed by using the Dense. The dense network transmits in multiple layers over the generator network of the GAN architecture, which expands the training space and improves the effectiveness of sequence generation. Finally, the complex protein sequences are generated on the mapping of protein functions. Through comparisons of other models, the generated sequences of Dense-AutoGAN verify the model performance. The new generated proteins are highly accurate and effective in chemical and physical properties.

Authors

  • Feng Wang
    Department of Oncology, Binzhou Medical University Hospital, Binzhou, Shandong, China.
  • Xiaochen Feng
    Information Engineering Department, Changzhou University Huaide College, Taizhou, China.
  • Ren Kong
    Institute of Bioinformatics and Medical Engineering, Jiangsu University of Technology, Changzhou, China.
  • Shan Chang
    Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China.