iEnhancer-DS: Attention-based improved densenet for identifying enhancers and their strength.
Journal:
Computational biology and chemistry
Published Date:
May 5, 2025
Abstract
Enhancers are short DNA fragments that enhance gene expression by binding to transcription factors. Accurately identifying enhancers and their strength is crucial for understanding gene regulation mechanisms. However, traditional enhancer sequencing techniques are costly and time-consuming. Therefore, it is necessary to develop computational methods to quickly and accurately identify enhancers and their strength. Given the limitations of existing computational methods, such as low performance and complex encoding, this study proposes a deep learning-based multi-task framework, iEnhancer-DS, for enhancer identification and their strength classification. First, feature embeddings characterizing DNA sequences are obtained using one-hot encoding and nucleotide chemical properties (NCP). Next, an improved DenseNet module is applied to learn implicit high-order features from the concatenated feature embeddings. Subsequently, the self-attention mechanism is used to dynamically assess the importance of features and assign weights to them, and then the features are passed to the multilayer perceptron (MLP) to calculate the prediction probabilities. Experimental results show that iEnhancer-DS achieves state-of-the-art performance in both enhancer identification and strength prediction. In the enhancer identification task, iEnhancer-DS improves ACC and MCC by 4.03% and 8.47% respectively over the current state-of-the-art methods. Similarly, in the enhancer strength prediction task, the ACC and MCC values of iEnhancer-DS increased by 1.40% and 3.81%, respectively. In addition, we used the t-SNE method to perform an interpretable analysis of the mechanism of action of iEnhancer-DS. The detailed code and raw data of iEnhancer-DS can be obtained from https://github.com/zha12ja/iEnhancer-DS.