Early feature extraction drives model performance in high-resolution chromatin accessibility prediction.

Journal: Genome Research

Published Date: Feb 17, 2026

Abstract

Fine-grained prediction of chromatin accessibility from DNA sequence is a foundational step in modeling gene expression changes resulting from sequence variants. Yet few methods operate at the resolution necessary to capture the subtle effects of single-nucleotide changes. Furthermore, it remains unclear which architectural components, such as residual connections, normalization strategies, or attention mechanisms, drive performance in these high-resolution predictions. To address these knowledge gaps, we systematically evaluate classic architectural choices and introduce ConvNeXt V2 blocks, originally developed for computer vision, as high-resolution feature extractors in deep learning models for genomic data. Integrated into diverse architectures such as convolutional neural networks (CNNs), long short-term memory (LSTM) networks, dilated CNNs, and transformers, ConvNeXt V2 blocks consistently improve performance, leading to similar prediction accuracy across these different model types. This reveals that early feature extraction, rather than downstream architecture, is the primary determinant of prediction accuracy. A comprehensive evaluation of these models on cell type-specific ATAC-seq signal prediction at 4-bp resolution identifies the ConvNeXt-based dilated CNN as the most robust performer, better preserving the signal's shape. Our codebase and benchmarks provide practical tools for high-resolution chromatin modeling.
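To make the idea of a ConvNeXt V2 block as an early feature extractor concrete, the following is a minimal NumPy sketch of a one-dimensional forward pass. It is not the authors' implementation: it assumes the block layout of the original computer-vision ConvNeXt V2 design (depthwise convolution, layer normalization, pointwise expansion, GELU, global response normalization, pointwise projection, residual connection) carried over to sequence data. The function names, the kernel size of 7, the expansion factor of 4, and the omission of learnable normalization scales and convolution biases are all illustrative assumptions.

```python
import numpy as np

def gelu(x):
    # Tanh approximation of the GELU activation.
    return 0.5 * x * (1.0 + np.tanh(np.sqrt(2.0 / np.pi) * (x + 0.044715 * x ** 3)))

def depthwise_conv1d(x, w):
    # x: (length, channels); w: (kernel, channels), kernel assumed odd.
    # Each channel is filtered independently, with zero "same" padding.
    kernel = w.shape[0]
    pad = kernel // 2
    xp = np.pad(x, ((pad, pad), (0, 0)))
    out = np.zeros(x.shape, dtype=float)
    for i in range(kernel):
        out += xp[i:i + x.shape[0]] * w[i]
    return out

def layer_norm(x, eps=1e-6):
    # Normalize over the channel axis (learnable scale/shift omitted here).
    mu = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    return (x - mu) / np.sqrt(var + eps)

def grn(x, gamma, beta, eps=1e-6):
    # Global response normalization: per-channel L2 norm over positions,
    # divided by the mean norm across channels, then used to rescale x.
    gx = np.sqrt((x ** 2).sum(axis=0, keepdims=True))
    nx = gx / (gx.mean(axis=-1, keepdims=True) + eps)
    return gamma * (x * nx) + beta + x

def convnext_v2_block_1d(x, p):
    # Hypothetical 1D adaptation of a ConvNeXt V2 block.
    h = depthwise_conv1d(x, p["dw"])
    h = layer_norm(h)
    h = gelu(h @ p["w1"] + p["b1"])   # pointwise expansion
    h = grn(h, p["gamma"], p["beta"])
    h = h @ p["w2"] + p["b2"]         # pointwise projection
    return x + h                      # residual connection

def init_params(channels, kernel=7, expansion=4, seed=0):
    # Illustrative initialization; kernel and expansion values are assumptions.
    rng = np.random.default_rng(seed)
    hidden = channels * expansion
    return {
        "dw": rng.normal(0.0, 0.02, (kernel, channels)),
        "w1": rng.normal(0.0, 0.02, (channels, hidden)),
        "b1": np.zeros(hidden),
        "gamma": np.zeros(hidden),
        "beta": np.zeros(hidden),
        "w2": rng.normal(0.0, 0.02, (hidden, channels)),
        "b2": np.zeros(channels),
    }

# Toy input: 128 sequence positions with 16 feature channels.
x = np.random.default_rng(1).normal(size=(128, 16))
y = convnext_v2_block_1d(x, init_params(16))
```

The block preserves the input shape, so in a sketch like this several blocks could be stacked ahead of any downstream module (CNN, LSTM, dilated CNN, or transformer) without changing the sequence length that module sees.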

Authors

Aayush Grover
ETH Zurich, Swiss Institute of Bioinformatics.

Till Muser
Swiss Data Science Center.

Liine Kasak
ETH Zurich.

Lin Zhang
Laboratory of Molecular Translational Medicine, Centre for Translational Medicine, Key Laboratory of Birth Defects and Related Diseases of Women and Children, Ministry of Education, Clinical Research Center for Birth Defects of Sichuan Province, West China Second Hospital, Sichuan University, Chengdu, Sichuan, 610041, China. Electronic address: [email protected].

Ekaterina Krymova
Swiss Data Science Center.

Valentina Boeva
Department of Computer Science, ETH Zurich, Zurich, 8092, Switzerland.

External Resources

PubMed ID: 41526189
