AttnW2V-Enhancer: Leveraging attention and Word2Vec for enhanced enhancer prediction
  • Rehman, Mobeen Ur
  • Abbas, Zeeshan
  • Ullah, Farman
  • Hussain, Irfan
Citations

WEB OF SCIENCE

1
Citations

SCOPUS

1

초록

Accurate identification of enhancer regions in DNA sequences is essential for understanding gene regulation and its role in diverse biological processes. Enhancers are regulatory elements that influence gene expression, but their detection remains challenging due to the complexity and variability of genomic sequences. In this study, we propose AttnW2V-Enhancer, a novel model that combines Word2Vec-based sequence encoding, convolutional neural networks (CNN), and attention mechanisms to address this challenge. By leveraging Word2Vec embeddings, our model captures biologically meaningful patterns and offers a more efficient and interpretable representation than traditional methods such as one-hot encoding and physicochemical descriptors. We evaluate AttnW2V-Enhancer on an independent test set, where it achieves superior performance with an accuracy of 81.75%, sensitivity of 83.50%, specificity of 80.00%, and a Matthews Correlation Coefficient (MCC) of 0.635, outperforming existing models. Additionally, we demonstrate the effectiveness of the attention mechanism in enhancing feature learning by dynamically focusing on the most relevant sequence regions. These results confirm that integrating Word2Vec encoding with CNNs and attention mechanisms provides a powerful and interpretable framework for enhancer prediction, offering valuable insights into the identification of regulatory sequences. The source code and implementation are publicly available at: https://github.com/Rehman1995/AttnW2V-Enhancer. © 2025

키워드

Artificial intelligenceBioinformaticsComputational biologyDNA sequence analysisEnhancer predictionGenomic feature extractionNon-coding DNAWord2VecTRANSCRIPTIONAL REGULATORY ELEMENTSIDENTIFYING ENHANCERSCHROMATINSTRENGTH
제목
AttnW2V-Enhancer: Leveraging attention and Word2Vec for enhanced enhancer prediction
저자
Rehman, Mobeen UrAbbas, ZeeshanUllah, FarmanHussain, Irfan
DOI
10.1016/j.csbj.2025.07.008
발행일
2025-01
유형
Article
저널명
Computational and Structural Biotechnology Journal
27
페이지
3275 ~ 3284