XMolCap: Advancing Molecular Captioning through Multimodal Fusion and Explainable Graph Neural Networks
  • Tran, Duong Thanh
  • Nguyen, Nguyen Doan Hieu
  • Pham, Nhat Truong
  • Rakkiyappan, Rajan
  • Karki, Rajendra
  • ... Manavalan, Balachandran
Citations

WEB OF SCIENCE

0
Citations

SCOPUS

2

초록

Large language models (LLMs) have significantly advanced computational biology by enabling the integration of molecular, protein, and natural language data to accelerate drug discovery. However, existing molecular captioning approaches often underutilize diverse molecular modalities and lack interpretability. In this study, we introduce XMolCap, a novel explainable molecular captioning framework that integrates molecular images, SMILES strings, and graph-based structures through a stacked multimodal fusion mechanism. The framework is built upon a BioT5-based encoder-decoder architecture, which serves as the backbone for extracting feature representations from SELFIES. By leveraging specialized models such as SwinOCSR, SciBERT, and GIN-MoMu, XMolCap effectively captures complementary information from each modality. Our model not only achieves state-of-the-art performance on two benchmark datasets (L+M-24 and ChEBI-20), outperforming several strong baselines, but also provides detailed, functional group-aware, and property-specific explanations through graph-based interpretation. We believe it holds strong potential for clinical and pharmaceutical applications by generating accurate, interpretable molecular descriptions that deepen our understanding of molecular properties and interactions. © 2013 IEEE.

키워드

Explainable artificial intelligencegraph neural networkslanguage and moleculeslarge language modelsmodel interpretationmolecular captioningmultimodal fusion
제목
XMolCap: Advancing Molecular Captioning through Multimodal Fusion and Explainable Graph Neural Networks
저자
Tran, Duong ThanhNguyen, Nguyen Doan HieuPham, Nhat TruongRakkiyappan, RajanKarki, RajendraManavalan, Balachandran
DOI
10.1109/JBHI.2025.3572910
발행일
2025-10
유형
Article
저널명
IEEE Journal of Biomedical and Health Informatics
29
10
페이지
1 ~ 12