DeepTYLCV: An interpretable and experimentally validated AI model for predicting virulence of different tomato yellow leaf curl virus strains
Citations

WEB OF SCIENCE

0
Citations

SCOPUS

0

초록

Tomato yellow leaf curl virus (TYLCV) is among the most devastating pathogens affecting tomato production worldwide, with emerging virulent strains increasingly overcoming genetic resistance and triggering severe outbreaks. Traditional field diagnosis, reliant on visual inspection or image-based AI models, remains constrained by symptom dependence, environmental variability, and poor strain-level interpretability. To address these challenges, we introduce DeepTYLCV, a novel deep learning framework for accurate virulence prediction directly from viral-genome-derived open reading frame sequences. We first constructed a comprehensive dataset of globally sourced TYLCV sequences curated by virulence annotations. DeepTYLCV integrates protein language model-based embeddings with optimal concatenated conventional descriptors using a hybrid architecture composed of a transformer encoder and a multi-scale convolutional neural network, enabling effective extraction of both global and local sequence features. Benchmark analyses demonstrate that DeepTYLCV significantly outperforms our previously developed IML-TYLCV model, which was trained on Korean isolates and lacked global generalizability. Importantly, blind predictions on 15 uncharacterized or representative TYLCV isolates were experimentally validated in tomato plants, achieving 100% concordance between model predictions and observed symptom severity. Furthermore, 1D-Grad-CAM++-based interpretability analyses revealed that the model consistently focused on relevant sequence motifs associated with severe strains, offering mechanistic insights into symptom severity. DeepTYLCV is publicly available at https://balalab-skku.org/DeepTYLCV/ and represents a powerful, interpretable, and globally scalable platform for early TYLCV surveillance, resistance monitoring, and strategic disease management in tomato cultivation.

키워드

conventional descriptorsgenome-based predictionmulti-scale CNNsymptom severitytransformer encoderTYLCVPROTEINGENEGEMINIVIRUSRESISTANCELANGUAGESEQUENCEDISEASESTYLCV
제목
DeepTYLCV: An interpretable and experimentally validated AI model for predicting virulence of different tomato yellow leaf curl virus strains
저자
Bupi, NattanongSangaraju, HariharanTran, Duong ThanhSangaraju, Vinoth KumarIm, HyojinKim, MinkwanLee, SukchanManavalan, Balachandran
DOI
10.1016/j.xplc.2026.101877
발행일
2026-05-11
유형
Article
저널명
Plant Communications
7
5