Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

A Multi-Modal Assessment Framework for Comparison of Specialized Deep Learning and General-Purpose Large Language Models

Authors
Nadeem, MohammadSohail, Shahab SaquibMadsen, Dag OivindAlzahrani, Ahmed AbrahimSer, Javier DelMuhammad, Khan
Issue Date
2025
Publisher
Institute of Electrical and Electronics Engineers Inc.
Keywords
Assessment framework; Deep learning; Generative artificial intelligence; Large language models
Citation
IEEE Transactions on Big Data
Indexed
SCIE
SCOPUS
Journal Title
IEEE Transactions on Big Data
URI
https://scholarx.skku.edu/handle/2021.sw.skku/120533
DOI
10.1109/TBDATA.2025.3536937
ISSN
2332-7790
Abstract
Recent years have witnessed tremendous advancements in Al tools (e.g., ChatGPT, GPT 4, and Bard), driven by the growing power, reasoning, and efficiency of Large Language Models (LLMs). LLMs have been shown to excel in tasks ranging from poem writing and coding to essay generation and puzzle solving. Despite their proficiency in general queries, specialized tasks such as metaphor understanding and fake news detection often require finely tuned models, posing a comparison challenge with specialized Deep Learning (DL). We propose an assessment framework to compare task-specific intelligence with general-purpose LLMs on suicide and depression tendency identification. For this purpose, we trained two DL models on a suicide and depression detection dataset, followed by testing their performance on a test set. Afterward, the same test dataset is used to evaluate the performance of four LLMs (GPT 3.5, GPT 4, Google Bard, and MS Bing) using four classification metrics. The BERT-based DL model performed the best among all, with a testing accuracy of 94.61%, while GPT 4 was the runner-up with accuracy 92.5%. Results demonstrate that LLMs do not outperform the specialized DL models but are able to achieve comparable performance, making them a decent option for downstream tasks without specialized training. However, LLMs outperformed specialized models on the reduced dataset. © 2015 IEEE.
Files in This Item
There are no files associated with this item.
Appears in
Collections
Computing and Informatics > Convergence > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher MUHAMMAD, KHAN photo

MUHAMMAD, KHAN
Computing and Informatics (Convergence)
Read more

Altmetrics

Total Views & Downloads

BROWSE