Detailed Information

Cited 32 time in webofscience Cited 35 time in scopus
Metadata Downloads

SARA: A memetic algorithm for high-dimensional biomedical data

Authors
Baliarsingh, SK[Baliarsingh, Santos Kumar]Muhammad, K[Muhammad, Khan]Bakshi, S[Bakshi, Sambit]
Issue Date
Mar-2021
Publisher
ELSEVIER
Keywords
Memetic algorithm (MA); Simulated annealing (SA); Rao algorithm (RA); Support vector machine (SVM); Classification
Citation
APPLIED SOFT COMPUTING, v.101
Indexed
SCIE
SCOPUS
Journal Title
APPLIED SOFT COMPUTING
Volume
101
URI
https://scholarx.skku.edu/handle/2021.sw.skku/98230
DOI
10.1016/j.asoc.2020.107009
ISSN
1568-4946
Abstract
Over the past two decades, large amounts of biomedical and clinical data have been generated. These high dimensional datasets contain thousands of genes. However, such datasets contain many irrelevant genes which influence the predictive accuracy of diagnosis. Therefore, to select the relevant genes from the dataset and to accurately identify the patterns in the genes, it is necessary to employ some gene selection and classification algorithms. In this work, a hybrid algorithm is proposed using simulated annealing (SA) and Rao algorithm (RA) for selecting the optimal gene subset and classifying cancer. SA works as a local search strategy and RA works as a global optimization framework. The reason for combining SA in RA is to improve the exploitation capability of RA. The proposed method consists of two stages. In the first stage, minimum redundancy maximum relevance (mRMR) is employed to select the relevant gene subsets from the microarray dataset. Then, SA is hybridized with RA to improve the quality of solutions after every iteration of RA. Log sigmoidal function is introduced as an encoding scheme to transform the continuous version of Simulated annealing-Rao algorithm (SARA) to a discrete optimization algorithm. The performance of our approach is tested on three binary-class and four multi-class datasets. A comparative study is carried out with eighteen existing techniques. Results from the experiments have shown that our proposed approach selects discriminating genes with high classification accuracy. Particularly, it achieves high classification accuracy on the SRBCT dataset with 99.81% with only five informative genes. (C) 2020 Elsevier B.V. All rights reserved.
Files in This Item
There are no files associated with this item.
Appears in
Collections
Computing and Informatics > Convergence > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher MUHAMMAD, KHAN photo

MUHAMMAD, KHAN
Computing and Informatics (Convergence)
Read more

Altmetrics

Total Views & Downloads

BROWSE