Loading...
A bi-objective hybrid optimization algorithm to reduce noise and data dimension in diabetes diagnosis using support vector machines
Alirezaei, M ; Sharif University of Technology | 2019
768
Viewed
- Type of Document: Article
- DOI: 10.1016/j.eswa.2019.02.037
- Publisher: Elsevier Ltd , 2019
- Abstract:
- Diabetes mellitus is a medical condition examined by data miners for reasons such as significant health complications in affected people, the economic impact on healthcare networks, and so on. In order to find the main causes of this disease, researchers look into the patient's lifestyle, hereditary information, etc. The goal of data mining in this context is to find patterns that make early detection of the disease and proper treatment easier. Due to the high volume of data involved in therapeutic contexts and disease diagnosis, provision of the intended treatment method become almost impossible over a short period of time. This justifies the use of pre-processing techniques and data reduction methods in such contexts. In this regard, clustering and meta-heuristic algorithms maintain important roles. In this paper, a method based on the k-means clustering algorithm is first utilized to detect and delete outliers. Then, in order to select significant and effective features, four bi-objective meta-heuristic algorithms are employed to choose the least number of significant features with the highest classification accuracy using support vector machines (SVM). In addition, the 10-fold cross validation (CV) method is used to validate the constructed model. Using real case data, it is concluded that the multi-objective firefly (MOFA) and multi-objective imperialist competitive algorithm (MOICA) with a 100% classification accuracy outperform the non-dominated sorting genetic algorithm (NSGA-II) and multi-objective particle swarm optimization (MOPSO) with the accuracies of 98.2% and 94.6%, respectively. © 2019 Elsevier Ltd
- Keywords:
- Diabetes diagnosis ; Feature selection ; K-means algorithms ; Meta-heuristic algorithms ; Support vector machine ; Diagnosis ; Diseases ; Feature extraction ; Genetic algorithms ; Heuristic algorithms ; K-means clustering ; Multiobjective optimization ; Particle swarm optimization (PSO) ; Patient treatment ; Screening ; Support vector machines ; 10-fold cross-validation ; Classification accuracy ; Hybrid optimization algorithm ; Imperialist competitive algorithms ; Meta heuristic algorithm ; Multi objective particle swarm optimization ; Non dominated sorting genetic algorithm (NSGA II) ; Data mining
- Source: Expert Systems with Applications ; Volume 127 , 2019 , Pages 47-57 ; 09574174 (ISSN)
- URL: https://www.sciencedirect.com/science/article/abs/pii/S0957417419301514