Genetic algorithm based wrapper feature selection on hybrid prediction model for analysis of high dimensional data
Industrial and Information Systems(2014)
摘要
Data mining concepts have been extensively used for disease prediction in the medical field. Many Hybrid Prediction Models (HPM) have been proposed and implemented in this area, however, there is always a need for increasing accuracy and efficiency. The existing methods take into account all the features to build the classifier model thus reducing the accuracy and increasing the overall processing time. This paper proposes a Genetic Algorithm based Wrapper feature selection Hybrid Prediction Model (GWHPM). This model initially uses k-means clustering technique to remove the outliers from the dataset. Further, an optimal set of features are obtained by using Genetic Algorithm based Wrapper feature selection. Finally, it is used to build the classifier models such as Decision Tree, Naive Bayes, k nearest neighbor and Support Vector Machine. A comparative study of GWHPM is carried out and it is observed that the proposed model performed better than the existing methods.
更多查看译文
关键词
data analysis,data mining,diseases,feature selection,genetic algorithms,medical computing,pattern classification,pattern clustering,gwhpm,hpm,classifier model,data mining concept,disease prediction,genetic algorithm based wrapper feature selection,high dimensional data analysis,hybrid prediction model,k-means clustering technique,medical field,classifier,clustering,feature selection (fs),genetic algorithm based wrapper feature selection hybrid prediction model (gwhpm),hybrid prediction model (hpm),diabetes,support vector machines,predictive models,accuracy
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要