谷歌浏览器插件
订阅小程序
在清言上使用

Feature Subset Selection Using Heuristic and Metaheuristic Approaches for Diabetes Prediction on a Binary Encoded Dataset

Advances in complex systems/International journal of modeling, simulation and scientific computing(2024)

引用 0|浏览2
暂无评分
摘要
The Machine Learning (ML) models are prone to a curse of dimensionality. The dataset with a greater number of features involves more computational cost and it may lead to low performance in the context of prediction accuracy. Therefore, in this research work we have predicted diabetes with more accuracy by using a smaller number of features. The heuristic methods Sequential Forward Selection (SFS), Sequential Backward Selection (SBS) and metaheuristic evolutionary methods - Whale Optimization Algorithm (WOA) and Genetic Algorithm (GA) are used for performing feature subset selection. The Gini index is also used as a filter evaluator. The performance of the feature subsets is analyzed by applying three different types of ML models, Random Forest (RF), Multi-Layer Perceptron (MLP) and K-Nearest Neighbor (KNN). We have predicted type-2 diabetes with an accuracy of 96.82%. Also, we have reduced the number of features up to 67.44% i.e., identified 32.56% most relevant features.
更多
查看译文
关键词
Whale optimization algorithm,sequential forward selection,sequential backward selection,Gini index,random forest
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要