A Comparison Of Machine Learning Methods For The Prediction Of Breast Cancer

EvoBIO'11: Proceedings of the 9th European Conference on Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics (2011)

Abstract
In this work we perform a comparison of machine learning methods in an association study with the goal of finding reliable classifiers that predict the presence or absence of breast cancer based on single nucleotide polymorphisms from the BRCA1, BRCA2 and TP53 genes. We emphasize how misleading some common statistical measures can be when evaluating classifiers whose learning was biased by an unbalanced dataset, as in our case. Then we compare and discuss the format of different solutions from the interpretability point of view, revealing a correlation between size and performance of the solutions, and also identify a small set of preferred features that agree with previously published work. We designate CART regression trees as the best classifiers, both in terms of performance and interpretability, and discuss how to improve the results reported here.
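The point about misleading measures on unbalanced data can be illustrated with a minimal sketch. The class counts below (90 positive, 10 negative) and the trivial "always predict positive" classifier are invented for illustration and are not taken from the study; the helper names are likewise hypothetical. The sketch contrasts plain accuracy with balanced accuracy, which is one common alternative and not necessarily the measure the authors used.

```python
def confusion_counts(y_true, y_pred):
    """Return (tp, tn, fp, fn) for binary labels coded as 1 (case) and 0 (control)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return tp, tn, fp, fn


def accuracy(y_true, y_pred):
    """Fraction of samples classified correctly."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    return (tp + tn) / (tp + tn + fp + fn)


def balanced_accuracy(y_true, y_pred):
    """Mean of sensitivity and specificity; a class with no samples contributes 0."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return (sensitivity + specificity) / 2


# Hypothetical unbalanced dataset: 90 cases (1) and 10 controls (0).
y_true = [1] * 90 + [0] * 10

# A degenerate classifier that ignores its input and always predicts the majority class.
y_pred = [1] * len(y_true)

print("accuracy:", accuracy(y_true, y_pred))
print("balanced accuracy:", balanced_accuracy(y_true, y_pred))
```

Under these assumed counts the degenerate classifier scores high on plain accuracy while its balanced accuracy is no better than chance, which is the kind of discrepancy the abstract warns about.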
Keywords
Breast Cancer, Support Vector Machine, Random Forest, Genetic Programming, TP53 Gene