Exploring Hyperparameter Usage and Tuning in Machine Learning Research

2023 IEEE/ACM 2nd International Conference on AI Engineering – Software Engineering for AI (CAIN)

Abstract
The success of machine learning (ML) models depends on careful experimentation and optimization of their hyperparameters. Tuning can affect the reliability and accuracy of a trained model and is the subject of ongoing research. However, little is known about whether and how hyperparameters are used and optimized in research practice. This lack of knowledge not only limits the adoption of best practices for tuning in research, but also affects the reproducibility of published results. Our research systematically analyzes the use and tuning of hyperparameters in ML publications. To this end, we analyze 2,000 code repositories and their associated research papers from Papers with Code. We compare the use and tuning of hyperparameters across three widely used ML libraries: scikit-learn, TensorFlow, and PyTorch. Our results show that most of the available hyperparameters remain untouched, and those that are changed are set to constant values. In particular, there is a significant gap between the tuning of hyperparameters in code and its reporting in the corresponding research papers. Our results suggest that improved research and reporting practices are needed when using ML methods in order to strengthen the reproducibility of published results.
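To make the distinction concrete, the following is a minimal, hypothetical Python sketch (not taken from the paper or its dataset) of the three usage patterns the abstract contrasts, shown with scikit-learn; the estimator, hyperparameter names, and values are illustrative assumptions only.

# Illustrative sketch of hyperparameter usage patterns (hypothetical example).
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

# Pattern 1: all hyperparameters left untouched at library defaults.
clf_default = RandomForestClassifier()

# Pattern 2: a few hyperparameters overridden with hard-coded constants,
# the pattern the study reports as most common among changed values.
clf_constant = RandomForestClassifier(n_estimators=200, max_depth=10)

# Pattern 3: systematic tuning via a search procedure, the practice the
# study finds is rarely present in code and rarely reported in papers.
param_grid = {"n_estimators": [100, 200, 500], "max_depth": [5, 10, None]}
search = GridSearchCV(RandomForestClassifier(), param_grid, cv=5)
# search.fit(X_train, y_train)  # X_train/y_train assumed to exist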
Keywords
Hyperparameter, Hyperparameter Tuning, Configuration Settings