DataPerf: Benchmarks for Data-Centric AI Development.

Mark Mazumder,Colby R. Banbury,Xiaozhe Yao,Bojan Karlas,William Gaviria Rojas, Sudnya Frederick Diamos,Greg Diamos, Lynn He,Douwe Kiela, David Jurado,David Kanter, Rafael Mosquera, Juan Ciro,Lora Aroyo,Bilge Acun,Sabri Eyuboglu,Amirata Ghorbani,Emmett D. Goodman, Tariq Kane,Christine R. Kirkpatrick,Tzu-Sheng Kuo,Jonas Mueller,Tristan Thrush,Joaquin Vanschoren,Margaret Warren,Adina Williams,Serena Yeung,Newsha Ardalani,Praveen K. Paritosh,Ce Zhang,James Zou 0001,Carole-Jean Wu,Cody Coleman,Andrew Y. Ng,Peter Mattson,Vijay Janapa Reddi

CoRR（2022）

引用 0|浏览13

暂无评分

摘要

Machine learning research has long focused on models rather than datasets, and prominent datasets are used for common ML tasks without regard to the breadth, difficulty, and faithfulness of the underlying problems. Neglecting the fundamental importance of data has given rise to inaccuracy, bias, and fragility in real-world applications, and research is hindered by saturation across existing dataset benchmarks. In response, we present DataPerf, a community-led benchmark suite for evaluating ML datasets and data-centric algorithms. We aim to foster innovation in data-centric AI through competition, comparability, and reproducibility. We enable the ML community to iterate on datasets, instead of just architectures, and we provide an open, online platform with multiple rounds of challenges to support this iterative development. The first iteration of DataPerf contains five benchmarks covering a wide spectrum of data-centric techniques, tasks, and modalities in vision, speech, acquisition, debugging, and diffusion prompting, and we support hosting new contributed benchmarks from the community. The benchmarks, online evaluation platform, and baseline implementations are open source, and the MLCommons Association will maintain DataPerf to ensure long-term benefits to academia and industry.

查看译文

关键词

benchmarks,ai,data-centric

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要