Dynatask: A Framework for Creating Dynamic AI Benchmark Tasks

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022): System Demonstrations (2022)

Abstract
We introduce Dynatask: an open-source system for setting up custom NLP tasks that aims to greatly lower the technical knowledge and effort required for hosting and evaluating state-of-the-art NLP models, as well as for conducting model-in-the-loop data collection with crowdworkers. Dynatask is integrated with Dynabench, a research platform for rethinking benchmarking in AI that facilitates human- and model-in-the-loop data collection and evaluation. To create a task, users only need to write a short task configuration file from which the relevant web interfaces and model hosting infrastructure are automatically generated. The system is available at https://dynabench.org/ and the full library can be found at https://github.com/facebookresearch/dynabench.
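The abstract notes that a new task is defined through a short configuration file from which the data-collection interface and model-hosting infrastructure are generated. Below is a minimal, hypothetical sketch of what such a configuration might contain; the field names (task_name, input_fields, metrics, and so on) are illustrative assumptions and do not reflect Dynabench's actual schema.

```python
# Hypothetical sketch of a Dynatask-style task configuration.
# All field names below are assumptions for illustration, not Dynabench's real schema.
import json

task_config = {
    "task_name": "sentiment",                      # short identifier for the new task
    "instructions": "Write a sentence the model will misclassify.",
    "input_fields": [
        {"name": "statement", "type": "string"},   # what crowdworkers submit
    ],
    "output_fields": [
        {"name": "label", "type": "multiclass",
         "labels": ["negative", "neutral", "positive"]},
    ],
    "metrics": ["accuracy"],                        # how hosted models are scored
    "model_in_the_loop": True,                      # collect examples against a live model
}

# A file like this would be read by the platform to generate the web interface
# and evaluation endpoints for the task.
with open("task_config.json", "w") as f:
    json.dump(task_config, f, indent=2)
```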
Keywords
dynamic AI benchmark tasks