Machine Learning Model Attribution Challenge

Elizabeth Merkhofer, Deepesh Chaudhari,Hyrum S. Anderson,Keith Manville, Lily Wong, João Gante

arxiv（2023）

引用 2|浏览26

暂无评分

摘要

We present the findings of the Machine Learning Model Attribution Challenge. Fine-tuned machine learning models may derive from other trained models without obvious attribution characteristics. In this challenge, participants identify the publicly-available base models that underlie a set of anonymous, fine-tuned large language models (LLMs) using only textual output of the models. Contestants aim to correctly attribute the most fine-tuned models, with ties broken in the favor of contestants whose solutions use fewer calls to the fine-tuned models' API. The most successful approaches were manual, as participants observed similarities between model outputs and developed attribution heuristics based on public documentation of the base models, though several teams also submitted automated, statistical solutions.

查看译文

关键词

attribution,machine learning,challenge,model

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要