谷歌浏览器插件
订阅小程序
在清言上使用

The Perfect Blend: Redefining RLHF with Mixture of Judges

Tengyu Xu, Eryk Helenowski,Karthik Abinav Sankararaman, Di Jin, Kaiyan Peng, Eric Han,Shaoliang Nie, Chen Zhu, Hejia Zhang, Wenxuan Zhou, Zhouhao Zeng, Yun He, Karishma Mandyam, Arya Talabzadeh,Madian Khabsa, Gabriel Cohen, Yuandong Tian,Hao Ma,Sinong Wang, Han Fang

arxiv(2024)

引用 0|浏览10
暂无评分
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要