InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model
Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Ziyu Liu, Shengyuan Ding, Shenxi Wu, Yubo Ma, Haodong Duan, Wenwei Zhang, Kai Chen, Dahua Lin, Jiaqi Wang
arXiv (2025)