InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Li Jingwen, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang

NeurIPS 2024

Keywords: Large Vision Language Model (LVLM)