Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models
Matt Deitke, Christopher Clark, Sangho Lee, Rohun Tripathi, Yue Yang, Jae Sung Park, Mohammadreza Salehi, Niklas Muennighoff, Kyle Lo, Luca Soldaini, Jiasen Lu, Taira Anderson, Erin Bransom, Kiana Ehsani, Huong Ngo, YenSung Chen, Ajay Patel, Mark Yatskar, Chris Callison-Burch, Andrew Head, Rose Hendrix, Favyen Bastani, Eli VanderBilt, Nathan Lambert, Yvonne Chou, Arnavi Chheda, Jenna Sparks, Sam Skjonsberg, Michael Schmitz, Aaron Sarnat, Byron Bischoff, Pete Walsh, Chris Newell, Piper Wolters, Tanmay Gupta, Kuo-Hao Zeng, Jon Borchardt, Dirk Groeneveld, Crystal Nam, Sophie Lebrecht, Caitlin Wittlif, Carissa Schoenick, Oscar Michel, Ranjay Krishna, Luca Weihs, Noah A. Smith, Hannaneh Hajishirzi, Ross Girshick, Ali Farhadi, Aniruddha Kembhavi, CoRR (2024)