Hymba: A Hybrid-head Architecture for Small Language Models
Xin Dong, Yonggan Fu, Shizhe Diao, Wonmin Byeon, Zijia Chen, Ameya Sunil Mahabaleshwarkar, Shih-Yang Liu, Matthijs Van Keirsbilck, Min-Hung Chen, Yoshi Suhara, Yingyan Lin, Jan Kautz, Pavlo Molchanov. arXiv (2024)