D2O: Dynamic Discriminative Operations for Efficient Generative Inference of Large Language ModelsZhongwei Wan, Xinjian Wu,Yu Zhang,Yi Xin,Chaofan Tao, Zhihong Zhu, Xin Wang, Siqi Luo,Jing Xiong,Mi Zhangarxiv(2024)引用 0|浏览14暂无评分AI 理解论文溯源树样例生成溯源树,研究论文发展脉络Chat Paper正在生成论文摘要