Solving Attention Kernel Regression Problem via Pre-conditioner

arXiv (Cornell University), 2023

Abstract
Large language models have shown impressive performance on many tasks. From a computational perspective, one of their central operations is computing the attention matrix. Previous works [Zandieh, Han, Daliri, and Karbasi 2023, Alman and Song 2023] have formally studied the possibility and impossibility of approximating the attention matrix. In this work, we define and study a new problem, the attention kernel regression problem. We show how to solve attention kernel regression in input-sparsity time of the data matrix.
Keywords
attention kernel regression problem, pre-conditioner