Densifying Assumed-Sparse Tensors

Derya Cavdar, Valeriu Codreanu, Can Karakus, Damian Podareanu, Vikram Saletore, Alexander Sergeev, Victor Suthichai, Quy Ta, Srinivas Varadharajan, Lucas A. Wilson, Rengan Xu, Pei Yang

HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2019 (2019)

Abstract

Neural machine translation, the use of neural networks to translate human language, is an area of active research exploring new neuron types and network topologies with the goal of dramatically improving machine translation performance. Current state-of-the-art approaches, such as the multi-head attention-based transformer, require very large translation corpora and many epochs to produce models of reasonable quality. Recent attempts to parallelize the official TensorFlow “Transformer” model across multiple nodes have hit roadblocks due to excessive memory use and the resulting out-of-memory errors when performing MPI collectives.