Audio to Deep Visual: Speaking Mouth Generation Based on 3D Sparse Landmarks

VRW(2023)

引用 0|浏览1
暂无评分
摘要
Having a system to automatically generate a talking mouth in sync with input speech would enhance speech communication and enable many novel applications. This article presents a new model that can generate 3D talking mouth landmarks from Chinese speech. We use sparse 3D landmarks to model the mouth motion, which are easy to capture and provide sufficient lip accuracy. The 4D mouth motion dataset was collected by our self-developed facial capture device, filling the gap in the Chinese speech driven lip dataset. The experimental results show that the generated talking landmarks achieve accurate. smooth, and natural 3D mouth movements.
更多
查看译文
关键词
Computing methodologie-Artificial intelligence-Natural language processing,Computing methodologies-Computer graphics-Applications
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要