Audio to Deep Visual: Speaking Mouth Generation Based on 3D Sparse Landmarks

Hui Fang,Dongdong Weng, Zeyu Tian,Zhen Song

VRW（2023）

引用 0|浏览1

暂无评分

摘要

Having a system to automatically generate a talking mouth in sync with input speech would enhance speech communication and enable many novel applications. This article presents a new model that can generate 3D talking mouth landmarks from Chinese speech. We use sparse 3D landmarks to model the mouth motion, which are easy to capture and provide sufficient lip accuracy. The 4D mouth motion dataset was collected by our self-developed facial capture device, filling the gap in the Chinese speech driven lip dataset. The experimental results show that the generated talking landmarks achieve accurate. smooth, and natural 3D mouth movements.

查看译文

关键词

Computing methodologie-Artificial intelligence-Natural language processing,Computing methodologies-Computer graphics-Applications

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要