Cross-lingual speaker transfer for Cambodian based on feature disentangler and time-frequency attention adaptive normalization

Yuanzhang Yang, Linqin Wang,Shengxiang Gao,Zhengtao Yu, Ling Dong

INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS(2024)

引用 0|浏览5
暂无评分
摘要
Purpose - This paper aims to disentangle Chinese-English-rich resources linguistic and speaker timbre features, achieving cross-lingual speaker transfer for Cambodian.Design/methodology/approach - This study introduces a novel approach: the construction of a cross-lingual feature disentangler coupled with the integration of time-frequency attention adaptive normalization to proficiently convert Cambodian speaker timbre into Chinese-English without altering the underlying Cambodian speech content.Findings - Considering the limited availability of multi-speaker corpora in Cambodia, conventional methods have demonstrated subpar performance in Cambodian speaker voice transfer.Originality/value - The originality of this study lies in the effectiveness of the disentanglement process and precise control over speaker timbre feature transfer.
更多
查看译文
关键词
Cambodian,Speaker transfer,Cross-lingual non-parallel resource,Feature disentangler
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要