谷歌浏览器插件
订阅小程序
在清言上使用

JenGAN: Stacked Shifted Filters in GAN-Based Speech Synthesis

Interspeech 2024(2024)

引用 0|浏览9
暂无评分
摘要
Non-autoregressive GAN-based neural vocoders are widely used due to theirfast inference speed and high perceptual quality. However, they often sufferfrom audible artifacts such as tonal artifacts in their generated results.Therefore, we propose JenGAN, a new training strategy that involves stackingshifted low-pass filters to ensure the shift-equivariant property. This methodhelps prevent aliasing and reduce artifacts while preserving the modelstructure used during inference. In our experimental evaluation, JenGANconsistently enhances the performance of vocoder models, yielding significantlysuperior scores across the majority of evaluation metrics.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要