Spoken language identification by means of acoustic mid-level descriptors

Uwe D. Reichel,Andreas Triantafyllopoulos, Christopher Oates,Stephan Huber,Björn W. Schuller

semanticscholar（2020）

引用 0|浏览0

暂无评分

摘要

We introduce an acoustic mid-level feature (MLD) set derived from openSMILE low-level descriptors for the purpose of language characterisation and identification. The four languages targeted in this study are Georgian, Pashto, Kurmanji Kurdish, and Turkish. Language-dependent differences of these features will be discussed in terms of language typology. Furthermore, language identification by feed forward neural networks is comparatively evaluated for the MLDs and for openSMILE functionals, as well as for varying segment of analysis lengths. The best result 76.3% UAR was achieved for a joint feature set and for a minimum speech chunk length of 8 seconds.

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要