基本信息
浏览量:344
职业迁徙
个人简介
Prof. Woodland’s research is in the area on speech and language technology with a major focus on developing all aspects of speech recognition systems.
His group has developed a number of techniques in that have been widely used in large vocabulary systems including standard methods for transform-based adaptation and discriminative sequence training. He has worked on the use of deep neural networks for both acoustic models and language models. His current work has a focus on the use and development of end-to-end trainable neural network systems. One area of interest is developing flexible systems that can adapt to a wide range of speakers, acoustic conditions, speaking style, language, task etc., with relatively limited training resources. This includes work on unsupervised training, active learning and self-supervised learning, the use of speech and text data for adapting models, as well as contextual speech recognition for biasing neural models. He is also interested in areas including speaker diarisation (who spoke when), emotion recognition from speech data, processing highly overlapped data, multimodal data (speech and video), optimisation techniques large for large sequence-to-sequence models models and confidence estimation.
He is well known for his work on the HTK large vocabulary speech recognition systems.
He has also worked on audio indexing, machine translation from speech, keyword spotting, auditory modelling and speech synthesis.
His group has developed a number of techniques in that have been widely used in large vocabulary systems including standard methods for transform-based adaptation and discriminative sequence training. He has worked on the use of deep neural networks for both acoustic models and language models. His current work has a focus on the use and development of end-to-end trainable neural network systems. One area of interest is developing flexible systems that can adapt to a wide range of speakers, acoustic conditions, speaking style, language, task etc., with relatively limited training resources. This includes work on unsupervised training, active learning and self-supervised learning, the use of speech and text data for adapting models, as well as contextual speech recognition for biasing neural models. He is also interested in areas including speaker diarisation (who spoke when), emotion recognition from speech data, processing highly overlapped data, multimodal data (speech and video), optimisation techniques large for large sequence-to-sequence models models and confidence estimation.
He is well known for his work on the HTK large vocabulary speech recognition systems.
He has also worked on audio indexing, machine translation from speech, keyword spotting, auditory modelling and speech synthesis.
研究兴趣
论文共 345 篇作者统计合作学者相似作者
按年份排序按引用量排序主题筛选期刊级别筛选合作者筛选合作机构筛选
时间
引用量
主题
期刊级别
合作者
合作机构
COMPUTER SPEECH AND LANGUAGE (2025)
arxiv(2024)
引用0浏览0引用
0
0
IEEE International Conference on Acoustics, Speech, and Signal Processingpp.11836-11840, (2024)
arXiv (Cornell University) (2024): 2078-2093
Annual Meeting of the Association for Computational Linguisticspp.1139-1157, (2024)
Interspeech 2024pp.717-721, (2024)
arXiv (Cornell University) (2024)
Speaker and Language Recognition Workshoppp.260-265, (2024)
加载更多
作者统计
#Papers: 344
#Citation: 16057
H-Index: 68
G-Index: 112
Sociability: 8
Diversity: 2
Activity: 47
合作学者
合作机构
D-Core
- 合作者
- 学生
- 导师
数据免责声明
页面数据均来自互联网公开来源、合作出版商和通过AI技术自动分析结果,我们不对页面数据的有效性、准确性、正确性、可靠性、完整性和及时性做出任何承诺和保证。若有疑问,可以通过电子邮件方式联系我们:report@aminer.cn