ConvLab-3: A Flexible Dialogue System Toolkit Based on a Unified Data Format.

Qi Zhu,Christian Geishauser,Hsien-Chin Lin,Carel van Niekerk,Baolin Peng,Zheng Zhang,Shutong Feng,Michael Heck,Nurul Lubis,Dazhen Wan, Xiaochen Zhu,Jianfeng Gao,Milica Gasic,Minlie Huang

arxiv（2023）

引用 9|浏览32

暂无评分

摘要

Diverse data formats and ontologies of task-oriented dialogue (TOD) datasets hinder us from developing general dialogue models that perform well on many datasets and studying knowledge transfer between datasets. To address this issue, we present ConvLab-3, a flexible dialogue system toolkit based on a unified TOD data format. In ConvLab-3, different datasets are transformed into one unified format and loaded by models in the same way. As a result, the cost of adapting a new model or dataset is significantly reduced. Compared to the previous releases of ConvLab (Lee et al., 2019b; Zhu et al., 2020b), ConvLab-3 allows developing dialogue systems with much more datasets and enhances the utility of the reinforcement learning (RL) toolkit for dialogue policies. To showcase the use of ConvLab-3 and inspire future work, we present a comprehensive study with various settings. We show the benefit of pre-training on other datasets for few-shot fine-tuning and RL, and encourage evaluating policy with diverse user simulators.

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要