Recognizing cross-lingual textual entailment with co-training using similarity and difference views

IJCNN(2014)

引用 6|浏览63
暂无评分
摘要
Cross-lingual textual entailment is a relatively new problem that detects the entailment relationship between two text fragments written in different languages. Previous work adopted machine learning algorithms and similarity measures as features to address this task. In order to overcome the high cost of human annotation and further improve the recognition performance, we present a novel co-training approach to solve this problem. We first use an off-the-shelf machine translation tool to eliminate the language gap between two texts. Then we measure the similarities and differences between two texts and regard them as sufficient and redundant views. We use those two views to conduct the co-training procedure to perform classification. Besides, a new effective Kullback-Leibler (KL) based criterion is proposed to select the results from all possible iterations. Experiments on cross-lingual datasets provided by SemEval 2013 show that our method significantly outperforms the baseline systems and previous work.
更多
查看译文
关键词
human annotation,learning (artificial intelligence),cross-lingual textual entailment recognition,kullback-leibler based criterion,language translation,off-the-shelf machine translation tool,machine learning algorithm,cotraining approach,text analysis,prediction algorithms,learning artificial intelligence,support vector machines,accuracy,feature extraction,semantics
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要