Building Chinese Interlanguage Corpus:The Case of Character Error-tagged Chinese Interlanguage Corpus of Sun Yat-Sen University
Applied Linguistics(2012)
摘要
The paper reports the preliminary findings of character error-coded Chinese Interlanguage Corpus of Sun Yat-Sen University.The corpus is used as an illustration on some theoretical issues in interlanguage corpus building.The first one is the authenticity and continuity of the corpus.The second one is the principled tagging,especially the tagging for the characters errors.The wrong characters are created by Truetype Character Editor in Windows,and stored and displayed as images.The characters can be edited.The third issue is that the retrieval tool should be multifunctional and user-friendly to guarantee the efficient use of corpus data.The last issue is the development of the sub-system of corpora.
更多查看译文
关键词
Corpus Linguistics,Part-of-Speech Tagging
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要