谷歌浏览器插件
订阅小程序
在清言上使用

Sintaktikoki etiketatutako euskarazko corpus historikoa eraikitzen

Fontes Linguae Vasconum 50 urte. Ekarpen berriak euskararen ikerketari / Nuevas aportaciones al estudio de la lengua(2020)

引用 0|浏览4
暂无评分
摘要
In this paper we present an ongoing project to build a morphosyntactically annotated historical corpus of Basque. The corpus will have around one million words, encompass-ing the most significant written production of Basque between the 15th and 18th cen-turies. Morphosyntactic tagging will allow for systematic searches at different levels of complexity: lemma, form, part of speech, morphosyntactic feature, and also a number of syntactic constructions. In addition, a set of metadata will enable searches based on socio-historical criteria too. Beyond being the first annotated historical corpus of Basque, through this project tools for language processing will be improved by analysing Basque historical varieties more or less distant from present-day standard Basque. Moreover, this project aims to estab-lish a model for further works in historical corpora of Basque.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要