Benchmarking NLP Toolkits for Enterprise Application

PRICAI 2019: Trends in Artificial Intelligence, Part III (2019)

Abstract
Natural Language Processing (NLP) is an important technology that shapes many of today's AI applications. Many NLP libraries are available that let researchers and developers perform standard NLP tasks (such as sentence segmentation, tokenization, lemmatization, POS tagging, and NER) without developing them from scratch. However, selecting the most suitable library is challenging because of factors such as data type, performance, and compatibility. In this paper, we assess five popular NLP libraries on these standard processing tasks, using datasets crawled from different online news sources in Malaysia. We analyse the results and list the differences among the libraries. The goal of this study is to give users a clear basis for selecting a suitable NLP library for their text analysis task.
Keywords
Natural language processing, Sentence segmentation, Tokenization, Lemmatization, POS tagging, Named entity recognition