Named-Entity Recognition for Portuguese Police Reports
semanticscholar(2018)
摘要
During a criminal investigation several text documents are produced by police officers, creating a deluge of unstructured data obtained from heterogeneous sources. Therefore, identification and recognition of entities, i.e. places, organizations or persons, by a natural language pipeline, with named-entities recognition task, could help police officers to understand and find relevant information in data extracted. We aim to defined a natural language processing pipeline to identify and recognize entities from these police reports, supported by two trained corpus, namely Amazonia and a Portuguese News Corpus. Additionally, we evaluate named-entities recognition systems, focus in Portuguese language, with a dataset produced by the Portuguese police. We then evaluate the performance obtained on the information retrieval process applied to the dataset.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要