PubMed Author-assigned Keyword Extraction (PubMedAKE) Benchmark

Jiasheng Sheng,Zelalem Gero,Joyce C Ho

Conference on Information and Knowledge Management(2022)

引用 1|浏览6
暂无评分
摘要
ABSTRACTWith the ever-increasing abundance of biomedical articles, improving the accuracy of keyword search results becomes crucial for ensuring reproducible research. However, keyword extraction for biomedical articles is hard due to the existence of obscure keywords and the lack of a comprehensive benchmark. PubMedAKE is an author-assigned keyword extraction dataset that contains the title, abstract, and keywords of over 843,269 articles from the PubMed open access subset database. This dataset, publicly available on Zenodo, is the largest keyword extraction benchmark with sufficient samples to train neural networks. Experimental results using state-of-the-art baseline methods illustrate the need for developing automatic keyword extraction methods for biomedical literature.
更多
查看译文
关键词
PubMed literature,datasets,keyphrases extraction,keywords extraction
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要