First Results on Detecting Term Evolutions

Nina Tahmasebi,Sukriti Ramesh,Thomas Risse

msra

引用 26|浏览103

暂无评分

摘要

The archival of content like publications or web pages is just the first step toward "full" content preservation. It also has to be guaranteed that content can be found and inter- preted in the long run. The correspondence between the terminology used for querying and the one used in content objects to be retrieved, is a crucial prerequisite for effective retrieval technology. However, as terminology evolves over time, a growing gap opens between older documents in (long- term) archives and the active language used for querying such archives. Thus, technologies for detecting and system- atically handling terminology evolution are required to en- sure "semantic" accessibility of archived content in the long run. The core of our approach is to derive mappings between terminologies originating from different times by the fusion of term concept graphs. To verify the suitability of our ap- proach, we present first results of experiments conducted on The Times archive that covers 200 years of documents. In addition, we discuss how our approach can be applied to web archives and the challenges that arise from this.

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要