The Healing Power of Poison: Helpful Non-relevant Documents in Feedback

ACM International Conference on Information and Knowledge Management(2016)

引用 5|浏览61
暂无评分
摘要
The use of feedback information is an effective approach to address the vocabulary gap between a user’s query and the relevant documents. It has been shown that some relevant documents act like “poison pills,” i.e. they hurt the performance of feedback systems despite the fact that they are relevant. In this paper, we study the positive counterpart of this by investigating the helpfulness of nonrelevant documents in feedback. In general, we find that although documents that are explicitly judged as non-relevant are normally assumed to be poisonous for feedback systems, sometimes considering high-scored non-relevant documents as a positive feedback helps to improve the performance of retrieval. In our experimental data, we observe a considerable fraction of non-relevant documents in higher ranked positions of the initial retrieval run, for most of the topics. Hence, by ignoring the potential value of non-relevant documents, we may loose a lot of useful information. We design a set of experiments with existing state-of-the-art feedback methods to investigate the potential contribution of nonrelevant documents. Our main findings are the following. First, we find that some of the non-relevant documents are exclusively helpful, they improve retrieval on their own, and others are complementary helpful, they lead to further improvement when added to a set of relevant documents. Second, we discover that, on average, exclusively helpful non-relevant documents have a higher contribution to the performance improvement, compared to the complementary ones. Third, we show that non-relevant documents in topics with poor average precision in the initial retrieval are more likely to help in the feedback.
更多
查看译文
关键词
Feedback,Helpful Non-relevant Documents,Relevance Feedback,Pseudo-Relevance Feedback
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要