The twist measure for IR evaluation: Taking user's effort into account

Journal of the Association for Information Science and Technology(2016)

引用 30|浏览25
暂无评分
摘要
AbstractWe present a novel measure for ranking evaluation, called Twist ï ź. It is a measure for informational intents, which handles both binary and graded relevance. ï ź stems from the observation that searching is currently a that searching is currently taken for granted and it is natural for users to assume that search engines are available and work well. As a consequence, users may assume the utility they have in finding relevant documents, which is the focus of traditional measures, as granted. On the contrary, they may feel uneasy when the system returns nonrelevant documents because they are then forced to do additional work to get the desired information, and this causes avoidable effort. The latter is the focus of ï ź, which evaluates the effectiveness of a system from the point of view of the effort required to the users to retrieve the desired information. We provide a formal definition of ï ź, a demonstration of its properties, and introduce the notion of effort/gain plots, which complement traditional utility-based measures. By means of an extensive experimental evaluation, ï ź is shown to grasp different aspects of system performances, to not require extensive and costly assessments, and to be a robust tool for detecting differences between systems.
更多
查看译文
关键词
information retrieval,retrieval effectiveness,evaluation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要