On the Value of Prior in Online Learning to Rank

Branislav Kveton,Ofer Meshi,Masrour Zoghi,Zhen Qin

International Conference on Artificial Intelligence and Statistics (AISTATS)（2022）

引用 3|浏览27

暂无评分

摘要

This paper addresses the cold-start problem in online learning to rank (OLTR). We show both theoretically and empirically that priors improve the quality of ranked lists presented to users interactively based on user feedback. These priors can come in the form of unbiased estimates of the relevance of the ranked items, or more practically, can be obtained from offline-learned models. Our experiments show the effectiveness of priors in improving the short-term regret of tabular OLTR algorithms, based on Thompson sampling and BayesUCB.

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要