Contextual Bandits in a Collaborative Environment
SIGIR, pp. 529-538, 2016.
Contextual bandit algorithms provide principled online learning solutions to find optimal trade-offs between exploration and exploitation with companion side-information. They have been extensively used in many important practical scenarios, such as display advertising and content recommendation. A common practice estimates the unknown ba...