Finding the Optimal Exploration-Exploitation Trade-off Online Through Bayesian Risk Estimation and Minimization

ARTIFICIAL INTELLIGENCE(2024)

引用 0|浏览37
暂无评分
关键词
Bayesian risk,Stochastic online learning,Multi-armed bandits,Partial monitoring
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要