Mastering the Game of Stratego with Model-Free Multiagent Reinforcement LearningJulien Pérolat,Bart De Vylder,Daniel Hennes,Eugene Tarassov,Florian Strub,Vincent de Boer,Paul Müller,Jerome T. Connor,Neil Burch,T Anthony,Stephen McAleer,Romuald Élie,Sarah H. Cen,Zhe Wang,Audrūnas Gruslys,Aleksandra Malysheva,Mina Khan,Sherjil Ozair,Finbarr Timbers,Toby Pohlen,Tom Eccles,Mark Rowland,Marc Lanctot,Jean-Baptiste Lespiau,Bilal Piot,Shayegan Omidshafiei,Edward Lockhart,Laurent Sifre,Nathalie Beauguerlange,Rémi Munos,David Silver,Satinder Singh,Demis Hassabis,Karl TuylsarXiv (Cornell University)(2022)引用 0|浏览56暂无评分关键词stratego,learning,game,model-freeAI 理解论文溯源树样例生成溯源树,研究论文发展脉络Chat Paper正在生成论文摘要