Provably Efficient Generalized Lagrangian Policy Optimization for Safe Multi-Agent Reinforcement Learning.

LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211(2023)

引用 0|浏览80
暂无评分
关键词
safe multi-agent reinforcement learning,constrained Markov game,upper confidence reinforcement learning,generalized Lagrange multiplier method,online mirror descent
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要