Confident Natural Policy Gradient for Local Planning in $Q_\pi$-Realizable Constrained MDPs

NeurIPS 2024（2024）

Cited 0|Views8

No score

Key words

reinforcement learning,constrained MDP,sample complexity,q-pi realizability,local planning

AI Read Science

Must-Reading Tree

Example

Generate MRT to find the research sequence of this paper

Chat Paper

Summary is being generated by the instructions you defined