Chrome Extension
WeChat Mini Program
Use on ChatGLM

Build Complementary Models on Human Feedback for Simulation to the Real World

Knowledge-Based Systems(2021)

Cited 0|Views22
Key words
Safe reinforcement learning,Human-in-the-loop reinforcement learning,Markov decision processes,Supervised learning
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined