Guided Deterministic Policy Optimization with Gradient-Free Policy Parameters Information

Expert Systems with Applications (2023)

Keywords: Deterministic policy gradient, Premature convergence, Local optimum, Policy optimization, Exploration, Sample efficiency