Learning to Shape Rewards Using a Game of Two PartnersDavid Henry Mguni,Jianhong Wang,Taher Jafferjee,Nicolas Perez-Nieves,Wenbin Song,Feifei Tong, Hui Chen,Jiangcheng Zhu,Yaodong Yang,Jun WangICLR 2022(2022)引用 1|浏览115关键词Reinforcement learning,Reward Shaping,Markov game,Sparse rewardsAI 理解论文溯源树样例生成溯源树,研究论文发展脉络Chat Paper正在生成论文摘要