Learning to Shape Rewards using a Game of Switching Controls
David Mguni, Jianhong Wang, Taher Jafferjee, Nicolas Perez Nieves, Wenbin Song, Yaodong Yang, Feifei Tong, Hui Chen, Jiangcheng Zhu, Yali Du, Jun Wang
ArXiv (2021)
Cited by: 0 | Views: 30
Keywords: shape rewards, switching controls, learning