Imitating Language Via Scalable Inverse Reinforcement Learning

Markus Wulfmeier, Michael Bloesch, Nino Vieillard, Arun Ahuja, Jorg Bornschein, Sandy Huang, Artem Sokolov, Matt Barnes, Guillaume Desjardins, Alex Bewley, Sarah Maria Elisabeth Bechtle, Jost Tobias Springenberg, Nikola Momchev, Olivier Bachem, Matthieu Geist, Martin Riedmiller

NeurIPS 2024 (2024)

Keywords: Language Modeling, Inverse Reinforcement Learning, Imitation Learning, Supervised Fine-tuning