BOND: Aligning LLMs with Best-of-N Distillation
Pier Giuseppe Sessa,Robert Dadashi,Léonard Hussenot-Desenonges,Johan Ferret,Nino Vieillard,Alexandre Rame, Bobak Shahriari,Sarah Perrin,Abram Friesen,Geoffrey Cideron,Sertan Girgin,Piotr Stanczyk,Andrea Michi,Danila Sinopalnikov,Sabela Ramos Garea,Amélie Héliou,Aliaksei Severyn,Matthew Hoffman,Nikola Momchev,Olivier Bachem ICRA 2025(2025)
AI 理解论文
溯源树
样例
