BOND: Aligning LLMs with Best-of-N Distillation
Pier Giuseppe Sessa, Robert Dadashi, Léonard Hussenot, Johan Ferret, Nino Vieillard, Alexandre Ramé, Bobak Shahriari, Sarah Perrin, Abe Friesen, Geoffrey Cideron, Sertan Girgin, Piotr Stanczyk, Andrea Michi, Danila Sinopalnikov, Sabela Ramos, Amélie Héliou, Aliaksei Severyn, Matt Hoffman, Nikola Momchev, Olivier Bachem. CoRR (2024)
