Podcast Episode Details

Back to Podcast Episodes
Arash Ahmadian on Rethinking RLHF

Arash Ahmadian on Rethinking RLHF


Episode 51


Arash Ahmadian is a Researcher at Cohere and Cohere For AI focussed on Preference Training of large language models. He’s also a researcher at the Vector Institute of AI.

Featured Reference

Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs

Arash Ahmadian, Chris Cremer, Matthias Gallé, Marzieh Fadaee, Julia Kreutzer, Olivier Pietquin, Ahmet Üstün, Sara Hooker


Additional References


Published on 1 year, 9 months ago






If you like Podbriefly.com, please consider donating to support the ongoing development.

Donate