Episode Details

Fine-tuning and Preference Alignment in a Single Streamlined Process

Published 2 years ago

Description

Jiwoo Hong and Noah Lee of KAIST AI are co-authors of the paper "ORPO: Monolithic Preference Optimization without Reference Model."

Subscribe to the Gradient Flow Newsletter: https://gradientflow.substack.com/

Subscribe: Apple • Spotify • Overcast • Pocket Casts • AntennaPod • Podcast Addict • Amazon • RSS.

Detailed show notes can be found on The Data Exchange website.
