Episode Details

Back to Episodes

Fine-tuning and Preference Alignment in a Single Streamlined Process

Published 1 year, 10 months ago
Description

Jiwoo Hong and  Noah Lee of KAIST AI are co-authors of ORPO: Monolithic Preference Optimization without Reference Model

Subscribe to the Gradient Flow Newsletterhttps://gradientflow.substack.com/

Subscribe: AppleSpotify OvercastPocket CastsAntennaPodPodcast AddictAmazon •  RSS.

Detailed show notes can be found on The Data Exchange web site.

Listen Now

Love PodBriefly?

If you like Podbriefly.com, please consider donating to support the ongoing development.

Support Us