Podcast Episodes

Vincent Moens on TorchRL


Episode 52


Dr. Vincent Moens is an Applied Machine Learning Research Scientist at Meta and an author of TorchRL and TensorDict in PyTorch.

Featured References

TorchRL: A data-driven decision-making library fo…


Published on 1 year, 5 months ago

Arash Ahmadian on Rethinking RLHF


Episode 51


Arash Ahmadian is a Researcher at Cohere and Cohere For AI focussed on Preference Training of large language models. He’s also a researcher at the Vector Institute of AI.

Featured Reference

Back to Bas…


Published on 1 year, 5 months ago

Glen Berseth on RL Conference


Episode 50


Glen Berseth is an assistant professor at the Université de Montréal, a core academic member of the Mila - Quebec AI Institute, a Canada CIFAR AI chair, a member of l'Institut Courtois, and co-director o…


Published on 1 year, 5 months ago

Ian Osband


Episode 49


Ian Osband is a research scientist at OpenAI (ex DeepMind, Stanford) working on decision making under uncertainty.

We spoke about: 

- Information theory and RL 

- Exploration, epistemic uncertainty an…


Published on 1 year, 6 months ago

Sharath Chandra Raparthy


Episode 48


Sharath Chandra Raparthy on In-Context Learning for Sequential Decision Tasks, GFlowNets, and more!  

Sharath Chandra Raparthy is an AI Resident at FAIR at Meta, and did his Master's at Mila.  


Feature…


Published on 1 year, 6 months ago

Pierluca D'Oro and Martin Klissarov


Episode 47


Pierluca D'Oro and Martin Klissarov on Motif and RLAIF, Noisy Neighborhoods and Return Landscapes, and more!  

Pierluca D'Oro is a PhD student at Mila and a visiting researcher at Meta.


Martin Klissarov is…


Published on 1 year, 9 months ago

Martin Riedmiller


Episode 46


Martin Riedmiller of Google DeepMind on controlling nuclear fusion plasma in a tokamak with RL, the original Deep Q-Network, Neural Fitted Q-Iteration, Collect and Infer, AGI for control systems, and…


Published on 2 years ago

Max Schwarzer


Episode 45


Max Schwarzer is a PhD student at Mila, with Aaron Courville and Marc Bellemare, interested in RL scaling, representation learning for RL, and RL for science.  Max spent the last 1.5 years at Google …


Published on 2 years, 1 month ago

Julian Togelius


Episode 44


Julian Togelius is an Associate Professor of Computer Science and Engineering at NYU, and cofounder and research director at modl.ai.


  

Featured References  
Choose Your Weapon: Survival Strategies for …


Published on 2 years, 1 month ago

Jakob Foerster


Episode 43


Jakob Foerster on Multi-Agent learning, Cooperation vs Competition, Emergent Communication, Zero-shot coordination, Opponent Shaping, agents for Hanabi and Prisoner's Dilemma, and more.  

Jakob Foerst…


Published on 2 years, 4 months ago




