Episode Details
Back to Episodes
🦃 ThursdAI - Thanksgiving special 24' - Qwen Open Sources Reasoning, BlueSky hates AI, H controls the web & more AI news
Description
Hey ya'll, Happy Thanskgiving to everyone who celebrates and thank you for being a subscriber, I truly appreciate each and every one of you!
We had a blast on today's celebratory stream, especially given that today's "main course" was the amazing open sourcing of a reasoning model from Qwen, and we had Junyang Lin with us again to talk about it! First open source reasoning model that you can run on your machine, that beats a 405B model, comes close to o1 on some metrics 🤯
We also chatted about a new hybrid approach from Nvidia called Hymba 1.5B (Paper, HF) that beats Qwen 1.5B with 6-12x less training, and Allen AI releasing Olmo 2, which became the best fully open source LLM 👏 (Blog, HF, Demo), though they didn't release WandB logs this time, they did release data!
I encourage you to watch todays show (or listen to the show, I don't judge), there's not going to be a long writeup like I usually do, as I want to go and enjoy the holiday too, but of course, the TL;DR and show notes are right here so you won't miss a beat if you want to use the break to explore and play around with a few things!
ThursdAI - Recaps of the most high signal AI weekly spaces is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.
TL;DR and show notes
* Qwen QwQ 32B preview - the first open weights reasoning model (X, Blog, HF, Try it)
* Allen AI - Olmo 2 the best fully open language model (Blog, HF, Demo)
* NVIDIA Hymba 1.5B - Hybrid smol model beating Qwen, SmolLM w/ 6-12x less training (X, Paper, HF)
* Big CO LLMs + APIs
* Anthropic MCP - model context protocol (X,Blog, Spec, Explainer)
* Cursor, Jetbrains now integrate with ChatGPT MacOS app (X)
* Xai is going to be a Gaming company?! (X)
* H company shows Runner H - WebVoyager Agent (X, Waitlist)
* This weeks Buzz
* Interview w/ Thomas Cepelle about Weave scorers and guardrails (Guide)
* Vision & Video
* OpenAI SORA API was "leaked" on HuggingFace (here)
* Runway launches video Expand feature (