The Winds of AI Winter (Q2 Four Wars Recap) + ChatGPT Voice Mode Preview
Description
Thank you for 1m downloads of the podcast and 2m readers of the Substack! 🎉
This is the audio discussion following The Winds of AI Winter essay that also serves as a recap of Q2 2024 in AI viewed through the lens of our Four Wars framework. Enjoy!
Full Video Discussion
Full show notes are here.
Timestamps
* [00:00:00] Intro Song by Suno.ai
* [00:02:01] Swyx and Alessio in Singapore
* [00:05:49] GPU Rich vs Poors: Frontier Labs
* [00:06:35] GPU Rich Frontier Models: Claude 3.5
* [00:10:37] GPU Rich helping Poors: Llama 3.1: The Synthetic Data Model
* [00:15:41] GPU Rich helping Poors: Frontier Labs Vibe Shift - Phi 3, Gemma 2
* [00:18:26] GPU Rich: Mistral Large
* [00:21:56] GPU Rich: Nvidia + FlashAttention 3
* [00:23:45] GPU Rich helping Poors: Noam Shazeer & Character.AI
* [00:28:14] GPU Poors: On Device LLMs: Mozilla Llamafile, Chrome (Gemini Nano), Apple Intelligence
* [00:35:33] Quality Data Wars: NYT vs The Atlantic lawyer up vs partner up
* [00:37:41] Quality Data Wars: Reddit, ScarJo, RIAA vs Udio & Suno
* [00:41:03] Quality Data Wars: Synthetic Data, Jagged Intelligence, AlphaProof
* [00:45:33] Multimodality War: ChatGPT Voice Mode, OpenAI demo at AIEWF
* [00:47:34] Multimodality War: Meta Llama 3 multimodality + Chameleon
* [00:50:54] Multimodality War: PaliGemma + CoPaliGemma
* [00:52:55] Renaming RAG/Ops War to LLM OS War
* [00:55:31] LLM OS War: Ops War: Prompt Management vs Gateway vs Observability
* [01:02:57] LLM OS War: BM42 Vector DB Wars, Memory Databases, GraphRAG
* [01:06:15] LLM OS War: Agent Tooling
* [01:08:26] LLM OS War: Agent Protocols
* [01:10:43] Trend: Commoditization of Intelligence
* [01:16:45] Trend: Vertical Service as Software, AI Employees, Brightwave, Dropzone
* [01:20:44] Trend: Benchmark Frontiers after MMLU
* [01:23:31] Crowdstrike will save us from Skynet
* [01:24:30] Bonus: ChatGPT Advanced Voice Mode Demo
* [01:25:37] Voice Mode: Storytelling
* [01:27:55] Voice Mode: Accents
* [01:31:48] Voice Mode: Accent Detection
* [01:35:00] Voice Mode: Nonverbal Emotions
* [01:37:53] Voice Mode: Multiple Voices in One
* [01:40:52] Voice Mode: Energy Levels Detection
* [01:42:03] Voice Mode: Multilinguality
* [01:43:53] Voice Mode: Shepard Tone
* [01:46:57] Voice Mode: Generating Tones
* [01:49:39] Voice Mode: Interruptions don't work
* [01:49:55] Voice Mode: Reverberations
* [01:51:37] Voice Mode: Mimicry doesn't work
Transcript
Charlie [00:01:08]: Welcome back, listeners. This is your AI co-host, Charlie. It's been a few months since we took a step back from the interview format and talked about the show. We're happy to share that we have crossed one million downloads and two million reads on Substack. Woo-hoo. We are really grateful to those of you who keep tuning in and sharing us with your friends, especially if you watch and comment on our new YouTube channel, where we are trying to grow next. For a special millionaire edition, Swyx and Alessio are finally back in person in sunny Singapore to discuss the big vibe shift in the last three months, which we are calling the Winds of AI Winter. We also discuss my nemesis, ChatGPT Advanced Voice Mode, with a special treat for those who stay till the end. Now, more than ever, watch out and take care.
Alessio [00:02:02]: Hey, everyone. Welcome to the Latent Space Podcast