Episode Details
Back to Episodes
📅 ThursdAI - May 23 - OpenAI troubles, Microsoft Build, Phi-3 small/large, new Mistral & more AI news
Description
Hello hello everyone, this is Alex, typing these words from beautiful Seattle (really, it only rained once while I was here!) where I'm attending Microsoft biggest developer conference BUILD.
This week we saw OpenAI get in the news from multiple angles, none of them positive and Microsoft clapped back at Google from last week with tons of new AI product announcements (CoPilot vs Gemini) and a few new PCs with NPU (Neural Processing Chips) that run alongside CPU/GPU combo we're familiar with. Those NPUs allow for local AI to run on these devices, making them AI native devices!
While I'm here I also had the pleasure to participate in the original AI tinkerers thanks to my friend Joe Heitzberg who operates and runs the aitinkerers.org (of which we are a local branch in Denver) and it was amazing to see tons of folks who listen to ThursdAI + read the newsletter and talk about Weave and evaluations with all of them! (Btw, one the left is Vik from Moondream, which we covered multiple times). I
Ok let's get to the news:
TL;DR of all topics covered:
* Open Source LLMs
* HuggingFace commits 10M in ZeroGPU (X)
* Microsoft open sources Phi-3 mini, Phi-3 small (7B) Medium (14B) and vision models w/ 128K context (Blog, Demo)
* Mistral 7B 0.3 - Base + Instruct (HF)
* LMSys created a "hard prompts" category (X)
* Cohere for AI releases Aya 23 - 3 models, 101 languages, (X)
* Big CO LLMs + APIs
* Microsoft Build recap - New AI native PCs, Recall functionality, Copilot everywhere
* Will post a dedicated episode to this on Sunday
* OpenAI pauses GPT-4o Sky voice because Scarlet Johansson complained
* Microsoft AI PCs - Copilot+ PCs (Blog)
* Anthropic - Scaling Monosemanticity paper - about mapping the features of an LLM (X, Paper)
* Vision & Video
* OpenBNB - MiniCPM-Llama3-V 2.5 (X, HuggingFace)
* Voice & Audio
* OpenAI pauses Sky voice due to ScarJo hiring legal counsel
* Tools & Hardware
* Humane is looking to sell (blog)
Open Source LLMs
Microsoft open sources Phi-3 mini, Phi-3 small (7B) Medium (14B) and vision models w/ 128K context (Blog, Demo)
Just in time for Build, Microsoft has open sourced the rest of the Phi family of models, specifically the small (7B) and the Medium (14B) models on top of the mini one we just knew as Phi-3.
All the models have a small co