Episode Details
Back to Episodes
📅 ThursdAI - Live @ NeurIPS, Mixtral, GeminiPro, Phi2.0, StripedHyena, Upstage 10B SoTA & more AI news from last (insane) week
Description
Wow what a week. I think I’ve reached to a level that I’m not phased by incredible weeks or days that happen in AI, but I… guess I still have much to learn!
TL;DR of everything we covered (aka Show Notes)
* Open Source LLMs
* Mixtral MoE - 8X7B experts dropped with a magnet link again (Announcement, HF, Try it)
* Mistral 0.2 instruct (Announcement, HF)
* Upstage Solar 10B - Tops the HF leaderboards (Announcement)
* Together -Striped Hyena architecture and new models (Announcement)
* EAGLE - a new decoding method for LLMs (Announcement, Github)
* Deci.ai - new SOTA 7B model
* Phi 2.0 weights are available finally from Microsoft (HF)
* QuiP - LLM quantization & Compression (link)
* Big CO LLMs + APIs
* Gemini Pro access over API (Announcement, Thread)
* Uses character pricing not token
* Mistral releases API inference server - La Platforme (API docs)
* Together undercuts Mistral with serving Mixtral by 70% and announces OAI compatible API
* OpenAI is open sourcing again - Releasing Weak-2-strong generalization paper and github! (announcement)
* Vision
* Gemini Pro api has vision AND video capabilities (API docs)
* AI Art & Diffusion
* Stability announces Zero123 - Zero Shot image to 3d model (Thread)
* Imagen 2 from google (link)
* Tools & Other
* Optimus from Tesla is coming, and it looks incredible
This week started on Friday, as we saw one of the crazier single days in the history of OSS AI that I can remember, and I’ve been doing this now for .. jesus, 9 months!
In a single say, we saw a new Mistral model release called Mixtral, which is a Mixture of Experts (like GPT4 is rumored to be) of 8x7B Mistrals, and beats GPT3.5, we saw a completely new architecture tha