Episode Details

Back to Episodes
📅 ThursdAI - Aug 29 - AI Plays DOOM, Cerebras breaks inference records, Google gives new Geminis, OSS vision SOTA & 100M context windows!?

📅 ThursdAI - Aug 29 - AI Plays DOOM, Cerebras breaks inference records, Google gives new Geminis, OSS vision SOTA & 100M context windows!?

Published 1 year, 7 months ago
Description

Hey, for the least time during summer of 2024, welcome to yet another edition of ThursdAI, also happy skynet self-awareness day for those who keep track :)

This week, Cerebras broke the world record for fastest LLama 3.1 70B/8B inference (and came on the show to talk about it) Google updated 3 new Geminis, Anthropic artifacts for all, 100M context windows are possible, and Qwen beats SOTA on vision models + much more!

As always, this weeks newsletter is brought to you by Weights & Biases, did I mention we're doing a hackathon in SF in September 21/22 and that we have an upcoming free RAG course w/ Cohere & Weaviate?

TL;DR

* Open Source LLMs

* Nous DisTrO - Distributed Training (X , Report)

* NousResearch/ hermes-function-calling-v1 open sourced - (X, HF)

* LinkedIN Liger-Kernel - OneLine to make Training 20% faster & 60% more memory Efficient (Github)

* Cartesia - Rene 1.3B LLM SSM + Edge Apache 2 acceleration (X, Blog)

* Big CO LLMs + APIs

* Cerebras launches the fastest AI inference - 447t/s LLama 3.1 70B (X, Blog, Try It)

* Google - Gemini 1.5 Flash 8B & new Gemini 1.5 Pro/Flash (X, Try it)

* Google adds Gems & Imagen to Gemini paid tier

* Anthropic artifacts available to all users + on mobile (Blog, Try it)

* Anthropic publishes their system prompts with model releases (release notes)

* OpenAI has project Strawberry coming this fall (via The information)

* This weeks Buzz

* WandB Hackathon hackathon hackathon (Register, Join)

* Also, we have a new RAG course w/ Cohere and Weaviate (RAG Course)

* Vision & Video

* Zhipu AI CogVideoX - 5B Video Model w/ Less 10GB of VRAM (X, HF, Try it)

* Qwen-2 VL 72B,7B,2B - new SOTA vision models from QWEN (X, Blog,

Listen Now