Model Releases

Latest AI model releases, benchmarks, and comparisons. Stay up to date with every new model launch from OpenAI, Anthropic, Google, Meta, and more.

Jul 12

GPT-5.6 turns OpenAI into a model menu

I break down OpenAI’s GPT-5.6 rollout, the three-model split, and a copyable way to pick the right model per task.

Jul 10

Seedream 5.0 Pro Is the Right Choice for Editable AI Images

Seedream 5.0 Pro is the best pick for reasoning-driven, editable AI image workflows with multilingual text.

Jul 10

Midjourney v8.2 release is close

Midjourney says v8.2 is nearing release, alongside another Preview option update and news tied to its Medical project.

Jul 9

Rust KRAID enters Mesa for Arm Mali GPUs

Mesa 26.2 adds KRAID, a Rust shader compiler for Arm Mali v9+ GPUs, with its first dEQP test already passing.

Jul 9

OpenAI Opens GPT-5.6 and Launches Live Voice AI

OpenAI is widening GPT-5.6 access and rolling out GPT-Live voice models after a brief government-limited preview.

Jul 6

Mistral is right to push Leanstral into proof engineering

Mistral’s Leanstral 1.5 is a smart bet on formal proof engineering as a real product category.

Jul 5

Google’s June 2026 AI updates put live translation first

Google’s June 2026 AI updates center on Gemini 3.5 Live Translate, Android 17, and new tools for agents, research, and learning.

Jul 3

Mistral Small 2603: 256K context for $0.15 in

Mistral Small 2603 pairs a 256K context window with $0.15 input pricing, $0.60 output pricing, and strong reasoning scores.

Jul 1

ACE-Step 1.5 makes local music generation a real product, not a demo

ACE-Step 1.5 proves local music generation is now good enough to beat many commercial tools.

Jul 1

Sora’s 30-seat electric aircraft clears VTOL tests

Sora Aviation finished subscale VTOL tests for its 30-seat S-1 electric aircraft and is aiming for a full-scale prototype flight in 2028.

Jun 30

K3s v1.34.9 lands with Kubernetes 1.34.9

K3s v1.34.9 updates Kubernetes to 1.34.9 and refreshes Traefik, containerd, and other bundled components.

Jun 29

Kimi 2.7 makes price the real coding benchmark

Kimi 2.7 is a better buy than Claude Fable 5 for most coding teams.

Jun 29

Kimi K2.6 tops coding and agentic AI benchmarks

Moonshot AI’s Kimi K2.6 hits top marks in coding and agentic tasks, with a 262K context window and open-weight pricing at $0.74/$3.50 per 1M tokens.

Jun 29

Llama Legends 3.8.0 adds Season 3 heroes and raids

Llama Legends 3.8.0 adds 100 superhero cards, 12 achievements, four raid bosses, and the Atlas Ancient card.

Jun 29

oMLX 0.4.5.dev1 speeds up GLM-5.2 and MiniMax M3

oMLX 0.4.5.dev1 adds custom kernels for GLM-5.2 and MiniMax M3, plus cache fixes and better model profile exposure.

Jun 29

Grok 4.5 enters private beta at Tesla and SpaceX

xAI’s Grok 4.5 has entered private beta inside Tesla and SpaceX, its first internal rollout.

Jun 27

Google OpenRL brings RL fine-tuning to Kubernetes

Google’s OpenRL lets teams run LLM post-training and fine-tuning on their own Kubernetes clusters.

Jun 27

DiffusionGemma runs fast on NVIDIA RTX and DGX

Google DeepMind’s DiffusionGemma generates text in parallel, and NVIDIA says RTX and DGX hardware can run it up to 4x faster.

Jun 27

GLM-5.2 beats GPT-5.5 on coding tests

Z.ai’s GLM-5.2 beats GPT-5.5 on several coding benchmarks while claiming far lower cost.

Jun 27

OpenAI narrows GPT-5.6 rollout after U.S. request

OpenAI is limiting GPT-5.6 Sol, Terra, and Luna to trusted partners before a wider release.

Jun 27

Ubuntu 26.10 Snapshot 2 adds GNOME 50 and kernel 7.0

Ubuntu 26.10 Snapshot 2 is out for testing with kernel 7.0, GNOME 50, and planned upgrades to kernel 7.2, GNOME 51, and Mesa 26.2.

Jun 27

Claude Fable 5 launches with 1M context, $10/$50 pricing

Anthropic adds Claude Fable 5 and limited-release Claude Mythos 5, both with 1M-token context, 128K output, and new refusal handling.

Jun 26

Google Pushes Gemini 3.5 Pro to July

Google pushed Gemini 3.5 Pro from June to July after early tester feedback and added pressure from OpenAI and Anthropic.

Jun 26

Gemini 3.5 Flash makes computer use a default, not a demo

Google is right to make computer use a native Gemini 3.5 Flash feature.

Jun 26

Xiaomi MiMo-V2.5-Pro: pricing, benchmarks, and limits

Xiaomi’s MiMo-V2.5-Pro pairs a 1M-token context with strong coding, agentic, and reasoning scores at mid-range pricing.

Jun 25

OpenAI’s Sora hardware targets enterprise video

OpenAI’s Sora enterprise hardware brings local AI video generation to studios, agencies, and firms that need speed and privacy.

Jun 24

GPT-5.6 rumors point to 2M context and coding gains

Rumors point to GPT-5.6 and GPT-5.6 Pro arriving June 25 with 2M context, stronger coding agents, and lower prices than rivals.

Jun 24

Kimi’s long-context push keeps getting bigger

Moonshot AI’s Kimi chatbot keeps expanding context, agents, and model size, with Kimi K2.5 arriving in January 2026.

Jun 23

Midjourney Medical’s 60-Second Body Scan Claim

Midjourney Medical’s concept scanner claims a 60-second whole-body ultrasound scan, but the clinical evidence and FDA path are still open.

Jun 21

Apple pushes AI deeper into iPhone apps

Apple’s 2026 Apple Intelligence update adds AI editing, Siri upgrades, Safari tools, and on-device privacy across its platforms.
