Tag
Gemma 4
Gemma 4 is Google’s open-weight model family focused on long context, multimodal input, and flexible cloud deployment. With 256K context, vision, audio, and Apache 2.0 licensing, it matters for teams using Vertex AI, Cloud Run, GKE, or TPUs.
3 articles

Tools & Apps/May 9
Gemma 4 assistant models get faster draft tokens
Gemma 4 E2B and E4B assistant models use centroid masking to cut lm_head work about 45x with little quality loss.

Model Releases/Apr 4
Gemma 4 lands on Google Cloud
Google Cloud brings Gemma 4 to Vertex AI, Cloud Run, GKE, and TPUs, with 256K context, vision, audio, and Apache 2.0 licensing.

Research/Apr 3
AIME 2026 leaderboard: Qwen leads math tests
Qwen3.6 Plus tops the AIME 2026 math benchmark with 0.953, while 8 models show a wide gap in olympiad-style reasoning.