Google Releases Gemma 4 12B: Encoder-Free Multimodal
Google's Gemma 4 12B is encoder-free multimodal — text, audio, video, image — in 16GB VRAM under Apache 2.0. Day-0 in Transformers, llama.cpp, MLX, and Red Hat OpenShift.
Google's Gemma 4 12B is encoder-free multimodal — text, audio, video, image — in 16GB VRAM under Apache 2.0. Day-0 in Transformers, llama.cpp, MLX, and Red Hat OpenShift.
Google releases Gemma 4 MTP (3× speedup), NotebookLM Mind Maps, Gemini API webhooks, and a Gemini Health app — six launches in the 11 days before I/O.
Gemma 4 E2B/E4B bundle function calling and thinking at 2–4B params. LiteRT-LM enables agent skills on mobile — Apache 2.0, cross-platform.
Gemma 4 E2B + WebGPU + Transformers.js enables a fully local Chrome browser agent with no server calls — tabs and browsing data stay on-device.
Google DeepMind releases Gemma 4 under Apache 2.0: four models including a 31B dense ranking #3 globally on LM Arena and on-device E2B/E4B variants with audio, vision, and 256K context.

Google DeepMind's Gemma 4 family launches under Apache 2.0 with MoE architecture, on-device multimodal, and the 31B dense model ranked #3 on LM Arena.
Curated AI insights — sent when there's something worth your inbox.