6 articles

#gemma-4

Google Releases Gemma 4 12B: Encoder-Free Multimodal

Google's Gemma 4 12B is encoder-free multimodal — text, audio, video, image — in 16GB VRAM under Apache 2.0. Day-0 in Transformers, llama.cpp, MLX, and Red Hat OpenShift.

June 7, 20261 min read

Technologybreaking

Google Fires 6 AI Product Launches in 11 Days Before Google I/O

Google releases Gemma 4 MTP (3× speedup), NotebookLM Mind Maps, Gemini API webhooks, and a Gemini Health app — six launches in the 11 days before I/O.

May 9, 20261 min read

Technologybreaking

Google Gemma 4 E2B/E4B Enables Agent Skills on Edge Devices via LiteRT-LM

Gemma 4 E2B/E4B bundle function calling and thinking at 2–4B params. LiteRT-LM enables agent skills on mobile — Apache 2.0, cross-platform.

May 4, 20261 min read

Technologybreaking

Gemma 4 Powers Fully Local Browser Agent via WebGPU — No Server, No API Key

Gemma 4 E2B + WebGPU + Transformers.js enables a fully local Chrome browser agent with no server calls — tabs and browsing data stay on-device.

April 29, 20261 min read

Technologybreaking

Google Gemma 4 Launches Under Apache 2.0: 31B Ranks #3 on LM Arena

Google DeepMind releases Gemma 4 under Apache 2.0: four models including a 31B dense ranking #3 globally on LM Arena and on-device E2B/E4B variants with audio, vision, and 256K context.

April 28, 20261 min read

Gemma 4 MoE architecture visualization: glowing modular expert nodes on a crystalline AI chip with on-device smartphone and leaderboard motifs

TechnologyNotable

Gemma 4 Beats Models 20× Its Size on LM Arena, Goes Apache 2.0

Google DeepMind's Gemma 4 family launches under Apache 2.0 with MoE architecture, on-device multimodal, and the 31B dense model ranked #3 on LM Arena.

April 28, 20262 min read

AI Intelligence Newsletter

Curated AI insights — sent when there's something worth your inbox.