5 articles

#long-context

MiniMax M3: Open-Weights 1M-Context Agentic Coding Leader

MiniMax M3 offers a 1M-token context and beats GPT-5.5 on SWE-bench-Pro, with open weights due in 1-2 weeks. The API is available now at $0.20/M tokens, a fraction of frontier pricing.

June 7, 2026 · 1 min read

Technology · Breaking

SubQ Ships 12M-Token-Context Frontier Model — Order-of-Magnitude Jump

SubQ's 12M-token context frontier model is a step-change above the 1–2M plateau and forces a fundamental rethink of RAG vs in-context architectures.

May 10, 2026 · 1 min read

Technology · Breaking

xAI Ships Grok 4.3 With 1M Context and 40% Price Cut

xAI's Grok 4.3 launches with 1M-token context, native video input, always-on reasoning, and the most aggressive long-context pricing in the frontier tier.

May 7, 2026 · 1 min read

Technology · Breaking

Sub Quadratic Launches SubQ Claiming 12M-Token Context at 52× Efficiency

SubQ launch: 12M-token context, sparse attention, 52× FlashAttention speed claimed — but benchmarks are on the 1M-preview only. No technical report. Healthy skepticism warranted.

May 6, 2026 · 1 min read

Industry · Significant

DeepSeek V4: 1M-Context Open Weights, 1/7 Opus 4.7 Pricing

DeepSeek V4 drops two open-weight models with 1M-context by default, CSA+HCA hybrid attention, and V4-Pro priced at roughly 1/7 Opus 4.7's output cost.

April 26, 2026 · 2 min read
