MiniMax M3: Open-Weights 1M-Context Agentic Coding Leader
MiniMax M3 offers 1M-token context, beats GPT-5.5 on SWE-bench-Pro, and releases open weights in 1-2 weeks — API now at $0.20/M tokens, a fraction of frontier pricing.
MiniMax M3 offers 1M-token context, beats GPT-5.5 on SWE-bench-Pro, and releases open weights in 1-2 weeks — API now at $0.20/M tokens, a fraction of frontier pricing.
SubQ's 12M-token context frontier model is a step-change above the 1–2M plateau and forces a fundamental rethink of RAG vs in-context architectures.
xAI's Grok 4.3 launches with 1M-token context, native video input, always-on reasoning, and the most aggressive long-context pricing in the frontier tier.
SubQ launch: 12M-token context, sparse attention, 52× FlashAttention speed claimed — but benchmarks are on the 1M-preview only. No technical report. Healthy skepticism warranted.

DeepSeek V4 drops two open-weight models with 1M-context by default, CSA+HCA hybrid attention, and V4-Pro priced at roughly 1/7 Opus 4.7's output cost.
Curated AI insights — sent when there's something worth your inbox.