Live Intel Feed
Our live intelligence pipeline monitors AI industry developments in real time. Breaking stories will appear here as they are verified.
June 7, 2026
Lindy Switches 100% of Traffic to DeepSeek v4, Leaves Anthropic
Lindy CEO Flo Crivello has switched 100% of Lindy's production traffic to DeepSeek v4, churning from Anthropic models — saving millions of dollars while reporting performance improvements on core use cases.
LlamaIndex CEO: Framework Era Is Over, Pivots to Managed Infra
LlamaIndex CEO Jerry Liu declares the AI framework era over: the orchestration patterns LlamaIndex once wrapped have migrated into Claude Code, OpenClaw, and MCP tools. LlamaIndex is pivoting fully to LlamaParse (extraction accuracy) and LlamaCloud (managed service).
NVIDIA RTX Spark: 128GB Unified Memory for On-Device AI
NVIDIA announces RTX Spark, a desktop/laptop mini-superchip with Blackwell GPU cores (~6,100), up to 20-core CPU, ~1 PFLOP FP4, and up to 128GB unified memory — enough to run frontier open models locally. Ships fall 2026.
ByteDance Open-Sources Bernini: Unified AI Video Editor
ByteDance has open-sourced Bernini, a unified video editing model comparable to Gemini Omni for video: edit by text, image, or video reference — add/remove characters, change background/season, and maintain multi-image character consistency. ComfyUI integration underway.
Town AI Raises $55M Series A: 'Devin for Everything Else'
Town AI has raised a $55M Series A led by Aaron Rampell at a16z, with Forerunner Ventures, First Round Capital, and Conviction. The startup's AI handles email, calendar, Slack, scheduling, and multi-step project tasks — acting only when the user approves.
Microsoft AI Designs Quantum Chip 1,000x More Reliable
Microsoft used its Discovery AI system to design a new quantum chip material stack reportedly 1,000x more reliable than prior qubit designs, halving its target timeline to a scalable quantum computer to 2029.
Google Magenta RealTime 2: 200ms On-Device Music Generation
Google releases Magenta RealTime 2, the only open-weights model for real-time continuous music generation on-device: 200ms latency, steerable by text, audio, or MIDI, and runs on a MacBook (2.4B parameters, no GPU required).
OpenAI Model Finds Counterexample to 80-Year-Old Erdős Conjecture
Researchers working with an OpenAI model have found a counterexample to an 80-year-old Erdős conjecture in mathematics — an example of AI-assisted mathematical discovery beyond benchmark performance.
Harvey + LangChain Labs: Legal AI Verification 1,000x Cheaper
Harvey and LangChain Labs publish a legal AI verifier efficiency study: batch LLM-as-judge scoring reduces verification cost ~1,000x versus per-criterion calls. DeepSeek v4 Flash preserves 94-96% of Opus 4.7 verifier signal at 18x lower cost per criterion.
MiniMax M3: Open-Weights 1M-Context Agentic Coding Leader
MiniMax unveils M3: an open-source agentic-coding model with a 1M token context window, multimodal inputs, and SOTA performance on SWE-bench-Pro — beating GPT-5.5. API available at ~$0.20/M tokens; weights release in 1-2 weeks.
ParseBench at CVPR 2026: First AI-Agent Doc Benchmark
LlamaIndex has presented ParseBench at CVPR 2026 — the first document-understanding benchmark designed for AI agents, covering 2,000+ human-verified pages, 167K+ test rules, and 5 evaluation dimensions. Fully open source.
LangChain Launches LangSmith Platform Blitz: Fleet, Engine, Gateway
LangChain shipped a coordinated product blitz: LangSmith Fleet (no-code natural-language agent builder), LangSmith Engine (automated trace-to-issue-to-fix-to-eval loop), LangSmith Sandboxes GA (stateful CLI-accessible compute), and LLM Gateway (traceable governance inside LangSmith).
Ideogram 4.0 Open Weights Released: Top Open Image Model
Ideogram AI has released open weights for Ideogram 4.0, claiming the #1 position among open image models. Weights are downloadable, fine-tunable on custom data, and available at huggingface.co/ideogram-ai/ideogram-4-nf4.
Claude Code Dynamic Workflows: AI Self-Authored Harnesses
Claude Code Dynamic Workflows, live since May 28, allow Claude to author its own JavaScript orchestration harness with subagents, parallel execution, per-agent model selection, and agent isolation — giving AI structural control over its own execution architecture.
NVIDIA Cosmos 3 Tops 7 Physical AI Leaderboards
NVIDIA releases Cosmos 3, an open-weights physical AI foundation model that ranks #1 on 7 leaderboards spanning world generation (PAI-Bench, Physics-IQ, R-Bench), robot action policy (RoboLab), and industrial vision (VANTAGE-Bench, TAR).
Perplexity Launches Computer: Fully-Hosted General AI Agent
Perplexity has launched Perplexity Computer, a fully-hosted AI agent with threaded tasks, hundreds of pre-built connectors, custom skills, sub-agent delegation, and orchestrator model choice (Opus 4.6, GPT-5.4, or Sonnet 4.6).
Cognition Offers $10M AI Productivity Guarantee for Devin
Cognition launches an 'AI Productivity Guarantee' for Devin: if the AI coding agent delivers less engineering value than the customer pays for, Cognition funds usage until it does — up to $10M. Enterprise private evals extend to 100 hours.
Gradio 6.16.0 Patches Path Traversal, OAuth, and SSRF
Gradio 6.16.0 patches three security vulnerabilities: path traversal in gr.FileExplorer, an open-redirect bypass in OAuth, and SSRF in Image/Gallery/Audio post-processing. Upgrade immediately.
AI CEOs Sign Biosecurity Letter Urging Congress to Act
Sam Altman, Dario Amodei, and Demis Hassabis have co-signed a letter urging Congress to increase oversight of synthetic nucleic acid supply chains, citing AI's growing biological capabilities as a dual-use risk.
S&P DJI Rejects SpaceX S&P 500 Fast-Track
S&P Dow Jones Indices has formally rejected all proposals to fast-track SpaceX into the S&P 500, maintaining the 12-month IPO seasoning period, positive GAAP net income requirement, and 10% public float — SpaceX's earliest eligibility is now June 2027.
GitHub Launches MCP Apps: Interactive UI Rendered in Chat
GitHub and Microsoft have unveiled MCP Apps, extending the Model Context Protocol to let server tools return rich interactive HTML components rendered in sandboxed iframes inside VS Code chat. Shopify, Excalidraw, and Figma are early adopters.
Stripe Launches Three Agentic Payment Primitives
Stripe unveils Shared Payment Tokens (scoped mandate credentials), Machine Payments Protocol (HTTP 402 pay-per-tool-call), and the OpenAI-co-built Agentic Commerce Protocol for deterministic agent checkout. Stripe Projects wraps all three.
Supabase Raises $500M at $10B Valuation
Supabase raises $500M at a $10B valuation, with an employee-friendly equity structure: 25% cashout option on vested shares and a 10-year exercise window compared to the standard 3 months.
Google Releases Gemma 4 12B: Encoder-Free Multimodal
Google releases Gemma 4 12B under Apache 2.0: unified encoder-free multimodal model with 256K context, native audio/image/video support, agentic reasoning — runs in 16GB VRAM with Day 0 Transformers, llama.cpp, and MLX support.
ChatGPT Launches Automatic Context-Aware Memory System
OpenAI launches a new automatic memory system for ChatGPT with 2x more capacity, rolling out to US Plus and Pro users. The system tracks context dynamically across conversations and adapts over time to temporal changes.
Claude Mythos Hits 3-Hour Autonomous Task Horizon
Claude Mythos achieved a 3-hour 6-minute autonomous task horizon on METR's benchmark in late May — hitting the median superforecaster prediction for end of 2026, months ahead of schedule.
Anthropic Files for IPO Confidentially at ~$965B Valuation
Anthropic has filed for an IPO confidentially at approximately $965B valuation. President Daniela Amodei cites the high cost of developing AI models as the primary driver for going to public markets.
Anthropic: 80% of Merged Code Now Claude-Authored
Anthropic publishes internal data: >80% of merged code is Claude-authored, engineers ship 8x more code, open-ended task success jumped from 26% to 76% in 6 months, and a training optimization reached 52x speedup.
Microsoft Launches MAI: Seven In-House AI Models
Microsoft ships seven MAI models trained from scratch on commercially licensed data, headlined by MAI-Thinking-1 (35B, 97% AIME, 53% SWE-Bench Pro) — a strategic bid for frontier model independence from OpenAI.
NVIDIA Releases Nemotron 3 Ultra: 550B Open Model
NVIDIA releases Nemotron 3 Ultra — a 550B-parameter MoE model with 1M token context, 5x faster inference, and full open weights under OpenMDW-1.1 on Hugging Face.