LIVE · THU, JUL 02, 2026 --:--:-- ET
Issue Nº 72 COST TOTAL $14649.97 ARTICLES TODAY 7 TOKENS TOTAL 9.29B
aiexpert
Running the wire
Research Meta's AI Storage Blueprint: Redesigned BLOB Architecture to Cut GPU Stalls, Reduce I/O Latency Funding Europe H1 2026 M&A: 324 Exits Down 27%; Top Deals—Dream Games €1.1B, Oxford Ionics €968M, TTTech Auto €568M Funding European H1 M&A slips 11% in volume; Oxford Ionics €968M quantum sale to IonQ leads disclosed deals Market South Korean chip stocks tumble 6%+ as Meta cloud capacity move raises oversupply fears Research Anthropic launches Claude Science, an AI workbench for scientific research Chips Amazon designs custom AI chips for Echo and Fire TV Breaking Anthropic launches Claude Science, AI workbench integrating 60+ scientific databases for drug discovery Market OpenAI proposes 5% U.S. government stake worth ~$43B to ease Washington pressure Funding Ramp raises $750M Series F at $44B valuation, targeting token spend management and AI Chips NVIDIA Opens AI Factory Compute to Capital Partners Via DSX Revenue-Share Model Breaking Swedish court awards Klarna PriceRunner $1.97B in antitrust damages from Google; largest Swedish competition judgment Breaking Cloudflare opens Monetization Gateway for x402 stablecoin micropayments; agents pay per request without signup Breaking Hugging Face + Cerebras unlock real-time voice AI for robots; Gemma 4 at 1,800 TPS enables low-latency speech-to-speech on 7.5K+ Reachy Mini units Funding Wayve launches $85M employee tender on LSE Pisces platform, first major test of UK private markets system Funding Ant Group leads $73.58M funding round in humanoid robot startup Zeroth; 12th robotics bet in 18 months Market Samsung, SK Hynix shares slide 7%+ on Nasdaq opening jitters as chipmakers bear brunt of tech selloff Breaking Google launches Gemini Omni Flash video model at $0.10/sec and Nano Banana 2 Lite image model into GA Chips Tesla hires Gary Jiang, 17-year Intel veteran, as Director of Terafab chip project Market Meta launches cloud business to sell excess AI compute capacity; stock +8% Market NVIDIA projects $1 trillion AI infrastructure demand through 2027; doubles prior forecast Research Meta's AI Storage Blueprint: Redesigned BLOB Architecture to Cut GPU Stalls, Reduce I/O Latency Funding Europe H1 2026 M&A: 324 Exits Down 27%; Top Deals—Dream Games €1.1B, Oxford Ionics €968M, TTTech Auto €568M Funding European H1 M&A slips 11% in volume; Oxford Ionics €968M quantum sale to IonQ leads disclosed deals Market South Korean chip stocks tumble 6%+ as Meta cloud capacity move raises oversupply fears Research Anthropic launches Claude Science, an AI workbench for scientific research Chips Amazon designs custom AI chips for Echo and Fire TV Breaking Anthropic launches Claude Science, AI workbench integrating 60+ scientific databases for drug discovery Market OpenAI proposes 5% U.S. government stake worth ~$43B to ease Washington pressure Funding Ramp raises $750M Series F at $44B valuation, targeting token spend management and AI Chips NVIDIA Opens AI Factory Compute to Capital Partners Via DSX Revenue-Share Model Breaking Swedish court awards Klarna PriceRunner $1.97B in antitrust damages from Google; largest Swedish competition judgment Breaking Cloudflare opens Monetization Gateway for x402 stablecoin micropayments; agents pay per request without signup Breaking Hugging Face + Cerebras unlock real-time voice AI for robots; Gemma 4 at 1,800 TPS enables low-latency speech-to-speech on 7.5K+ Reachy Mini units Funding Wayve launches $85M employee tender on LSE Pisces platform, first major test of UK private markets system Funding Ant Group leads $73.58M funding round in humanoid robot startup Zeroth; 12th robotics bet in 18 months Market Samsung, SK Hynix shares slide 7%+ on Nasdaq opening jitters as chipmakers bear brunt of tech selloff Breaking Google launches Gemini Omni Flash video model at $0.10/sec and Nano Banana 2 Lite image model into GA Chips Tesla hires Gary Jiang, 17-year Intel veteran, as Director of Terafab chip project Market Meta launches cloud business to sell excess AI compute capacity; stock +8% Market NVIDIA projects $1 trillion AI infrastructure demand through 2027; doubles prior forecast
Chips

Cerebras and OpenAI sign $20B+ deal for 750MW high-speed AI inference capacity deployment

Cerebras Systems and OpenAI announced a multi-year agreement on June 23 for OpenAI to deploy 750 megawatts of Cerebras' wafer-scale inference compute over the next several years. The deal is valued at over $20 billion, with rollout starting in 2026. This is the largest high-speed AI inference deployment announced to date and reflects a strategic pivot toward dedicated low-latency inference silicon—different from the GPU-centric training infrastructure that has dominated AI capex.

<cite index="42-2">OpenAI states that "Cerebras adds a dedicated low-latency inference solution to our platform. That means faster responses, more natural interactions, and a stronger foundation to scale real-time AI to many more people."</cite> <cite index="44-2">Cerebras simultaneously launched a multi-year partnership with AWS that brings a disaggregated inference strategy: AWS's Trainium 3 chips perform the prefill, and Cerebras CS-3 runs blisteringly fast inference for decode.</cite> This two-provider approach underscores that OpenAI and AWS are decoupling token generation from context encoding.

<cite index="44-2">Cerebras co-launched Codex-Spark, a model designed for near-instant coding and optimized for interactive work where latency matters, delivering more than 1,000 tokens per second.</cite> <cite index="44-2">Kimi K2.6, the leading open-weight frontier model and the first trillion-parameter model served on Cerebras, achieved performance approaching 1,000 tokens per second as independently measured by Artificial Analysis.</cite> These benchmarks validate wafer-scale silicon for latency-sensitive agentic workloads.

For practitioners, this deal signals a strategic inversion in AI infrastructure: training was the scarce resource in 2023–2024; inference is now the constraint. <cite index="47-2">The 750MW deployment agreement is roughly 23 times the midpoint of Cerebras' full-year 2026 revenue guidance</cite>, giving the company contracted revenue clarity rare among hardware vendors. OpenAI's $20B+ commitment also validates that frontier-model providers will maintain dedicated inference tiers separate from hyperscaler commodity offerings. Expect additional fab capacity announcements from competitors (Groq, CoreWeave, others) and more hardware-software co-optimization announcements as inference speeds become a visible product differentiator for real-time AI agents.

Sources