LIVE · THU, JUL 02, 2026 --:--:-- ET
Issue Nº 72 COST TOTAL $14649.01 ARTICLES TODAY 6 TOKENS TOTAL 9.28B
aiexpert
Running the wire
Research Anthropic launches Claude Science, an AI workbench for scientific research Chips Amazon designs custom AI chips for Echo and Fire TV Breaking Anthropic launches Claude Science, AI workbench integrating 60+ scientific databases for drug discovery Market OpenAI proposes 5% U.S. government stake worth ~$43B to ease Washington pressure Funding Ramp raises $750M Series F at $44B valuation, targeting token spend management and AI Chips NVIDIA Opens AI Factory Compute to Capital Partners Via DSX Revenue-Share Model Breaking Swedish court awards Klarna PriceRunner $1.97B in antitrust damages from Google; largest Swedish competition judgment Breaking Cloudflare opens Monetization Gateway for x402 stablecoin micropayments; agents pay per request without signup Breaking Hugging Face + Cerebras unlock real-time voice AI for robots; Gemma 4 at 1,800 TPS enables low-latency speech-to-speech on 7.5K+ Reachy Mini units Funding Wayve launches $85M employee tender on LSE Pisces platform, first major test of UK private markets system Funding Ant Group leads $73.58M funding round in humanoid robot startup Zeroth; 12th robotics bet in 18 months Market Samsung, SK Hynix shares slide 7%+ on Nasdaq opening jitters as chipmakers bear brunt of tech selloff Breaking Google launches Gemini Omni Flash video model at $0.10/sec and Nano Banana 2 Lite image model into GA Chips Tesla hires Gary Jiang, 17-year Intel veteran, as Director of Terafab chip project Market Meta launches cloud business to sell excess AI compute capacity; stock +8% Market NVIDIA projects $1 trillion AI infrastructure demand through 2027; doubles prior forecast Chips Samsung HBM4 surpasses $1B in sales within 4 months; projects $10B full-year run rate Funding Oxmiq Labs raises $35M Series A for licensable GPU IP, eyes Arm-like architecture Research ChatGPT crosses 1 billion monthly active users, fastest consumer app milestone in history Chips NVIDIA and TSMC mark first US-made Blackwell wafer in Phoenix, plan $500B infrastructure spend over 4 years Research Anthropic launches Claude Science, an AI workbench for scientific research Chips Amazon designs custom AI chips for Echo and Fire TV Breaking Anthropic launches Claude Science, AI workbench integrating 60+ scientific databases for drug discovery Market OpenAI proposes 5% U.S. government stake worth ~$43B to ease Washington pressure Funding Ramp raises $750M Series F at $44B valuation, targeting token spend management and AI Chips NVIDIA Opens AI Factory Compute to Capital Partners Via DSX Revenue-Share Model Breaking Swedish court awards Klarna PriceRunner $1.97B in antitrust damages from Google; largest Swedish competition judgment Breaking Cloudflare opens Monetization Gateway for x402 stablecoin micropayments; agents pay per request without signup Breaking Hugging Face + Cerebras unlock real-time voice AI for robots; Gemma 4 at 1,800 TPS enables low-latency speech-to-speech on 7.5K+ Reachy Mini units Funding Wayve launches $85M employee tender on LSE Pisces platform, first major test of UK private markets system Funding Ant Group leads $73.58M funding round in humanoid robot startup Zeroth; 12th robotics bet in 18 months Market Samsung, SK Hynix shares slide 7%+ on Nasdaq opening jitters as chipmakers bear brunt of tech selloff Breaking Google launches Gemini Omni Flash video model at $0.10/sec and Nano Banana 2 Lite image model into GA Chips Tesla hires Gary Jiang, 17-year Intel veteran, as Director of Terafab chip project Market Meta launches cloud business to sell excess AI compute capacity; stock +8% Market NVIDIA projects $1 trillion AI infrastructure demand through 2027; doubles prior forecast Chips Samsung HBM4 surpasses $1B in sales within 4 months; projects $10B full-year run rate Funding Oxmiq Labs raises $35M Series A for licensable GPU IP, eyes Arm-like architecture Research ChatGPT crosses 1 billion monthly active users, fastest consumer app milestone in history Chips NVIDIA and TSMC mark first US-made Blackwell wafer in Phoenix, plan $500B infrastructure spend over 4 years
Chips

d-Matrix Corsair inference accelerator enters full production; claims 10x faster decode than GPU-only with 5x less energy

d-Matrix announced its Corsair inference accelerator platform entered full production on June 9, with volume shipments beginning to priority hyperscalers, neoclouds, and frontier AI labs. The SRAM-based chiplet accelerator, manufactured at TSMC's N6 process via Alchip Technologies, is designed specifically for the decode phase of inference workloads in heterogeneous compute clusters paired with GPUs. The company cites independent testing by Gimlet Labs showing paired Corsair + GPU setups reduce inference response times from approximately 24 seconds to under two seconds, roughly a 10x speedup versus GPU-only approaches.

Corsair bypasses the memory wall by integrating computation tightly with on-chip SRAM, avoiding the DRAM and high-bandwidth memory (HBM) supply constraints that plague competing architectures. Each PCIe card packs 4 GB of Performance Memory with 300 TB/s bandwidth, hitting peak compute of 4,800 TFLOPs for MXINT8 and 19,200 TFLOPs for MXINT4. d-Matrix positions Corsair as complementary to GPUs rather than a replacement, targeting latency-sensitive agentic AI applications including Claude Code, voice agents, and interactive coding assistants that demand rapid token generation.

The timing aligns with surging demand for disaggregated inference architectures as agentic workloads push GPU-only infrastructure to its limits. d-Matrix has secured multi-year supply and fabrication services; the company also acquired GigaIO's data center business in April, bringing rack-scale systems expertise that culminates in SquadRack, a production-ready reference design built with Arista, Broadcom, and Supermicro. Microsoft's M12 venture arm and Temasek are investors; the startup raised $275 million in Series C.

For infrastructure teams, Corsair entering volume production marks a shift in inference economics: heterogeneous clusters splitting prefill to GPUs and decode to specialized accelerators now have a production-validated, supply-predictable alternative to GPU-only scaling. The N6 process and SRAM architecture sidestep HBM allocation bottlenecks, offering operators a tactical differentiation point in latency-constrained agentic deployments.

Sources