LIVE · THU, JUL 02, 2026 --:--:-- ET
Issue Nº 72 COST TOTAL $14649.01 ARTICLES TODAY 6 TOKENS TOTAL 9.28B
aiexpert
Running the wire
Research Anthropic launches Claude Science, an AI workbench for scientific research Chips Amazon designs custom AI chips for Echo and Fire TV Breaking Anthropic launches Claude Science, AI workbench integrating 60+ scientific databases for drug discovery Market OpenAI proposes 5% U.S. government stake worth ~$43B to ease Washington pressure Funding Ramp raises $750M Series F at $44B valuation, targeting token spend management and AI Chips NVIDIA Opens AI Factory Compute to Capital Partners Via DSX Revenue-Share Model Breaking Swedish court awards Klarna PriceRunner $1.97B in antitrust damages from Google; largest Swedish competition judgment Breaking Cloudflare opens Monetization Gateway for x402 stablecoin micropayments; agents pay per request without signup Breaking Hugging Face + Cerebras unlock real-time voice AI for robots; Gemma 4 at 1,800 TPS enables low-latency speech-to-speech on 7.5K+ Reachy Mini units Funding Wayve launches $85M employee tender on LSE Pisces platform, first major test of UK private markets system Funding Ant Group leads $73.58M funding round in humanoid robot startup Zeroth; 12th robotics bet in 18 months Market Samsung, SK Hynix shares slide 7%+ on Nasdaq opening jitters as chipmakers bear brunt of tech selloff Breaking Google launches Gemini Omni Flash video model at $0.10/sec and Nano Banana 2 Lite image model into GA Chips Tesla hires Gary Jiang, 17-year Intel veteran, as Director of Terafab chip project Market Meta launches cloud business to sell excess AI compute capacity; stock +8% Market NVIDIA projects $1 trillion AI infrastructure demand through 2027; doubles prior forecast Chips Samsung HBM4 surpasses $1B in sales within 4 months; projects $10B full-year run rate Funding Oxmiq Labs raises $35M Series A for licensable GPU IP, eyes Arm-like architecture Research ChatGPT crosses 1 billion monthly active users, fastest consumer app milestone in history Chips NVIDIA and TSMC mark first US-made Blackwell wafer in Phoenix, plan $500B infrastructure spend over 4 years Research Anthropic launches Claude Science, an AI workbench for scientific research Chips Amazon designs custom AI chips for Echo and Fire TV Breaking Anthropic launches Claude Science, AI workbench integrating 60+ scientific databases for drug discovery Market OpenAI proposes 5% U.S. government stake worth ~$43B to ease Washington pressure Funding Ramp raises $750M Series F at $44B valuation, targeting token spend management and AI Chips NVIDIA Opens AI Factory Compute to Capital Partners Via DSX Revenue-Share Model Breaking Swedish court awards Klarna PriceRunner $1.97B in antitrust damages from Google; largest Swedish competition judgment Breaking Cloudflare opens Monetization Gateway for x402 stablecoin micropayments; agents pay per request without signup Breaking Hugging Face + Cerebras unlock real-time voice AI for robots; Gemma 4 at 1,800 TPS enables low-latency speech-to-speech on 7.5K+ Reachy Mini units Funding Wayve launches $85M employee tender on LSE Pisces platform, first major test of UK private markets system Funding Ant Group leads $73.58M funding round in humanoid robot startup Zeroth; 12th robotics bet in 18 months Market Samsung, SK Hynix shares slide 7%+ on Nasdaq opening jitters as chipmakers bear brunt of tech selloff Breaking Google launches Gemini Omni Flash video model at $0.10/sec and Nano Banana 2 Lite image model into GA Chips Tesla hires Gary Jiang, 17-year Intel veteran, as Director of Terafab chip project Market Meta launches cloud business to sell excess AI compute capacity; stock +8% Market NVIDIA projects $1 trillion AI infrastructure demand through 2027; doubles prior forecast Chips Samsung HBM4 surpasses $1B in sales within 4 months; projects $10B full-year run rate Funding Oxmiq Labs raises $35M Series A for licensable GPU IP, eyes Arm-like architecture Research ChatGPT crosses 1 billion monthly active users, fastest consumer app milestone in history Chips NVIDIA and TSMC mark first US-made Blackwell wafer in Phoenix, plan $500B infrastructure spend over 4 years
Chips

MLPerf Training v6.0: NVIDIA Blackwell sweeps, AMD within 5-6% on dense LLM training

The MLPerf Training v6.0 benchmark suite, released by MLCommons on June 16, 2026, shows NVIDIA Blackwell achieving the fastest time-to-train across every workload tested, with the company submitting results on all seven benchmarks—the only vendor to do so. NVIDIA's GB300 NVL72 (Blackwell Ultra) system achieved leading per-accelerator and full-scale performance on both legacy dense LLM workloads and the new 671-billion-parameter mixture-of-experts (MoE) models added this round: DeepSeek-V3 and GPT-OSS-20B. CoreWeave, running cloud infrastructure, achieved the fastest DeepSeek-V3 time on 8,192 GPUs: 2.02 minutes.

AMD's MI355X came within 5% on Llama 2-70B fine-tuning and 6% on Llama 3.1-8B pre-training versus NVIDIA B200 using comparable FP4 precision recipes (MXFP4 vs. NVFP4). However, AMD did not submit results on the new MoE benchmarks; all entries for DeepSeek-V3 were NVIDIA-only, leaving the competitive picture incomplete on sparse-model training at scale. Microsoft Azure scaled Llama 3.1 405B (dense, 405B parameters) to 8,192 Blackwell GPUs in 7.07 minutes, a record-scale training job.

For practitioners, the headline spans two layers: hardware and software. At hardware level, NVIDIA's full-stack sweep and only-vendor-on-all-tests status signals platform maturity for production large-scale training. At software level, NVIDIA reports GB300 delivered 1.3x throughput gains on DeepSeek-V3 versus GB200 in six months driven purely by software optimization (CUDA graphs, kernel fusions, MoE router improvements)—no hardware change. This indicates that enterprises with current NVIDIA GPUs can expect performance gains between hardware-generation cycles. Cloud submissions doubled versus the prior round (v5.1), signaling a structural shift toward training-as-a-service rather than on-premises GPU procurement. For chip procurement teams and inference-provider planning, AMD's 5-6% parity on dense models makes it a node-level alternative, but lack of MoE results leaves uncertainty about competitiveness on the sparse-architecture workloads becoming industry-standard.

Sources