LIVE · THU, JUL 02, 2026 --:--:-- ET
Issue Nº 72 COST TOTAL $14648.38 ARTICLES TODAY 6 TOKENS TOTAL 9.28B
aiexpert
Running the wire
Breaking Anthropic launches Claude Science, AI workbench integrating 60+ scientific databases for drug discovery Market OpenAI proposes 5% U.S. government stake worth ~$43B to ease Washington pressure Funding Ramp raises $750M Series F at $44B valuation, targeting token spend management and AI Chips NVIDIA Opens AI Factory Compute to Capital Partners Via DSX Revenue-Share Model Breaking Swedish court awards Klarna PriceRunner $1.97B in antitrust damages from Google; largest Swedish competition judgment Breaking Cloudflare opens Monetization Gateway for x402 stablecoin micropayments; agents pay per request without signup Breaking Hugging Face + Cerebras unlock real-time voice AI for robots; Gemma 4 at 1,800 TPS enables low-latency speech-to-speech on 7.5K+ Reachy Mini units Funding Wayve launches $85M employee tender on LSE Pisces platform, first major test of UK private markets system Funding Ant Group leads $73.58M funding round in humanoid robot startup Zeroth; 12th robotics bet in 18 months Market Samsung, SK Hynix shares slide 7%+ on Nasdaq opening jitters as chipmakers bear brunt of tech selloff Breaking Google launches Gemini Omni Flash video model at $0.10/sec and Nano Banana 2 Lite image model into GA Chips Tesla hires Gary Jiang, 17-year Intel veteran, as Director of Terafab chip project Market Meta launches cloud business to sell excess AI compute capacity; stock +8% Market NVIDIA projects $1 trillion AI infrastructure demand through 2027; doubles prior forecast Chips Samsung HBM4 surpasses $1B in sales within 4 months; projects $10B full-year run rate Funding Oxmiq Labs raises $35M Series A for licensable GPU IP, eyes Arm-like architecture Research ChatGPT crosses 1 billion monthly active users, fastest consumer app milestone in history Chips NVIDIA and TSMC mark first US-made Blackwell wafer in Phoenix, plan $500B infrastructure spend over 4 years Funding Oxmiq raises $35M Series A for RISC-V GPU IP, expands data center architecture focus Breaking Klarna's PriceRunner wins $1.97B antitrust verdict against Google in Swedish court Breaking Anthropic launches Claude Science, AI workbench integrating 60+ scientific databases for drug discovery Market OpenAI proposes 5% U.S. government stake worth ~$43B to ease Washington pressure Funding Ramp raises $750M Series F at $44B valuation, targeting token spend management and AI Chips NVIDIA Opens AI Factory Compute to Capital Partners Via DSX Revenue-Share Model Breaking Swedish court awards Klarna PriceRunner $1.97B in antitrust damages from Google; largest Swedish competition judgment Breaking Cloudflare opens Monetization Gateway for x402 stablecoin micropayments; agents pay per request without signup Breaking Hugging Face + Cerebras unlock real-time voice AI for robots; Gemma 4 at 1,800 TPS enables low-latency speech-to-speech on 7.5K+ Reachy Mini units Funding Wayve launches $85M employee tender on LSE Pisces platform, first major test of UK private markets system Funding Ant Group leads $73.58M funding round in humanoid robot startup Zeroth; 12th robotics bet in 18 months Market Samsung, SK Hynix shares slide 7%+ on Nasdaq opening jitters as chipmakers bear brunt of tech selloff Breaking Google launches Gemini Omni Flash video model at $0.10/sec and Nano Banana 2 Lite image model into GA Chips Tesla hires Gary Jiang, 17-year Intel veteran, as Director of Terafab chip project Market Meta launches cloud business to sell excess AI compute capacity; stock +8% Market NVIDIA projects $1 trillion AI infrastructure demand through 2027; doubles prior forecast Chips Samsung HBM4 surpasses $1B in sales within 4 months; projects $10B full-year run rate Funding Oxmiq Labs raises $35M Series A for licensable GPU IP, eyes Arm-like architecture Research ChatGPT crosses 1 billion monthly active users, fastest consumer app milestone in history Chips NVIDIA and TSMC mark first US-made Blackwell wafer in Phoenix, plan $500B infrastructure spend over 4 years Funding Oxmiq raises $35M Series A for RISC-V GPU IP, expands data center architecture focus Breaking Klarna's PriceRunner wins $1.97B antitrust verdict against Google in Swedish court
Chips

Claude in Microsoft Foundry now runs on NVIDIA GB300 Blackwell Ultra in Azure

Anthropic's Claude models in Microsoft Foundry, hosted on Azure and running on NVIDIA's GB300 Blackwell Ultra GPUs, are now generally available. Microsoft has deployed the world's first large-scale production cluster with over 4,600 Blackwell Ultra GPUs connected via NVIDIA Quantum-X800 InfiniBand, each rack integrating 72 Blackwell Ultra GPUs and 36 NVIDIA Grace CPUs into a cohesive unit optimized for reasoning models, agentic AI, and multimodal generative AI.

The cluster delivers exceptional memory bandwidth: 37 terabytes of unified fast memory per rack (20 TB HBM3E GPU + 17 TB LPDDR5X CPU), 130 TB/s NVLink bandwidth within each rack, and up to 1.44 exaflops of FP4 Tensor Core performance per VM. Cross-rack, 800 Gb/s of interconnect per GPU via Quantum-X800 InfiniBand enables non-blocking scale to tens of thousands of GPUs. Microsoft says this infrastructure reduces model training from months to weeks and supports training of models exceeding 100 trillion parameters.

In recent MLPerf Inference v5.1 benchmarks, the GB300 NVL72 delivered up to 5x higher throughput per GPU on DeepSeek-R1 (671B parameters) versus NVIDIA Hopper, with leadership performance on Llama 3.1 405B and other newer benchmarks. The architecture is purpose-built for test-time scaling and agentic reasoning, where longer thought chains and tool calls drive higher compute variance.

For architects deploying Anthropic models at scale, this marks a shift in the inference stack: Blackwell Ultra's redesigned memory and networking are optimized for reasoning workloads with high context and long-form outputs. Enterprises on Azure now get Claude backed by the densest NVIDIA fabric available, making it viable to run trillion-parameter reasoning models in production without relying on batching tricks. This is the infrastructure inflection for cost-per-token competitive reasoning.

Sources