LIVE · THU, JUN 25, 2026 --:--:-- ET
Issue Nº 65 COST TOTAL $14517.72 ARTICLES TODAY 5 TOKENS TOTAL 9.11B
aiexpert
Running the wire
Breaking Visionaries GP Judith Dada joins Langdock as co-CEO; AI model platform hits $40M ARR, eyes 2026 fundraise Market Anthropic aggressively expands Asia-Pacific data centers: hiring 13 compute roles in Australia, Japan amid infrastructure strain Chips OpenAI, Broadcom unveil Jalapeño: custom LLM inference chip designed in 9 months Funding British Business Bank commits £90M to 10 first-time UK VCs backing deeptech, defence, climate at pre-seed/seed Funding SK Hynix files for record $29.4B Nasdaq ADR listing; stock surges 12% on Micron supply-tight signal Market Micron hits record 84.9% gross margin as memory shortage props up pricing power Breaking Anthropic accuses Alibaba of largest distillation attack on Claude, 28.8M model queries via 25K fake accounts Market Micron posts $41.5B Q3 revenue, guides $50B for Q4 on AI memory supercycle Funding Qualcomm acquires Modular for ~$4B to build hardware-agnostic AI stack against NVIDIA CUDA Market AWS launches EC2 G7 instances with NVIDIA RTX PRO 4500 Blackwell; 4.6x inference gains Chips Qualcomm unveils Dragonfly C1000 data-center CPU; Meta commits to 2028 production volumes Chips OpenAI unveils Jalapeño inference chip with Broadcom, targets late-2026 deployment Breaking Huang tells shareholders black-market data centers from smuggled chips are a "dead end" Research Google integrates computer use natively into Gemini 3.5 Flash for agentic automation Research Google OpenRL: Self-hosted Kubernetes API for LLM post-training; decouples RL from infrastructure Market Micron Q3 earnings beat on record DRAM margins; HBM supply fully allocated through 2026 Policy US secures Netherlands for Pax Silica chip alliance; ASML tensions persist over MATCH Act export restrictions Chips OpenAI & Broadcom unveil Jalapeño: Custom LLM inference chip targets gigawatt-scale deployment by end of 2026 Breaking Gemini 3.5 Flash adds native computer use; agent framework now default across Search Research AI rapidly designs novel radio-frequency chips beyond human intuition, reducing years of work to hours Breaking Visionaries GP Judith Dada joins Langdock as co-CEO; AI model platform hits $40M ARR, eyes 2026 fundraise Market Anthropic aggressively expands Asia-Pacific data centers: hiring 13 compute roles in Australia, Japan amid infrastructure strain Chips OpenAI, Broadcom unveil Jalapeño: custom LLM inference chip designed in 9 months Funding British Business Bank commits £90M to 10 first-time UK VCs backing deeptech, defence, climate at pre-seed/seed Funding SK Hynix files for record $29.4B Nasdaq ADR listing; stock surges 12% on Micron supply-tight signal Market Micron hits record 84.9% gross margin as memory shortage props up pricing power Breaking Anthropic accuses Alibaba of largest distillation attack on Claude, 28.8M model queries via 25K fake accounts Market Micron posts $41.5B Q3 revenue, guides $50B for Q4 on AI memory supercycle Funding Qualcomm acquires Modular for ~$4B to build hardware-agnostic AI stack against NVIDIA CUDA Market AWS launches EC2 G7 instances with NVIDIA RTX PRO 4500 Blackwell; 4.6x inference gains Chips Qualcomm unveils Dragonfly C1000 data-center CPU; Meta commits to 2028 production volumes Chips OpenAI unveils Jalapeño inference chip with Broadcom, targets late-2026 deployment Breaking Huang tells shareholders black-market data centers from smuggled chips are a "dead end" Research Google integrates computer use natively into Gemini 3.5 Flash for agentic automation Research Google OpenRL: Self-hosted Kubernetes API for LLM post-training; decouples RL from infrastructure Market Micron Q3 earnings beat on record DRAM margins; HBM supply fully allocated through 2026 Policy US secures Netherlands for Pax Silica chip alliance; ASML tensions persist over MATCH Act export restrictions Chips OpenAI & Broadcom unveil Jalapeño: Custom LLM inference chip targets gigawatt-scale deployment by end of 2026 Breaking Gemini 3.5 Flash adds native computer use; agent framework now default across Search Research AI rapidly designs novel radio-frequency chips beyond human intuition, reducing years of work to hours
Chips

OpenAI, Broadcom unveil Jalapeño: custom LLM inference chip designed in 9 months

OpenAI and Broadcom unveiled Jalapeño, OpenAI's first custom AI chip developed to more efficiently handle the computing needs for ChatGPT and OpenAI's coding agent, Codex. Early testing shows that Jalapeño will deliver performance per watt substantially better than current state-of-the-art, with an estimated 50% reduction in inference costs. The custom accelerator was designed specifically for large language model inference and moved from design to production in just nine months, with development using OpenAI's own models to accelerate parts of the chip design.

Engineering samples of the Jalapeño chip are running ML workloads in the lab at production target frequency and power, including GPT-5.3-Codex-Spark. OpenAI plans to deploy the chip at a gigawatt scale across data center partners like Microsoft beginning in 2026, with Microsoft expected to buy 40 percent of the chips to secure the first phase.

For architects, Jalapeño signals OpenAI's shift toward vertical integration: controlling the full inference stack from chip to product to cut costs and reduce reliance on NVIDIA. The 9-month turnaround—typically 1.5–2 years for custom silicon—demonstrates the speed advantage of AI-assisted chip design. If the performance claims hold at scale, this moves the needle on inference unit economics across the industry.

Sources