News
AI, at newsroom pace.
RESEARCH
One Layer Matches Full RL Post-Training on Qwen Models
RESEARCH
ChatGPT crosses 1 billion monthly active users, fastest consumer app milestone in history
RESEARCH
TRIAGE Cuts Agent Actions 14.8% While Raising Success Rates
RESEARCH
Researchers Close Gap Between AI Agents and Hand-Curated Skills
RESEARCH
BrowserBC Lifts Browser Agent Success to 81% Using Human Traces
RESEARCH
Language Model Explanations Track Behavior Shifts Automatically
RESEARCH
Simple Prompting Baselines Outperform Complex Supervision Methods
RESEARCH
OpenAI releases GeneBench-Pro; tests AI judgment on 129 multi-stage genomics problems; GPT-5.6 Sol reaches 31.5%