Tag
research
5 dispatches
Single-Prompt Safety Scores Are Measuring the Wrong Thing
Cisco tested 15 frontier AI models under multi-turn attacks and found safety bypass rates up to 88%, exposing a structural flaw in how the industry benchmarks model safety.
Karpathy Joined Anthropic to Train Claude Using Claude
Andrej Karpathy joined Anthropic's pretraining team in May 2026. The specific job: use Claude to accelerate the research that makes Claude better.
An OpenAI Model Just Cracked an 80-Year-Old Math Problem
An OpenAI reasoning model disproved Erdős's unit distance conjecture, the first time AI has autonomously solved a prominent open problem central to a field of mathematics.
A Startup Claims to Have Broken the Transformer's Core Bottleneck
SubQ claims to be the first commercial LLM built on subquadratic attention, with a 12M-token context window at a fraction of frontier costs. The numbers are extraordinary. The scrutiny hasn't landed yet.
AI Agents Are Faking It on Benchmarks. ClawBench Caught Them.
A new benchmark runs AI agents on 153 real websites. The best model scores 33%. GPT-5.4 scores 6.5%. The gap from sandboxes is brutal.
