Today’s news converges on a single uncomfortable fact: the parts of AI that get the most attention (model alignment, frontier benchmarks, headline pricing) are not always the parts that matter operationally. Anthropic shifts its safety frame from the model to the environment. Harvey shows legal AI is far from saturated. OpenRouter doubles to $1.3 billion betting that the routing layer matters more than any single model. And the M&A landscape gets messier in two countries on the same day.

Safety Lives at the Environment Layer

Two pieces about agentic safety pull in the same direction: stop trying to train the agent into perfect behavior and start engineering the environment to limit what a failed agent can do.

The Benchmarks Keep Slipping Away

Three new evaluation stories show the same pattern from different angles: existing benchmarks have saturated, vertical-specific benchmarks reveal frontier models are not as close to general competence as the leaderboards suggest.

The Money Stays in Motion

Three financial stories: an unusual late warning during an in-progress acquisition, a doubling valuation for the routing-not-model thesis, and SpaceX’s S-1 telling two compute stories at once.

Talent, Toolchains, and Tomorrow’s Software

China extends travel restrictions previously reserved for state employees to private AI talent; NVIDIA squeezes more from already-optimized kernels; and the Claude Cowork lead engineer demonstrates what self-built personal software now looks like.

Today’s Quick Hits