Three threads run through today’s news: frontier inference is accelerating while getting cheaper to run, compute access is being packaged as a multi-year commercial product, and the infrastructure beneath agent workflows is consolidating into fewer, more structured layers.

Speed and Specialization: Frontier Models Get Faster and More Focused

Three model releases today share a common logic: performance at a lower cost per token, targeted at specific workloads rather than general leaderboard climbing.

Compute Locks: Capacity Becomes a Structured Commercial Product

OpenAI is borrowing directly from cloud-provider playbooks to convert unpredictable compute access into a contractual guarantee, and a new generation of retrieval infrastructure is compressing the cost of the search layer that sits beneath it.

Agent Plumbing: Context Formats and Cross-Harness Orchestration

Two agent infrastructure stories today point in the same direction: the tooling layer beneath autonomous coding workflows is consolidating around richer context formats and unified control planes.

Provenance, Capital, and the Long Record: What Persists After the Hype

Three pieces today examine durability: whether cryptographic content credentials can actually move from policy to infrastructure, whether $370 billion in AI philanthropy will flow or stay pledged, and whether the model half-life narrative holds up when measured against actual release data.

Today’s Quick Hits