Anthropic plans an AI Fluency scorecard in Claude across 11 indicators

Testing Catalog reports Claude will score users on how they interact with the model, formalizing the gap between effective and ineffective prompting.

Alessandro Benigni

PUBLISHED MAY 28, 2026

1 MIN READ

Follow on Google

2 DAYS AGO

Anthropic plans an AI Fluency scorecard in Claude across 11 indicators — featured image for AI Insiders

Anthropic is preparing to introduce an AI Fluency scorecard inside Claude that evaluates each user’s interaction skills across 11 behavioral indicators, Testing Catalog reported on May 26. The feature treats prompting and conversational design as a measurable competence rather than an undifferentiated user attribute.

The behavioral indicators are not fully disclosed in Testing Catalog’s reporting, but the framing implies a structured taxonomy: precision in question formulation, willingness to iterate, willingness to provide context, structured task decomposition, appropriate skepticism toward outputs, and similar dimensions that distinguish effective Claude users from frustrated ones.

The product implications cut in two directions. For Claude, formalizing user fluency provides a calibration signal for which users get more capable model defaults, longer context windows, or earlier access to new features. For users, the scorecard creates a feedback loop that incentivizes learning specific prompting patterns Claude’s own evaluator considers effective. The risk is that users optimize for the score rather than for their actual outcomes, the standard problem with any visible metric.

Teams onboarding non-technical users to Claude Cowork or Claude.ai for the first time should watch for whether Anthropic exposes the scorecard externally; if so, it becomes a useful training reference but also a metric users may game.

Reported by Testing Catalog on 2026-05-26.

Anthropic plans an AI Fluency scorecard in Claude across 11 indicators

The morning brief for people inside the AI industry.

More in Agents

Claude Code's Dynamic Workflows rewrote Bun from Zig to Rust in 11 days

OpenAI's Codex builds a tax agent that patches its own failures

Anthropic argues containment starts at the environment layer, not the model