TL;DR

Anthropic published lessons from using hundreds of Claude Code Skills across its engineering organization, according to a July 1 analysis by Thorsten Meyer AI. The key finding: Skills are better understood as reusable folders containing instructions, scripts, references and checks, not as saved prompts.

Anthropic has published an engineering account of how its Claude Code team uses Skills after running hundreds of them across its engineering organization. Its central point is that a Skill is a reusable folder of instructions, scripts and references rather than a saved prompt. That matters for teams adopting coding agents because it reframes repeated AI instructions as shared operating assets that can be versioned, tested and reused.

The first confirmed point is definitional. Anthropic’s Claude Code Skills documentation says Skills extend Claude by adding a SKILL.md file with instructions, plus optional supporting files. The docs describe Skills as discoverable by Claude when relevant or invocable directly with a /skill-name command, with a description field used to help the model decide when to load them.
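
To make the definition concrete, here is a minimal Python sketch of what a SKILL.md file might contain and how a tool could read its frontmatter. The skill name, the description text, the `name` key and the `read_frontmatter` helper are illustrative assumptions, not taken from Anthropic's material, and the parser handles only simple `key: value` lines rather than full YAML.

```python
# Hypothetical SKILL.md content; the name and description are invented for illustration.
EXAMPLE_SKILL_MD = """\
---
name: verify-checkout
description: Use after changing checkout code to run the smoke tests and report failures.
---

# Verify checkout

1. Run scripts/smoke_test.py.
2. Report any failing step together with its log output.
"""


def read_frontmatter(text: str) -> dict[str, str]:
    """Parse simple `key: value` frontmatter between the first two `---` lines.

    This is a deliberately small parser, not a full YAML implementation.
    """
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}
    fields: dict[str, str] = {}
    for line in lines[1:]:
        if line.strip() == "---":
            return fields  # closing delimiter reached
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip()
    return {}  # no closing delimiter, so treat the file as having no frontmatter


if __name__ == "__main__":
    meta = read_frontmatter(EXAMPLE_SKILL_MD)
    print(meta["description"])
```

The description is written as a trigger ("Use after changing checkout code...") because, per the docs summarized above, it is the field the model uses to decide when to load the Skill.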

According to the Thorsten Meyer AI analysis, Anthropic’s internal Skills clustered into nine categories: library and API reference, product verification, data fetching and analysis, business-process automation, code scaffolding and templates, code quality and review, CI/CD and deployment, runbooks, and infrastructure operations. The same analysis says Anthropic’s own measurement found verification Skills, which check work after generation, had the largest effect on output quality.

The report distinguishes between what is confirmed and what is claimed. It is confirmed that Claude Code supports Skill folders, supporting files, scripts, frontmatter, project and personal locations, and related controls in current public docs. Claims about the scale of Anthropic’s internal library and the relative performance gains from specific Skill categories come from Anthropic’s engineering account as summarized by Thorsten Meyer AI; the material is vendor engineering reporting, not peer-reviewed research.

At a glance
When: Anthropic’s source post was dated June…
The development: Anthropic’s Claude Code team has published lessons on running hundreds of reusable Skills across its engineering organization, shifting the discussion from prompting to shared agent workflows.
AI Dispatch · Insights · 1 July 2026

A Skill is a folder, not a prompt

Anthropic published what it learned running hundreds of Skills across its own engineering org. Read as a business memo, the point is bigger than a coding trick: this is how ad-hoc prompting becomes durable institutional capability — the SOPs your agents actually follow, versioned and shared.

✕ The misconception

“A Skill is just a clever markdown prompt you save in a file.”

✓ What it actually is

A folder the agent can discover, read & run — instructions, scripts, references, templates, config & on-demand hooks.

Anatomy of a Skill — the file system is context engineering
my-skill/: the unit you share & version
├─ SKILL.md: root instructions + a description written for the model (its trigger)
├─ references/: deep detail pulled in only when needed (progressive disclosure)
├─ scripts/: real code, so the agent composes instead of rebuilding boilerplate
├─ assets/: templates & files to copy into the output
├─ config.json: setup the agent asks for if it’s missing (e.g. which Slack channel)
└─ hooks + memory: on-demand guardrails + an append-only log so it remembers
Why it matters: the folder itself is the knowledge base. The agent reads the root, then reaches deeper only when the task demands it — the same way you’d hand a new hire a one-pager that points to the detailed docs.
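
As a rough illustration of the layout above, the following Python sketch scaffolds an empty Skill folder. The `scaffold_skill` function, the placeholder SKILL.md wording and the empty `config.json` are assumptions made for the example; hooks and memory are left out because the source does not describe their file format.

```python
from pathlib import Path
import json
import tempfile


def scaffold_skill(parent: Path, name: str, description: str) -> Path:
    """Create a Skill folder with the subdirectories named in the anatomy above."""
    skill = parent / name
    for sub in ("references", "scripts", "assets"):
        (skill / sub).mkdir(parents=True, exist_ok=True)

    # Root instructions: short, and pointing deeper only when needed.
    (skill / "SKILL.md").write_text(
        f"---\nname: {name}\ndescription: {description}\n---\n\n"
        "Read references/ only when the task needs the detail.\n"
        "Prefer running the code in scripts/ over rewriting it.\n",
        encoding="utf-8",
    )

    # Empty config: the agent is expected to ask for missing values.
    (skill / "config.json").write_text(json.dumps({}, indent=2) + "\n", encoding="utf-8")
    return skill


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        root = scaffold_skill(Path(tmp), "my-skill", "Use when scaffolding is needed.")
        for path in sorted(root.rglob("*")):
            print(path.relative_to(root).as_posix())
```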
The nine types — a gap-analysis map for your own library
1. Library / API reference
2. Product verification (★ top impact)
3. Data fetching & analysis
4. Business-process automation
5. Code scaffolding & templates
6. Code quality & review
7. CI/CD & deployment
8. Runbooks
9. Infrastructure operations
By Anthropic’s own measurement, verification Skills — the ones that check the work — moved output quality the most. If you build one category well, build that one.
The craft — what separates a good Skill from a useless one
Gotchas = highest-signal section
Describe for the model, not humans (it’s the trigger)
Don’t state the obvious
Ship scripts, not just prose
On-demand guardrail hooks (/careful, /freeze)
Let it remember (log / SQLite)
Don’t railroad: leave room to adapt
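
The "let it remember" item can be sketched with Python's standard `sqlite3` module. This is one possible shape for an append-only log, not Anthropic's implementation; the `runs` table, its columns and the three helper functions are hypothetical.

```python
import sqlite3
from datetime import datetime, timezone


def open_log(path: str) -> sqlite3.Connection:
    """Open (or create) the log database and make sure the table exists."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS runs ("
        "id INTEGER PRIMARY KEY AUTOINCREMENT, "
        "ts TEXT NOT NULL, "
        "note TEXT NOT NULL)"
    )
    return conn


def append_note(conn: sqlite3.Connection, note: str) -> None:
    """Append one entry; this module never updates or deletes existing rows."""
    ts = datetime.now(timezone.utc).isoformat()
    conn.execute("INSERT INTO runs (ts, note) VALUES (?, ?)", (ts, note))
    conn.commit()


def recent_notes(conn: sqlite3.Connection, limit: int = 5) -> list[str]:
    """Return the newest notes first, so the agent sees recent context on startup."""
    rows = conn.execute(
        "SELECT note FROM runs ORDER BY id DESC LIMIT ?", (limit,)
    ).fetchall()
    return [row[0] for row in rows]


if __name__ == "__main__":
    log = open_log(":memory:")  # use a file path inside the Skill folder in practice
    append_note(log, "Staging deploy needs the VPN; first attempt timed out.")
    print(recent_notes(log))
```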
The take

The knowledge of how your organization actually operates can be captured, versioned, shared & executed — and the thing capturing it is a humble folder with a script and a gotchas list inside. For the builder, that’s context engineering with real tools attached. For whoever owns the budget, it’s the difference between AI that starts from zero every morning and an asset that compounds. Caveats: best practices are still evolving, checked-in Skills cost context, and curation beats accumulation. Start with one Skill, one gotcha, and the category that catches your mistakes.

Source: “Lessons from building Claude Code: How we use skills,” Thariq Shihipar (Anthropic), Claude blog, 3 June 2026. Categories, examples & measured claims are Anthropic’s; framing is the author’s. Docs: code.claude.com/docs/en/skills.
thorstenmeyerai.com

Skills Become Operating Assets

For engineering leaders, the shift is about repeatability. If a team keeps pasting the same checklist, deployment rule, testing recipe or review policy into an agent, a Skill turns that recurring guidance into a shared workflow. That can reduce variation between users and help new staff or new agents follow the same process as experienced engineers.

For developers, the practical change is that a Skill can include runnable scripts, templates and references instead of prose alone. That matters because a coding agent can compose around existing tools rather than recreating boilerplate each time. The reported strength of verification Skills also points to a near-term lesson for AI coding adoption: teams may get more value from agent checks and run recipes than from longer instruction files.
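
A verification Skill might ship a small script like the following Python sketch, which runs project checks and prints a pass/fail summary the agent can act on. The `CheckResult` type, the function names and the example command are assumptions for illustration; a real Skill would substitute its own test or launch commands.

```python
import subprocess
import sys
from dataclasses import dataclass


@dataclass
class CheckResult:
    name: str
    passed: bool
    output: str


def run_check(name: str, command: list[str], timeout: float = 60.0) -> CheckResult:
    """Run one command; a zero exit code counts as a pass."""
    try:
        proc = subprocess.run(command, capture_output=True, text=True, timeout=timeout)
    except (OSError, subprocess.TimeoutExpired) as exc:
        # Missing executable or hang: report it as a failure instead of crashing.
        return CheckResult(name, False, str(exc))
    return CheckResult(name, proc.returncode == 0, (proc.stdout + proc.stderr).strip())


def summarize(results: list[CheckResult]) -> str:
    """Build a short report: one line per check, then a total."""
    lines = [f"{'PASS' if r.passed else 'FAIL'} {r.name}" for r in results]
    failed = sum(not r.passed for r in results)
    lines.append(f"{len(results) - failed}/{len(results)} checks passed")
    return "\n".join(lines)


if __name__ == "__main__":
    # Placeholder check; replace with the project's real test or build command.
    checks = [run_check("python runs", [sys.executable, "-c", "print('ok')"])]
    print(summarize(checks))
    sys.exit(0 if all(c.passed for c in checks) else 1)
```

Returning a non-zero exit code on failure lets the same script double as a CI step, so the agent and the pipeline apply the same check.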

From Prompts To Folders

Anthropic’s public docs say Skills may live at enterprise, personal, project or plugin scope, and project Skills can be committed in a repository. The docs also say Claude Code can discover nested Skills in subdirectories, allowing monorepos to attach different instructions to different packages.
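
To show what nested Skills in a monorepo could look like on disk, here is a Python sketch that lists them per package. It assumes project Skills sit under `.claude/skills/<skill-name>/SKILL.md`; that path, the `find_project_skills` helper and the demo package names are assumptions for the example, and the code is not Claude Code's own discovery logic.

```python
from pathlib import Path
import tempfile


def find_project_skills(repo_root: Path) -> dict[str, list[str]]:
    """Map each directory that owns a .claude/skills folder to its Skill names."""
    found: dict[str, list[str]] = {}
    for skill_md in sorted(repo_root.rglob("SKILL.md")):
        skill_dir = skill_md.parent
        skills_dir = skill_dir.parent
        if skills_dir.name != "skills" or skills_dir.parent.name != ".claude":
            continue  # a SKILL.md outside the assumed layout
        owner = skills_dir.parent.parent.relative_to(repo_root).as_posix()
        found.setdefault(owner, []).append(skill_dir.name)
    return found


def make_demo_repo(root: Path) -> None:
    """Create a tiny fake monorepo with one root Skill and one package Skill."""
    for rel in (".claude/skills/lint", "packages/api/.claude/skills/deploy"):
        skill_dir = root / rel
        skill_dir.mkdir(parents=True)
        (skill_dir / "SKILL.md").write_text("---\nname: demo\n---\n", encoding="utf-8")


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        make_demo_repo(Path(tmp))
        for owner, skills in find_project_skills(Path(tmp)).items():
            print(owner, skills)
```

A listing like this can also serve as a curation aid, since it shows which packages carry their own instructions and which rely only on repository-wide Skills.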

The Skill authoring guidance stresses concise instructions, clear descriptions and progressive disclosure, in which SKILL.md serves as an overview and points Claude to detailed files only when needed. Current Claude Code docs also list bundled Skills such as /code-review, /run and /verify, and note Claude Code version requirements for /run and /verify.

“Create a SKILL.md file with instructions, and Claude adds it to its toolkit.”

— Claude Code documentation

Adoption Metrics Still Missing

The public material does not disclose enough detail to independently evaluate the internal results. It is not yet clear what sample size, baseline, grading method or task mix Anthropic used when measuring the reported quality gains from verification Skills.

It is also unclear how well the pattern transfers outside Claude Code, Anthropic’s engineering culture or teams with mature internal tooling. The analysis flags several limits: best practices are still changing, checked-in Skills can add context cost after loading, and a large Skill library may require active curation rather than simple accumulation.

Teams Test Verification Skills

The next practical step for companies is likely small-scale testing. The Thorsten Meyer AI analysis recommends starting with one Skill, one known caveat and the category that catches mistakes, especially verification workflows that confirm whether an agent’s output works.

The next milestone to watch is whether teams publish more measured comparisons of agent performance with and without Skills. Wider adoption will also depend on governance around ownership, security review, script permissions and how shared Skills are updated when underlying systems change.
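
A team that wants to run such a comparison itself could start from something like the Python sketch below, which compares pass rates on the same tasks with and without a Skill. The `TaskOutcome` record and the `compare` function are hypothetical, and the outcomes in the demo are made-up placeholders rather than measured results.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TaskOutcome:
    task_id: str
    passed_without_skill: bool
    passed_with_skill: bool


def compare(outcomes: list[TaskOutcome]) -> dict[str, float]:
    """Summarize a paired run: same tasks, once without and once with the Skill."""
    n = len(outcomes)
    if n == 0:
        raise ValueError("need at least one task outcome")
    baseline = sum(o.passed_without_skill for o in outcomes) / n
    with_skill = sum(o.passed_with_skill for o in outcomes) / n
    fixed = sum(o.passed_with_skill and not o.passed_without_skill for o in outcomes)
    regressed = sum(o.passed_without_skill and not o.passed_with_skill for o in outcomes)
    return {
        "tasks": n,
        "baseline_pass_rate": baseline,
        "skill_pass_rate": with_skill,
        "fixed": fixed,          # failed before, passes with the Skill
        "regressed": regressed,  # passed before, fails with the Skill
    }


if __name__ == "__main__":
    # Placeholder outcomes for illustration only; not real measurements.
    demo = [
        TaskOutcome("task-1", True, True),
        TaskOutcome("task-2", False, True),
        TaskOutcome("task-3", False, True),
        TaskOutcome("task-4", False, False),
    ]
    print(compare(demo))
```

Reporting the task count and the regressions alongside the pass rates addresses the gaps noted earlier: sample size, baseline and grading method are the details missing from the public material.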

Key Questions

What did Anthropic report about Claude Code Skills?

Anthropic reported lessons from using hundreds of Skills across its engineering organization, according to the July 1 Thorsten Meyer AI analysis. The main takeaway is that Skills are reusable folders that can include instructions, references, scripts and templates.

Is a Skill just a saved prompt?

No. A saved prompt is mainly text, while a Skill folder can contain SKILL.md, reference files, scripts, templates and configuration. Claude Code’s public docs confirm that supporting files can make Skills more powerful than a single instruction file.

Which Skill category had the biggest reported effect?

According to the source analysis, Anthropic’s own measurement found verification Skills had the largest impact on output quality. Those Skills focus on checking whether generated work actually passes project-specific tests, launch steps or review rules.

Are Anthropic’s findings peer-reviewed?

No. The material is company engineering reporting and product documentation, not peer-reviewed research. Readers should treat the internal measurements as Anthropic’s reported results unless more detailed methodology is published.

What should teams do next if they use AI coding agents?

Teams should identify one repeated agent task, especially a verification or review step, and turn it into a small Skill with clear instructions and any needed script. The near-term test is whether that Skill improves consistency without adding too much maintenance burden.

Source: Thorsten Meyer AI
