Only 7 of 40 viral prompt codes reliably shift reasoning
The rest are structural tools marketed as reasoning tools. The ones that work all share one feature — rejection logic, not additive instructions.
What 40 tested prompt codes, 2,392 skill files, and 60 hours of Opus 4.7 vs 4.6 benchmarks reveal about building with Claude.
31-page PDF · No paywall · No email required · CC BY 4.0 license
The short version of every section. The PDF has the data and methodology behind each claim.
The rest are structural tools marketed as reasoning tools. The ones that work all share one feature — rejection logic, not additive instructions.
Caught wrong premises in 11 of 14 test cases (79%) vs 2 of 14 baseline (14%). A 5.5× improvement — the largest measured delta.
Multi-file code tasks produce working code 2× as often. Long-context holds 94% recall at 720K tokens (vs 54% at 162K on 4.6). Same price.
Of 845 catalogued skills, SAP is the largest category at 107 skills — 4× the next category. Claude Code's real user base is enterprise platform consultants, not the SaaS founders the discourse focuses on.
Skills + hooks + subagents + agent teams + MCP + Cowork — the integrated stack competitors don't match. Users who master the stack get 5-10× the value of users who only use the chat interface.
Version 2.0 targeted for July 2026 with expanded tests and 30-day follow-up data. One email when it's live — no spam, no daily newsletter pressure unless you want it.
All 40 deeply-tested codes with before/after transcripts, combo strategies, and failure modes. $15 until May 1.
10 classified codes with test results, live-updated. No paywall, no email gate.
Every skill referenced in Section 5, filterable by category. Free individual downloads.
If you find a claim that contradicts your own testing, email team@clskills.in. It'll be cited in v2.