approval-tests

📁 lexler/skill-factory 📅 14 days ago

总安装量

周安装量

#60184

全站排名

安装命令

npx skills add https://github.com/lexler/skill-factory --skill approval-tests

Agent 安装分布

opencode 3

antigravity 3

claude-code 3

junie 3

github-copilot 3

goose 3

Skill 文档

STARTER_CHARACTER = ð¸

Approval Tests

Philosophy

“A picture’s worth 1000 assertions.”

Approval tests verify complex output by comparing against a saved “golden master” file instead of writing individual assertions. You capture the output once, review it, approve it, and future runs compare against that approved snapshot.

You don’t need to know the expected output upfront. Run the code, see what it produces, decide if it’s correct. Approval is a judgment – you’re confirming this is what the code should produce. Whoever writes the code reviews and approves.

Use approval tests when:

Output is complex – instead of 20 assertions, one approval captures everything
Characterizing legacy code – snapshot behavior, then refactor safely
Combinatorial testing – test all input combinations in one approval
Assertions would be tedious or brittle

Use assertions when:

Simple values or specific edge cases
Non-deterministic output that can’t be scrubbed

Core Workflow

1. Write test with verify(result)
2. Run test â FAILS (no .approved file yet)
3. Creates: TestName.approved.txt (empty) + TestName.received.txt (actual output)
4. Review .received file - is this correct?
5. YES â rename/copy .received to .approved
6. Run test again â PASSES
7. Commit .approved file to version control

File naming convention:

{TestClass}.{test_method}.approved.txt   â commit this
{TestClass}.{test_method}.received.txt   â gitignore this

Critical rules:

.approved files ARE your test expectations – commit them
.received files are temporary – add *.received.* to .gitignore
Never edit .approved files by hand – always generate via test

When a test fails, a diff tool opens showing approved vs received. This is how you review changes. Reporters configure which diff tool to use.

Core API Pattern

All languages follow the same pattern:

verify(result)                    # Basic string/object verification
verify_as_json(object)            # Objects as formatted JSON
verify_all(header, items)         # Collections with labels
verify_all_combinations(fn, inputs)  # All input combinations

Non-deterministic data (timestamps, GUIDs) must be scrubbed before verification.

Key Techniques

Scrubbers – replace values that change between runs (timestamps, UUIDs, random numbers, ports, paths) with stable placeholders like [Date1] or guid_1. Without scrubbing, tests pass locally but fail in CI.
Inline approvals – expectations in source code instead of separate files. Avoids file proliferation for short output. Python uses docstrings, Java uses text blocks.
Storyboard – show an object at multiple points in time, like frames in a comic. Each step appears in the diff, making it easy to see how state changes. For workflows, state machines, animations. Python/Java have classes; Node.js uses string building.
Combinations – test all permutations of input parameters in one approval. Exhaustive coverage without writing separate tests for each case. For large sets, pairwise testing reduces millions of combinations to ~100.
Multiple approvals per test – calling verify() twice overwrites the same file, so only the last one is tested. Parameter-based naming creates separate files for each scenario.

See language references for implementation details.

Language References

Detect language from project files, then read the appropriate reference for installation, quick start, core patterns, and links to deeper reference files:

python.md – Python (pyproject.toml, setup.py, requirements.txt)
nodejs.md – JavaScript/TypeScript (package.json)
java.md – Java (pom.xml, build.gradle)

Anti-Patterns

Don’t write assertions for complex objects – use verify_as_json() instead
Don’t commit .received files – they’re temporary
Don’t forget scrubbers for timestamps, GUIDs, random values
Don’t over-verify – one approval per logical behavior. Large approvals hide signal in noise; unrelated changes break tests.
Don’t hand-edit .approved files – always generate via test. Hand-edited files may not match actual code output.
Don’t use verify_all for structured data – use verify_as_json({"items": items})
Don’t mix approvals with assertions – the approval captures everything
Don’t call verify() multiple times without NamerFactory – each overwrites the same file

Flaky tests across environments usually means unscrubbed dynamic data (timestamps, UUIDs, ports, paths).

GitHub 仓库 ↗ ← 返回陌讯 Skills 聚合平台