unit-test-philosophy
1
总安装量
1
周安装量
#77191
全站排名
安装命令
npx skills add https://github.com/open-agreements/open-agreements --skill unit-test-philosophy
Agent 安装分布
amp
1
cline
1
opencode
1
cursor
1
continue
1
kimi-cli
1
Skill 文档
Unit Test Philosophy (Open Agreements)
Use this skill when
- A request asks to add tests, improve coverage, or harden regressions.
- A change touches
src/,integration-tests/,packages/contracts-workspace, orpackages/contracts-workspace-mcp. - You need readable Allure behavior specs and OpenSpec traceability.
Core philosophy
- Test highest-risk behavior first. Focus first on mutating flows, parser/validator boundaries, and policy/safety checks.
- Optimize for regression prevention, not just line coverage. Prioritize branches where failures could produce wrong legal output or unsafe automation behavior.
- Treat Allure as test style, not test type. Use normal unit/integration tests with Allure labels, steps, and attachments in the same files.
- Keep spec and test effectively coextensive. If behavior is important enough to test, map it to canonical OpenSpec scenarios or active change-package scenarios.
- Keep assertions behavior-oriented. Verify user-observable outputs, diagnostics, and mutation outcomes before internals.
- Make failures easy to debug. Attach structured context for inputs, normalized outputs, and error payloads.
Repo standards
Test structure
- Use Given/When/Then/And wording in Allure step titles.
- Keep scenario steps as top-level test steps; avoid wrapping the full test body in synthetic container steps like
Execute test body. - Prefer one assertion per step where practical.
- Multiple assertions in one step are acceptable when they validate one cohesive invariant.
- Keep tests deterministic (fixed fixtures, explicit env flags, no timing assumptions).
Allure API
- Prefer repo helpers over direct raw Allure calls:
integration-tests/helpers/allure-test.ts- Common helpers:
itAllure,testAllure,allureStep,allureJsonAttachment,allurePrettyJsonAttachment,allureWordLikeTextAttachment,allureParameter,allureSeverity
- Do not import from
allure-vitestin tests. - Keep helper usage consistent across
src/**/*.test.tsandintegration-tests/**/*.test.ts. - For complex fixtures/stateful flows, attach:
- a pretty JSON artifact for request/result payloads
- a human-readable âWord-likeâ artifact for document/checklist state before and after mutation when relevant
- Rendering note: HTML preview attachments rely on the report post-process sanitizer allowlist patch (
scripts/patch_allure_html_sanitizer.mjs, invoked bynpm run report:allure). Do not bypass this pipeline when generating reports for review.
File naming and placement
- Use collocated test files like
src/<module>.test.ts. - Add Allure style inside these tests; do not split by “allure-only” test types by default.
- Keep one test file focused on one module or capability.
- Migration policy: gradually rename legacy
*.allure.test.tsfiles to*.test.ts; do not introduce new*.allure.test.tsfiles.
OpenSpec traceability
- Use
.openspec('OA-###')whenever a matching scenario ID exists for the behavior. - Scenario IDs may come from either canonical specs (
openspec/specs/open-agreements/spec.md) or active change-package specs (openspec/changes/<change-id>/specs/open-agreements/spec.md). - Pre-canonical IDs from active change packages are valid during implementation and should remain stable when promoted into canonical specs.
- For new important behavior, add scenario IDs in the active change package first, map tests immediately, then promote those IDs into canonical specs when archiving.
Coverage expansion workflow
- Read coverage summaries and identify branch-heavy modules in
src/core/**and integration flows. - Rank by blast radius and mutation risk.
- Add tests in this order:
- Validation and error branches
- Strict vs permissive behavior
- No-partial-mutation / transactional guarantees
- Invariants (deterministic outputs, schema safety, idempotency)
- Run targeted tests first, then full suite and coverage.
Severity recommendation rubric
critical: mutation correctness, legal-output integrity, data-loss risk, security/policy guardrails.normal: standard behavior and compatibility scenarios.minor: narrow edge cases with low production impact.- Apply severity based on failure impact, not module ownership.
Command checklist
npm run test:run
npm run test:coverage
npm run check:spec-coverage
npm run check:allure-labels
Minimal test template (TypeScript)
import { describe, expect } from 'vitest';
import { itAllure as it, allureStep, allureJsonAttachment } from '../../../integration-tests/helpers/allure-test.js';
describe('checklist patch behavior', () => {
it('applies replacement deterministically', async () => {
let result: { ok: boolean };
await allureStep('Given a valid patch payload', async () => {
await allureJsonAttachment('patch-input.json', {
patch_id: 'patch_001',
operations: [{ op: 'replace', path: '/issues/0/status', value: 'CLOSED' }],
});
});
await allureStep('When patch validation runs', async () => {
result = { ok: true };
});
await allureStep('Then validation succeeds', async () => {
expect(result!.ok).toBe(true);
});
});
});
Extended reference
- See
references/allure-test-spec-writing-guide.mdfor full Allure step-writing guidance.