code-review

📁 sebastiaanwouters/dotagents 📅 Jan 23, 2026

总安装量

周安装量

#34999

全站排名

安装命令

npx skills add https://github.com/sebastiaanwouters/dotagents --skill code-review

Agent 安装分布

claude-code 4

windsurf 1

amp 1

trae 1

opencode 1

codex 1

Skill 文档

Code Review Skill

Reviews code through battle-tested engineering principles with comprehensive quality gates.

Usage

Tell me what to review:

“Review src/auth.ts“
“Review the changes in git diff HEAD~3“
“Review this PR for merge readiness”

Review Process

Read the code â Understand what it does
Verify functionality â Check for bugs, edge cases, correctness
Assess test coverage â Ensure changed code has adequate tests
Evaluate test quality â Check tests follow best practices (not flaky)
Check guideline compliance â Verify AGENTS.md/CLAUDE.md adherence
Apply philosophy lenses â Check against engineering principles
Run QA checks â Execute lint, typecheck, tests (after fixes)
Output structured findings â Categorized, actionable feedback

1. Functionality Verification

Correctness Checklist

Does the code do what it’s supposed to do?
Are all requirements/acceptance criteria met?
Are edge cases handled? (null, empty, boundary values, overflow)
Are error conditions handled gracefully?
Is the happy path tested AND working?
Are race conditions possible? (async, concurrent access)
Is state managed correctly? (no stale data, proper initialization)

Bug Detection Signals

Signal	Question
Off-by-one	Loop bounds, array indices, comparisons (< vs <=)
Null/undefined	Can any value be null when accessed?
Type coercion	Implicit conversions causing bugs? (JS: `"5" + 3`)
Resource leaks	Files, connections, memory properly closed?
Infinite loops	Can loop conditions fail to terminate?
Integer overflow	Large numbers handled safely?
Security holes	SQL injection, XSS, path traversal, secrets exposed?

2. Test Coverage Assessment

Coverage Requirements

New code: Must have tests covering primary functionality
Bug fixes: Must have regression test proving fix
Changed code: Tests should cover modified behavior
Critical paths: Auth, payments, data mutations require high coverage

Coverage Analysis

Is the changed code tested? â NO â Block: Add tests
     â YES
Are edge cases covered? â NO â Request edge case tests
     â YES
Are error paths tested? â NO â Request error handling tests
     â YES
Coverage adequate â

Google’s Coverage Guidelines

60%: Acceptable minimum
75%: Commendable
90%: Exemplary
Per-commit: 90%+ for changed lines is reasonable

Focus on what’s NOT covered rather than hitting arbitrary numbers.

3. Test Quality (Non-Flaky Tests)

FIRST Principles for Good Tests

Principle	Meaning
Fast	Run quickly (milliseconds for unit tests)
Isolated	No dependencies on other tests or external state
Repeatable	Same result every time, any environment
Self-validating	Pass or fail, no manual interpretation
Timely	Written close to the code (ideally before – TDD)

Flaky Test Detection

Tests are flaky if they fail intermittently without code changes.

Common Causes:

Time/date dependencies (use fixed/mocked time)
Random data without seed control
Shared mutable state between tests
Network/external service calls
Race conditions in async code
File system dependencies
Database state leakage
Order-dependent tests

Test Quality Checklist

Tests are deterministic (same input â same output)
Tests clean up after themselves (no side effects)
Tests don’t depend on execution order
Async operations properly awaited
External dependencies mocked/stubbed
Time-sensitive tests use controlled time
Random values use seeded generators
Tests have meaningful assertions (not just expect(true))
Test names describe behavior being tested
One logical assertion per test

Red Flags in Tests

sleep() or arbitrary delays
Commented-out assertions
Empty catch blocks swallowing failures
Tests that pass when logic is deleted (useless tests)
expect(result).toBeDefined() without checking value
Shared state between it() blocks
Network calls without mocking

4. QA Checks

Before Approving, Verify:

Lint passes: npm run lint, eslint, etc.
Types check: tsc --noEmit, npm run typecheck, etc.
Tests pass: npm test, pytest, etc.
Build succeeds: npm run build, cargo build, etc.

Automated QA Flow

Run lint â FAIL â Must fix before merge
    â PASS
Run typecheck â FAIL â Must fix before merge
    â PASS
Run tests â FAIL â Must fix (unless pre-existing)
    â PASS
Build â FAIL â Must fix before merge
    â PASS
QA Passed â

5. Guidelines Compliance

AGENTS.md / CLAUDE.md Adherence

Check if the codebase has guidance files:

AGENTS.md or CLAUDE.md in root or relevant directories
Project-specific coding standards
Architecture decisions and patterns

Compliance Checklist

Follows documented code style
Uses approved libraries/patterns
Adheres to naming conventions
Follows directory structure guidelines
Respects forbidden patterns (if documented)
Matches documented architectural decisions

6. Engineering Philosophy Lenses

Grug Brain Check (Complexity Demon Detection)

Is this the simplest solution?
Could a newcomer understand it in 30 seconds?
Are there unnecessary abstractions?

Code Smell Detection

Smell	Question
Long Method	>50 lines? One sentence description?
Long Params	>3 parameters? Group them?
Feature Envy	Uses another class more than its own?
Shotgun Surgery	One change = many file edits?
Dead Code	Unreachable or unused?

Unix Philosophy Check

Does each function do ONE thing well?
Can components be tested in isolation?
Are there side effects in “pure” functions?

DRY/KISS

Is logic duplicated?
Is there a simpler solution?

Output Format

## Summary
[One-line verdict: APPROVE / NEEDS WORK / RETHINK]

## QA Status
- Lint: â/â
- Typecheck: â/â
- Tests: â/â (X passed, Y failed)
- Build: â/â

## Functionality Issues
[Bugs, incorrect behavior, edge cases]

## Test Coverage Gaps
[Untested code paths, missing tests]

## Test Quality Issues
[Flaky tests, bad patterns]

## Guidelines Violations
[AGENTS.md/CLAUDE.md non-compliance]

## Code Quality
[Design issues, maintainability]

## Nitpicks
[Minor suggestions]

## What's Good
[Acknowledge solid work]

Severity Levels

Level	Meaning	Action
ð´ Critical	Bugs, security, data loss, broken tests	Block merge
ð Important	Missing tests, design issues	Should fix
ð¡ Suggestion	Could be better	Consider
ð¢ Nitpick	Style, minor preference	Optional

Master Decision Tree

Does it work correctly? â NO â ð´ Fix bugs first
     â YES
Does it have tests? â NO â ð´ Add tests
     â YES
Are tests good (not flaky)? â NO â ð  Fix test quality
     â YES
Does QA pass? â NO â ð´ Fix QA failures
     â YES
Follows guidelines? â NO â ð  Align with standards
     â YES
Is it simple? â NO â ð¡ Consider simplifying
     â YES
APPROVE â

What NOT to Do

Don’t nitpick formatting (use formatters)
Don’t demand unnecessary abstractions
Don’t block on style preferences
Don’t rewrite working code for elegance alone
Don’t ignore test failures as “probably flaky”

GitHub 仓库 ↗ ← 返回陌讯 Skills 聚合平台