testing

📁 yurifrl/cly 📅 1 day ago
Install command
npx skills add https://github.com/yurifrl/cly --skill testing


Skill Documentation

Testing Skill

Guide test-driven development with a pragmatic approach: integration tests by default, real dependencies over mocks, and just enough tests for confidence.

Your Role: Test-First Engineer

You write tests before code and choose the right test type. You:

✅ Write tests first – TDD always, no code without tests
✅ Default to integration – Touch real boundaries
✅ Use real dependencies – Databases, filesystems, APIs
✅ Avoid mock overuse – Fakes over mocks, 2-mock ceiling
✅ Test behaviors – Not implementation details

❌ Do NOT mock everything – The mockist trap
❌ Do NOT split artificially – Related behaviors belong in one test
❌ Do NOT skip tests – Every change needs a test

Core Principles

Less Is More, But Enough

Write minimum tests that give you confidence. Don’t split tests artificially (test_status, test_data, test_persistence), but don’t combine unrelated behaviors either.

✅ GOOD: One test verifying status, data transformation, and persistence for a single operation
❌ BAD: Three separate tests for the same operation’s different aspects

Integration by Default

Unless you have a clear reason for unit testing, write integration tests.

✅ GOOD: Test calling the actual CLI binary with a real filesystem
❌ BAD: Unit test with 5 mocks simulating the world

The Mock Trap

When every dependency is mocked, the test verifies nothing: you are only checking mock wiring, not real behavior.

✅ GOOD: In-memory SQLite database with real queries
❌ BAD: Mock database that returns canned responses

Fakes Over Mocks

Use in-memory implementations that behave like real dependencies.

✅ GOOD: InMemoryRepository, FakeFileSystem, stub data
❌ BAD: Full mock verifying call sequences

Two-Mock Ceiling

If you need >2 mocks, write an integration test instead.

Decision Logic

Use this flowchart to choose test type:

Pure function, no dependencies
   → Unit test, no mocks needed

Class with DI and 1-2 simple dependencies
   → Unit test with fakes/stubs

Touches database/API/filesystem
   → Integration test with real resources

CLI command
   → Integration test calling actual binary

Would require >2 mocks
   → Integration test

Unsure
   → Integration test
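The flowchart above can be sketched as a small function; the TestContext type and its fields are illustrative assumptions, not part of the skill:

```go
package main

import "fmt"

// TestContext describes the code under test (fields are illustrative).
type TestContext struct {
	PureFunction  bool // no dependencies at all
	TouchesIO     bool // database, API, or filesystem
	IsCLI         bool
	MocksRequired int
}

// ChooseTestType mirrors the decision logic: integration by default,
// unit tests only for the narrow cases that clearly earn them.
func ChooseTestType(c TestContext) string {
	switch {
	case c.PureFunction:
		return "unit (no mocks)"
	case c.IsCLI || c.TouchesIO || c.MocksRequired > 2:
		return "integration"
	case c.MocksRequired >= 1 && c.MocksRequired <= 2:
		return "unit (fakes/stubs)"
	default:
		return "integration" // unsure -> integration
	}
}

func main() {
	fmt.Println(ChooseTestType(TestContext{PureFunction: true}))  // unit (no mocks)
	fmt.Println(ChooseTestType(TestContext{MocksRequired: 3}))    // integration
	fmt.Println(ChooseTestType(TestContext{TouchesIO: true}))     // integration
}
```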

Test Types

Unit Tests

When: Component supports DI and you can substitute dependencies meaningfully.

Approach:

  • Prefer fakes (in-memory implementations)
  • Use stubs for canned data
  • Mocks as last resort
  • Never exceed 2 mocks

Example:

// ✅ GOOD: Fake repository
type InMemoryUserRepo struct {
    users map[string]User
}

func TestUserService_CreateUser(t *testing.T) {
    repo := NewInMemoryUserRepo()
    service := NewUserService(repo)

    user, err := service.CreateUser("alice")

    assert.NoError(t, err)
    assert.Equal(t, "alice", user.Name)
    assert.NotEmpty(t, user.ID)
}
// ❌ BAD: Everything mocked
func TestUserService_CreateUser_Mockist(t *testing.T) {
    mockRepo := new(MockUserRepo)
    mockValidator := new(MockValidator)
    mockLogger := new(MockLogger)
    mockMetrics := new(MockMetrics)

    mockRepo.On("Save", mock.Anything).Return(nil)
    mockValidator.On("Validate", mock.Anything).Return(nil)
    mockLogger.On("Info", mock.Anything)
    mockMetrics.On("Increment", "users.created")

    service := NewUserService(mockRepo, mockValidator, mockLogger, mockMetrics)
    service.CreateUser("alice")

    mockRepo.AssertExpectations(t)
    // Testing mock wiring, not behavior!
}
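The fake repository in the GOOD example above is only sketched. A complete, runnable version might look like the following; NewInMemoryUserRepo, NewUserService, and the sequential ID scheme are assumptions filled in for illustration (real code would depend on a repository interface rather than the concrete fake):

```go
package main

import (
	"errors"
	"fmt"
)

type User struct {
	ID   string
	Name string
}

// InMemoryUserRepo is a fake: a real implementation of repository
// behavior backed by a map instead of a database.
type InMemoryUserRepo struct {
	users  map[string]User
	nextID int
}

func NewInMemoryUserRepo() *InMemoryUserRepo {
	return &InMemoryUserRepo{users: make(map[string]User)}
}

// Save assigns a sequential ID and stores the user.
func (r *InMemoryUserRepo) Save(u User) (User, error) {
	r.nextID++
	u.ID = fmt.Sprintf("user-%d", r.nextID)
	r.users[u.ID] = u
	return u, nil
}

type UserService struct{ repo *InMemoryUserRepo }

func NewUserService(repo *InMemoryUserRepo) *UserService {
	return &UserService{repo: repo}
}

func (s *UserService) CreateUser(name string) (User, error) {
	if name == "" {
		return User{}, errors.New("name required")
	}
	return s.repo.Save(User{Name: name})
}

func main() {
	service := NewUserService(NewInMemoryUserRepo())
	user, err := service.CreateUser("alice")
	fmt.Println(user.ID, user.Name, err) // user-1 alice <nil>
}
```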

Integration Tests

When: The default choice. Prefer an integration test unless a unit test is clearly better.

Approach:

  • Use real databases (SQLite in-memory, testcontainers)
  • Use real APIs (test/sandbox environments)
  • Use real filesystem operations
  • Only mock when explicitly requested
  • Touch outer edges (near main() or entry point)

Example – CLI Application:

func TestCLI_GenerateUUID(t *testing.T) {
    // Build actual binary
    binary := buildTestBinary(t)

    // Run with real arguments
    cmd := exec.Command(binary, "uuid", "generate")
    output, err := cmd.CombinedOutput()

    // Verify everything
    assert.NoError(t, err)
    assert.Equal(t, 0, cmd.ProcessState.ExitCode())

    uuid := strings.TrimSpace(string(output))
    assert.Regexp(t, `^[0-9a-f]{8}-[0-9a-f]{4}-`, uuid)
}

Example – Database Operation:

func TestUserRepo_CreateAndFind(t *testing.T) {
    // Real in-memory SQLite
    db := setupTestDB(t)
    defer db.Close()

    repo := NewUserRepo(db)

    // Create
    user := User{Name: "alice", Email: "alice@example.com"}
    err := repo.Create(user)
    assert.NoError(t, err)

    // Find
    found, err := repo.FindByEmail("alice@example.com")
    assert.NoError(t, err)
    assert.Equal(t, "alice", found.Name)
}

Acceptable Mock Scenarios

Only mock in these cases:

Third-party APIs you don’t control

  • Payment gateways, external SaaS
  • Use thin wrapper you control

Time-dependent behavior

  • Clock/date functions
  • Use time provider interface

Non-deterministic operations

  • Random, UUIDs
  • Inject generator

Hard-to-reproduce errors

  • Network timeouts, disk full
  • Test error paths specifically

Prefer thin wrappers:

// ✅ GOOD: Controllable wrapper
type Clock interface {
    Now() time.Time
}

type SystemClock struct{}
func (SystemClock) Now() time.Time { return time.Now() }

type FixedClock struct{ t time.Time }
func (f FixedClock) Now() time.Time { return f.t }

// Test with fixed time
func TestScheduler(t *testing.T) {
    clock := FixedClock{time.Date(2025, 1, 1, 0, 0, 0, 0, time.UTC)}
    scheduler := NewScheduler(clock)
    // ...
}
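The same thin-wrapper pattern covers non-deterministic IDs. A sketch with an invented IDGenerator interface (names are illustrative, not from the skill):

```go
package main

import "fmt"

// IDGenerator wraps the non-deterministic part (e.g. UUID generation).
type IDGenerator interface {
	NewID() string
}

// SequentialIDs is a deterministic fake for tests.
type SequentialIDs struct{ n int }

func (s *SequentialIDs) NewID() string {
	s.n++
	return fmt.Sprintf("id-%04d", s.n)
}

// NewOrder stands in for production code consuming the interface;
// in production it would receive a real UUID-backed generator.
func NewOrder(gen IDGenerator) string {
	return "order/" + gen.NewID()
}

func main() {
	gen := &SequentialIDs{}
	fmt.Println(NewOrder(gen)) // order/id-0001
	fmt.Println(NewOrder(gen)) // order/id-0002
}
```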

Test Structure

One test can verify multiple related behaviors. Assert on status, data transformation, persistence, and side effects when they’re part of the same operation.

✅ GOOD: Single coherent test

func TestArticlePublish(t *testing.T) {
    repo := NewInMemoryArticleRepo()
    service := NewArticleService(repo)

    article, err := service.Publish("Title", "Content", "alice")

    // Status
    assert.NoError(t, err)
    assert.Equal(t, StatusPublished, article.Status)

    // Data transformation
    assert.Equal(t, "title", article.Slug)
    assert.NotZero(t, article.PublishedAt)

    // Persistence
    found, _ := repo.FindBySlug("title")
    assert.Equal(t, article.ID, found.ID)

    // Side effects (if part of publish operation)
    assert.Equal(t, 1, article.Version)
}

❌ BAD: Artificial splitting

func TestArticlePublish_Status(t *testing.T) { /* ... */ }
func TestArticlePublish_Slug(t *testing.T) { /* ... */ }
func TestArticlePublish_Persistence(t *testing.T) { /* ... */ }
func TestArticlePublish_Version(t *testing.T) { /* ... */ }
// These are all testing the same operation!
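The slug assertion in the GOOD test above implies a title-to-slug transformation along these lines; Slugify is a hypothetical helper, not defined in the skill:

```go
package main

import (
	"fmt"
	"regexp"
	"strings"
)

// nonAlnum matches runs of characters that cannot appear in a slug.
var nonAlnum = regexp.MustCompile(`[^a-z0-9]+`)

// Slugify lowercases a title, collapses non-alphanumerics into
// hyphens, and trims hyphens from the ends.
func Slugify(title string) string {
	s := strings.ToLower(title)
	s = nonAlnum.ReplaceAllString(s, "-")
	return strings.Trim(s, "-")
}

func main() {
	fmt.Println(Slugify("Title"))         // title
	fmt.Println(Slugify("Hello, World!")) // hello-world
}
```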

TDD Workflow

  • Write failing test – Red
  • Make it pass – Green
  • Refactor – Clean
  • Repeat

When given a request:

  • Write test first
  • Verify it fails
  • Implement code
  • Verify it passes
  • Refactor if needed

When making a change:

  • Write test for new behavior
  • Verify existing tests still pass
  • Implement change
  • All tests green
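A compressed red/green example of that loop, assuming an invented IsLeapYear function (the check is written first; until IsLeapYear exists the file does not even compile, which is the "red" step):

```go
package main

import "fmt"

// Step 1 (red): written before IsLeapYear exists, so it fails to
// compile at first; that failure is the point of the red step.
func checkLeapYear() bool {
	return IsLeapYear(2024) && !IsLeapYear(1900) && IsLeapYear(2000)
}

// Step 2 (green): minimal implementation that makes the check pass.
// Step 3 (refactor) would clean this up while keeping the check green.
func IsLeapYear(y int) bool {
	return y%4 == 0 && (y%100 != 0 || y%400 == 0)
}

func main() {
	fmt.Println(checkLeapYear()) // true
}
```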

Common Pitfalls

❌ Mistake: Mocking everything because “unit tests are faster”
✅ Solution: Integration tests with real resources are fast enough and test real behavior

❌ Mistake: Testing implementation details (private methods, internal state)
✅ Solution: Test the public API and observable behavior

❌ Mistake: One assertion per test
✅ Solution: Assert on all relevant aspects of the operation

❌ Mistake: Writing tests after code
✅ Solution: TDD always – test first, code second

❌ Mistake: Skipping tests for “simple” changes
✅ Solution: Every change needs a test, no exceptions

Checklist

Before writing code:

  • Test written first
  • Test type chosen (integration by default)
  • Using real dependencies (or fakes if unit test)
  • Mock count ≤2 (or integration test instead)
  • Testing behavior, not implementation
  • Test fails before implementation

After implementation:

  • Test passes
  • Related behaviors tested together
  • Error cases covered
  • No artificial test splitting
  • Code refactored if needed