refactoring-legacy-code

📁 lv416e/dotfiles 📅 1 day ago

总安装量

周安装量

安装命令

npx skills add https://github.com/lv416e/dotfiles --skill refactoring-legacy-code

Agent 安装分布

amp 1

cline 1

opencode 1

cursor 1

continue 1

kimi-cli 1

Skill 文档

Refactoring Legacy Code

Overview

Big-bang rewrites fail. Refactoring without tests creates new bugs. Scope creep turns a “quick cleanup” into a six-month project.

Core principle: ALWAYS write characterization tests before changing legacy code. Understand current behavior before changing it.

Violating the letter of this process is violating the spirit of refactoring.

The Iron Law

NO REFACTORING WITHOUT CHARACTERIZATION TESTS FIRST

If you haven’t captured existing behavior in tests, you cannot change the code.

No exceptions:

Not for “obvious” improvements
Not for “simple” renames
Not for “just moving code around”
Not even for formatting changes that touch logic

When to Use

Use for ANY code modernization work:

Framework/library migration (class components to hooks, Angular to React, Python 2 to 3)
Updating deprecated APIs
Breaking monoliths into modules
Replacing legacy patterns with modern idioms
Reducing technical debt
Extracting shared libraries
Upgrading major dependency versions

Use this ESPECIALLY when:

Code has no tests (most legacy code)
You don’t fully understand what the code does
Multiple teams depend on this code
“Just a quick refactor” is proposed
Someone says “let’s rewrite it from scratch”
The code is older than the team working on it

Don’t skip when:

Refactoring seems trivial (trivial changes break things too)
You’re confident you understand the code (you don’t, fully)
Deadline pressure says “just do it” (broken code after refactoring costs more)

The Five Phases

You MUST complete each phase before proceeding to the next.

Phase 1: Reconnaissance

BEFORE touching ANY code:

Map the Dependency Graph

What depends on this code?
What does this code depend on?
Draw it out: imports, API consumers, database schemas, config files
Use tooling: dependency analyzers, import graphs, call hierarchies

# Example: find all consumers of a module
grep -r "import.*from.*legacy-module" --include="*.ts" src/
grep -r "require.*legacy-module" --include="*.js" src/

# Example: find all callers of a function
grep -rn "legacyFunction\(" --include="*.py" .

Identify the Blast Radius
- What breaks if this code changes?
- Which teams, services, or systems are affected?
- Are there downstream consumers you don’t control?
- Is there a public API contract?

Catalog Code Smells and Prioritize

Not all smells are equal. Prioritize by impact:

Priority	Smell	Why
Critical	Shared mutable state	Causes race conditions, impossible to reason about
Critical	No separation of concerns	Changes ripple everywhere
High	God classes/functions (500+ lines)	Untestable, incomprehensible
High	Circular dependencies	Prevents modularization
Medium	Copy-paste duplication	Bugs fixed in one copy, not others
Medium	Primitive obsession	Stringly-typed code hides bugs
Low	Naming conventions	Confusing but functional
Low	Formatting inconsistency	Cosmetic, fix with tooling

Understand Before Changing
- Read the code thoroughly. Don’t skim.
- Trace execution paths manually
- Read commit history: WHY was it written this way?
- Check for commented-out code with explanations
- Look for “HACK”, “TODO”, “FIXME” comments – they’re documentation
Define the Target State
- What does “done” look like?
- Draw the target architecture
- Write it down. Be specific.
- If you can’t describe the end state, you’re not ready to start

Phase 2: Characterization Tests

REQUIRED before any code changes. No exceptions.

Characterization tests capture CURRENT behavior, not DESIRED behavior. They answer: “What does this code actually do?”

Write Tests for Current Behavior

# Characterization test: capture what the code DOES, not what it SHOULD do
def test_legacy_price_calculator_with_negative_quantity():
    # Legacy code silently returns 0 for negative quantities
    # This might be a bug, but it's CURRENT behavior
    result = calculate_price(item="widget", quantity=-5, price=10.00)
    assert result == 0  # Captures actual behavior

def test_legacy_price_calculator_with_none_price():
    # Legacy code treats None as 0 - probably wrong, but current behavior
    result = calculate_price(item="widget", quantity=5, price=None)
    assert result == 0  # Captures actual behavior

Cover All Code Paths
- Happy paths
- Error paths
- Edge cases (null, empty, negative, overflow)
- Boundary conditions
- Side effects (files written, APIs called, state mutated)

Use Approval Testing for Complex Output

For code with complex output (HTML, reports, serialized data):

def test_legacy_report_generator():
    result = generate_report(sample_data)
    # First run: manually approve the output as "golden"
    # Subsequent runs: compare against golden file
    assert result == load_golden_file("report_golden.txt")

Test the Boundaries, Not the Internals
- Test at public API surfaces
- Test at module boundaries
- Don’t test private methods (they’ll change during refactoring)
- Focus on inputs and outputs
Verify Tests Catch Changes
- Make a small intentional change to the legacy code
- Confirm at least one test fails
- Revert the change
- If no test fails, your tests are inadequate

Characterization test coverage must be sufficient before proceeding.

Phase 3: Incremental Migration Strategy

Choose your pattern. Big-bang is NOT an option.

The Strangler Fig Pattern

Wrap legacy code. Route new calls through new code. Gradually migrate old calls. Remove legacy when empty.

Phase A: Legacy handles everything
  [All Traffic] â [Legacy System]

Phase B: New code wraps legacy, intercepts some paths
  [All Traffic] â [Facade/Router]
                      âââ [New Code] (migrated paths)
                      âââ [Legacy Code] (remaining paths)

Phase C: All paths migrated
  [All Traffic] â [New Code]
  [Legacy Code] â (dead, remove it)

Implementation:

// Step 1: Create facade that delegates to legacy
class UserServiceFacade {
  private legacy = new LegacyUserService();

  getUser(id: string): User {
    return this.legacy.getUser(id);  // Pass-through initially
  }
}

// Step 2: Migrate one method at a time
class UserServiceFacade {
  private legacy = new LegacyUserService();
  private modern = new ModernUserService();

  getUser(id: string): User {
    return this.modern.getUser(id);  // Migrated!
  }

  updateUser(id: string, data: UserData): void {
    return this.legacy.updateUser(id, data);  // Not yet migrated
  }
}

// Step 3: When all methods migrated, remove facade and legacy

Branch by Abstraction

For internal modules you can’t wrap with a facade:

Create an abstraction (interface) over the existing implementation
Modify all clients to use the abstraction
Build new implementation behind same abstraction
Switch implementations (feature flag, config, or direct swap)
Remove old implementation

// Step 1: Extract interface from legacy
interface DataStore {
  get(key: string): Promise<unknown>;
  set(key: string, value: unknown): Promise<void>;
}

// Step 2: Legacy implements interface
class LegacyFileStore implements DataStore { /* ... */ }

// Step 3: New implementation
class ModernDatabaseStore implements DataStore { /* ... */ }

// Step 4: Swap via configuration
const store: DataStore = config.useModernStore
  ? new ModernDatabaseStore()
  : new LegacyFileStore();

Feature Flags for Gradual Rollout

function processOrder(order: Order): Result {
  if (featureFlags.isEnabled('modern-order-processing', order.userId)) {
    return modernProcessOrder(order);
  }
  return legacyProcessOrder(order);
}

Roll out to 1% of users, then 10%, 50%, 100%. Rollback instantly if issues arise.

Phase 4: Execute Refactoring

One transformation at a time. Run tests after EVERY change.

Make ONE Change
- Rename a function
- Extract a method
- Move a class
- Replace one deprecated API call
- ONE thing
Run ALL Tests
```
# After every single change
npm test        # or pytest, or cargo test, or whatever
```
- All characterization tests pass? Continue.
- Any test fails? Revert. Understand why. Try again.

Commit Frequently

Each atomic change gets its own commit
Commit messages describe the refactoring step
Easy to bisect if something breaks later
Easy to revert one step without losing everything

git commit -m "Extract validation logic from OrderProcessor.process()"
git commit -m "Rename UserManager to UserRepository"
git commit -m "Replace deprecated crypto.createCipher with crypto.createCipheriv"

Common Safe Transformations

These are mechanical and should not change behavior:

Transformation	Risk	Verify
Rename (variable, function, class)	Low	Tests pass, grep for old name
Extract method/function	Low	Tests pass, behavior identical
Inline method/function	Low	Tests pass
Move to different file/module	Medium	Tests pass, imports updated
Extract interface	Low	Tests pass, no behavior change
Replace inheritance with composition	Medium	Tests pass, behavior identical
Replace deprecated API	Medium	Tests pass, read migration guide completely
Change data structure	High	Tests pass, check serialization/persistence

Parallel Implementation for High-Risk Changes

When transformation risk is high:

def process_payment(order):
    legacy_result = legacy_process_payment(order)
    modern_result = modern_process_payment(order)

    if legacy_result != modern_result:
        log.error(f"MISMATCH: legacy={legacy_result}, modern={modern_result}")
        # Use legacy result until mismatch rate is 0%
        return legacy_result

    return modern_result

Phase 5: Cleanup and Verification

Remove Dead Code
- Delete the legacy code paths
- Remove feature flags for completed migrations
- Delete characterization tests that tested legacy-specific behavior
- Keep tests that verify correct behavior regardless of implementation
Update Documentation
- Architecture diagrams
- API documentation
- Onboarding guides
- Dependency maps
Verify End-to-End
- Integration tests pass
- Performance benchmarks meet or exceed legacy
- No regressions in dependent systems
- Monitoring shows healthy behavior
Retrospective
- What took longer than expected?
- What broke that tests didn’t catch?
- What would you do differently?
- Update this process for next time

Red Flags – STOP and Follow Process

If you catch yourself thinking:

“Let’s just rewrite it from scratch”
“This code is so bad, tests won’t help”
“I understand it well enough to change it without tests”
“Quick rename, no tests needed”
“Let me refactor this other thing while I’m here”
“We’ll add tests after the refactoring”
“It’s just moving code around”
“The old tests will catch any problems”
“Let me fix this bug I found while refactoring”
“This is taking too long, let me skip the characterization tests”
“One big PR is easier to review than ten small ones”

ALL of these mean: STOP. Return to Phase 2.

If scope is growing: You’re in scope creep. Finish the current refactoring. File tickets for new work. ONE thing at a time.

Common Rationalizations

Excuse	Reality
“Code is too messy to test”	That’s exactly why you need tests. Use characterization tests at the boundary.
“Rewrite will be faster”	Rewrites take 3-10x longer than estimated. You’ll re-discover every edge case the hard way.
“Tests slow us down”	Tests slow you down 10 minutes now. No tests slow you down 10 days later.
“I know what this code does”	You know what you THINK it does. Characterization tests reveal what it ACTUALLY does.
“Small change, no test needed”	Small changes compound. One untested change invites another.
“We’ll migrate all at once over the weekend”	Big-bang migrations fail. Always. No exceptions.
“Legacy code is too coupled to test”	Use seams: subclass-and-override, extract-and-override, parameterize constructor.
“Refactoring while fixing bugs saves time”	Mixing refactoring with behavior changes makes both harder to verify. Separate them.
“The old tests are enough”	Old tests test old behavior. Characterization tests verify you understand CURRENT behavior.
“Feature flags add complexity”	Feature flags add CONTROLLED complexity. Big-bang adds UNCONTROLLED chaos.
“Nobody understands this code anyway”	That’s the reason to go slow, not fast.

Framework/Library Migration Checklist

When migrating frameworks (e.g., class components to hooks, jQuery to React, Python 2 to 3):

Read the official migration guide COMPLETELY – Don’t skim. Every line.

Identify all deprecated patterns in your codebase

# Example: find React class components
grep -rn "extends React.Component\|extends Component" --include="*.tsx" src/

Create a migration spreadsheet – Every file, current pattern, target pattern, status
Migrate ONE file first – The simplest one. Get it working. Learn the gotchas.
Document the gotchas – Every surprise becomes a checklist item
Migrate in dependency order – Leaves first, then branches, then trunk

Run codemods where possible – Automated transforms are safer than manual

# Example: React class to function component codemod
npx react-codemod rename-unsafe-lifecycles ./src

Verify each file independently – Tests pass after each file migration
Never mix migration with feature work – Separate commits, separate PRs

Dependency Graph Analysis

Before touching code, understand the dependency graph:

Step 1: Map imports/requires
  module-a â module-b â module-c
                       â module-d
  module-e â module-b

Step 2: Identify the refactoring order
  Leaves first: module-c, module-d (no dependents to break)
  Then: module-b (update after c and d are stable)
  Then: module-a, module-e (consumers of b)

Step 3: Identify circular dependencies (DANGER)
  module-a â module-b â module-a  â BREAK THIS FIRST

Rules:

Refactor leaves before branches
Never refactor a module while its dependencies are also changing
Break circular dependencies BEFORE other refactoring
One layer at a time, bottom-up

Breaking Monoliths into Modules

Identify Seams – Natural boundaries where code can be split
Extract Data First – Separate data access from business logic
Create Module Boundaries – Define interfaces between modules
Enforce Boundaries – Lint rules, import restrictions, separate packages
Extract Gradually – One module at a time, verify after each

// Before: God module
// user-management.ts (2000 lines)
export function createUser() { /* ... */ }
export function authenticateUser() { /* ... */ }
export function getUserProfile() { /* ... */ }
export function updateUserPreferences() { /* ... */ }
export function sendUserNotification() { /* ... */ }
export function generateUserReport() { /* ... */ }

// After: Separated by concern
// users/creation.ts
export function createUser() { /* ... */ }

// users/authentication.ts
export function authenticateUser() { /* ... */ }

// users/profile.ts
export function getUserProfile() { /* ... */ }
export function updateUserPreferences() { /* ... */ }

// notifications/user-notifications.ts
export function sendUserNotification() { /* ... */ }

// reports/user-reports.ts
export function generateUserReport() { /* ... */ }

Updating Deprecated APIs

Find all usages of the deprecated API
Read the deprecation notice – What replaces it? What changed?
Write characterization tests around each usage
Replace ONE usage at a time
Test after each replacement
Search again – Did you miss any? Dynamic calls? Reflection? Config files?

# Before: deprecated API
import warnings
with warnings.catch_warnings():
    warnings.simplefilter("error", DeprecationWarning)
    # If this throws, you have deprecated API usage
    result = legacy_function()

# Systematic replacement
# 1. Find all usages
# 2. Replace one at a time
# 3. Test after each
# 4. Grep again to confirm zero remaining

Quick Reference

Phase	Key Activities	Success Criteria
1. Reconnaissance	Map dependencies, catalog smells, define target	Understand blast radius and end state
2. Characterization Tests	Test current behavior at boundaries	Intentional change causes test failure
3. Migration Strategy	Choose pattern (strangler fig, branch by abstraction)	Written plan with incremental steps
4. Execute	One change at a time, test after each, commit frequently	All tests pass after every change
5. Cleanup	Remove dead code, update docs, verify end-to-end	No legacy code remaining, all tests green

Integration with Other Skills

This skill requires using:

test-driven-development – REQUIRED for writing characterization tests (Phase 2) and for any new code written during refactoring. Characterization tests capture current behavior; TDD drives new behavior.

Complementary skills:

systematic-debugging – Use when refactoring reveals unexpected behavior. Don’t guess at the cause; follow the four-phase debugging process.
defense-in-depth – Add validation at multiple layers when replacing legacy code with new modules. Legacy code often relies on implicit assumptions; make them explicit.

When Process Reveals “Don’t Refactor”

Sometimes Phase 1 reveals:

The code works fine and nobody needs to change it
The blast radius is too large for current resources
A rewrite IS warranted (rare, but possible for truly isolated components with clear specs)
The code is scheduled for deletion anyway

These are valid outcomes. Not every piece of legacy code needs refactoring. The reconnaissance phase exists to prevent wasted effort.

But: “Too hard to test” is never a valid reason to skip refactoring. It’s a reason to invest more in characterization tests.

Real-World Impact

From refactoring projects:

Incremental migration: 95%+ success rate, predictable timeline
Big-bang rewrite: 30% success rate, 3-10x over budget
With characterization tests: near-zero regressions
Without characterization tests: average 2.5 production incidents per migration
Strangler fig pattern: zero-downtime migration, instant rollback capability

GitHub 仓库 ↗ ← 返回陌讯 Skills 聚合平台