experiment-design

📁 lingzhi227/claude-skills 📅 7 days ago

总安装量

周安装量

#32656

全站排名

安装命令

npx skills add https://github.com/lingzhi227/claude-skills --skill experiment-design

Agent 安装分布

codex 7

qoder 6

gemini-cli 6

qwen-code 6

claude-code 6

github-copilot 6

Skill 文档

Experiment Design

Design structured, progressive experiment plans for research papers.

Input

$0 â Research idea, plan, or method description

References

4-stage progressive experiment prompts: ~/.claude/skills/experiment-design/references/stage-prompts.md

Scripts

Generate experiment design

python ~/.claude/skills/experiment-design/scripts/design_experiments.py --plan research_plan.json --output experiment_design.json
python ~/.claude/skills/experiment-design/scripts/design_experiments.py --method "contrastive learning" --task classification --format markdown

Generates baselines, ablation matrix, hyperparameter grid, metric selection. Stdlib-only.

4-Stage Progressive Framework (from AI-Scientist-v2)

Stage 1: Initial Implementation

Focus on getting a basic working implementation
Use a simple dataset
Aim for basic functional correctness
Completion: at least one working (non-buggy) implementation

Stage 2: Baseline Tuning

Tune hyperparameters (learning rate, epochs, batch size)
Do NOT change model architecture
Test on at least TWO datasets
Completion: stable training curves, improvement over Stage 1

Stage 3: Creative Research

Explore novel improvements and insights
Be creative and think outside the box
Test on at least THREE datasets
Completion: demonstrated novel improvement

Stage 4: Ablation Studies

Systematic component analysis
Each ablation tests a different aspect
Use same datasets as Stage 3
Completion: all planned ablations done

Output Format

{
  "stages": [
    {
      "name": "initial_implementation",
      "goals": ["Basic working baseline", "Simple dataset"],
      "max_iterations": 5,
      "completion_criteria": "Working implementation with non-zero accuracy"
    }
  ],
  "baselines": ["Method A", "Method B"],
  "datasets": ["Dataset1", "Dataset2", "Dataset3"],
  "metrics": ["accuracy", "F1", "inference_time"],
  "ablation_components": ["component_A", "component_B"],
  "hyperparameter_grid": {
    "lr": [1e-4, 1e-3, 1e-2],
    "batch_size": [32, 64, 128]
  },
  "num_seeds": 3
}

Rules

Always start simple (Stage 1) before complex experiments
Each stage builds on the best result from the previous stage
Multi-seed evaluation for statistical significance
Document every experiment run in notes.txt
Generate figures for training curves and comparisons

Related Skills

Upstream: research-planning, idea-generation
Downstream: experiment-code, data-analysis
See also: paper-assembly

GitHub 仓库 ↗ ← 返回陌讯 Skills 聚合平台