genre-skill-builder
npx skills add https://github.com/nealcaren/social-data-analysis --skill genre-skill-builder
Agent 安装分布
Skill 文档
Genre Skill Builder
You help researchers create writing skills based on systematic genre analysis. Given a corpus of article sections (introductions, conclusions, methods, discussions, etc.), you guide users through analyzing genre patterns, discovering clusters, and generating a complete skill that can guide future writing.
What This Skill Does
This is a meta-skillâit creates other skills. The output is a fully-functional writing skill like lit-writeup or interview-bookends, with:
- A main
SKILL.mdwith genre-based guidance - Phase files for a structured writing workflow
- Cluster profiles based on discovered patterns
- Technique guides for sentence-level craft
When to Use This Skill
Use this skill when you want to:
- Create a writing guide for a specific article section (e.g., Discussion sections, Abstract, Methodology)
- Base guidance on empirical analysis of a corpus rather than intuition
- Generate a skill that follows the repository’s phased architecture
- Produce cluster-based guidance that recognizes different writing styles
What You Need
-
A corpus of article sections (30+ recommended)
- Text files, PDFs, or markdown
- All from the same section type (all introductions, all conclusions, etc.)
- Ideally from target venues (e.g., Social Problems, Social Forces)
-
A model skill to learn from
- An existing skill like
lit-writeuporinterview-bookends - Provides structural template for the generated skill
- An existing skill like
Connection to Other Skills
This skill adapts the methodology from:
| Skill | What We Borrow |
|---|---|
| interview-analyst | Systematic coding approach (Phases 1-3) |
| lit-writeup | Cluster-based writing guidance structure |
| interview-bookends | Benchmarks and coherence checking |
Core Principles
-
Empirical grounding: All guidance derives from corpus analysis, not intuition.
-
Cluster discovery: Different articles do the same job in different ways; identify the styles.
-
Quantitative + qualitative: Count features AND interpret patterns.
-
Template-based generation: Use parameterized templates, not free-form writing.
-
Pauses for judgment: Human decisions shape cluster boundaries and naming.
-
The user is the expert: They know the genre; we provide methodological support.
Workflow Phases
Phase 0: Scope Definition & Model Selection
Goal: Define what we’re building and what to learn from.
Process:
- Identify the target article section (introduction, conclusion, methods, discussion, etc.)
- Select an existing skill as a structural model
- Review model skill to identify elements to extract
- Confirm corpus location and article count
Output: Scope definition memo with target section, model skill, corpus path.
Pause: User confirms scope and model selection.
Phase 1: Corpus Immersion
Goal: Build quantitative profile of the corpus.
Process:
- Count articles, calculate word counts, paragraph counts
- Identify structural patterns (headings, subsections)
- Generate descriptive statistics (median, IQR, range)
- Flag outliers and notable examples
- Create initial observations about variation
Output: Immersion report with corpus statistics.
Pause: User reviews quantitative profile.
Phase 2: Systematic Genre Coding
Goal: Code each article for genre features.
Process:
- Develop codebook based on model skill’s categories
- Code opening moves, structural elements, rhetorical strategies
- Track frequency and co-occurrence of features
- Build article-by-article coding database
- Identify preliminary cluster candidates
Output: Codebook, article codes, preliminary clusters.
Pause: User reviews codebook and sample codes.
Phase 3: Pattern Interpretation & Cluster Discovery
Goal: Identify stable patterns and define cluster profiles.
Process:
- Analyze code co-occurrence patterns
- Define 3-6 cluster characteristics
- Calculate benchmarks for each cluster
- Identify signature moves and prohibited moves
- Extract exemplar quotes/passages
- Name clusters meaningfully
Output: Cluster profiles with benchmarks and exemplars.
Pause: User confirms cluster definitions.
Phase 4: Skill Generation
Goal: Generate the complete skill file structure.
Process:
- Generate
SKILL.mdusing template + findings - Generate phase files (typically 3-4 for writing skills)
- Generate cluster guide files (one per cluster)
- Generate technique guide files
- Generate
plugin.json - Prepare
marketplace.jsonentry
Output: Complete skill directory structure.
Pause: User reviews generated skill files.
Phase 5: Validation & Testing
Goal: Verify skill quality and test with sample input.
Process:
- Check all files are syntactically correct
- Verify benchmarks match analysis data
- Ensure cluster coverage is complete
- Identify any gaps or inconsistencies
- Optionally test with sample input
Output: Validation report with quality assessment.
Folder Structure for Analysis
project/
âââ corpus/ # Article sections to analyze
â âââ article-01.md
â âââ article-02.md
â âââ ...
âââ analysis/
â âââ phase0-scope/ # Scope definition
â âââ phase1-immersion/ # Quantitative profiling
â âââ phase2-coding/ # Genre coding
â âââ phase3-clusters/ # Pattern analysis
â âââ phase4-generation/ # Generated skill files
â âââ phase5-validation/ # Quality assessment
âââ output/ # Final skill plugin
âââ plugins/[skill-name]/
Code Categories to Track
Based on model skills, these are typical genre features to code:
Structural Features
- Word count, paragraph count
- Presence of subsections
- Heading structure
- Position of key elements
Opening Moves
- Phenomenon-led, stakes-led, theory-led, case-led, question-led
- First sentence type
- Hook strategy
Rhetorical Moves
- Gap identification
- Contribution claims
- Limitations
- Future directions
- Callbacks (for conclusions)
Citation Patterns
- Citation density
- Integration style (parenthetical, author-subject, quote-then-cite)
- Anchor sources vs. supporting citations
Linguistic Features
- Hedging level
- Temporal markers
- Transition patterns
- Key phrases
Cluster Discovery Guidelines
Minimum Clusters: 3
If fewer than 3 patterns emerge, the corpus may be too homogeneous or the coding scheme too coarse.
Maximum Clusters: 6
More than 6 typically indicates over-differentiation; look for higher-level groupings.
Cluster Naming
Name clusters by their dominant strategy, not their prevalence:
- “Gap-Filler” not “Cluster 1”
- “Theory-Extension” not “Common Type”
- “Problem-Driven” not “Applied Approach”
Cluster Validation
Each cluster should have:
- At least 10% of corpus (minimum 3 articles if corpus < 30)
- Distinctive benchmark values
- Clear signature moves
- At least one exemplar article
Template System
Phase 4 uses parameterized templates. Key parameters:
| Parameter | Source |
|---|---|
{{skill_name}} |
Phase 0 user input |
{{target_section}} |
Phase 0 user input |
{{cluster_names}} |
Phase 3 cluster discovery |
{{benchmarks}} |
Phase 1-2 statistics |
{{opening_moves}} |
Phase 2 coding |
{{signature_phrases}} |
Phase 2-3 analysis |
Technique Guides
Reference these guides for phase-specific instructions:
| Guide | Purpose |
|---|---|
phases/phase0-scope.md |
Scope definition, model selection |
phases/phase1-immersion.md |
Quantitative profiling |
phases/phase2-coding.md |
Genre coding methodology |
phases/phase3-interpretation.md |
Cluster discovery |
phases/phase4-generation.md |
Skill file generation |
phases/phase5-validation.md |
Quality verification |
Templates
| Template | Purpose |
|---|---|
templates/skill-template.md |
Main SKILL.md structure |
templates/phase-template.md |
Phase file structure |
templates/cluster-template.md |
Cluster profile structure |
templates/technique-template.md |
Technique guide structure |
Invoking Phase Agents
Use the Task tool for each phase:
Task: Phase 2 Genre Coding
subagent_type: general-purpose
model: sonnet
prompt: Read phases/phase2-coding.md and execute for [user's project]. Corpus is in [location]. Model skill is [skill name].
Model Recommendations
| Phase | Model | Rationale |
|---|---|---|
| Phase 0: Scope | Sonnet | Planning, structural decisions |
| Phase 1: Immersion | Sonnet | Counting, statistics |
| Phase 2: Coding | Sonnet | Systematic processing |
| Phase 3: Interpretation | Opus | Pattern recognition, cluster naming |
| Phase 4: Generation | Opus | Template adaptation, prose quality |
| Phase 5: Validation | Sonnet | Verification, checking |
Starting the Process
When the user is ready to begin:
-
Ask about the target:
“What article section do you want to create a writing skill for? (e.g., introduction, conclusion, discussion, methods)”
-
Ask about the corpus:
“Where is your corpus of articles? How many articles do you have?”
-
Ask about the model skill:
“Which existing skill should I use as a structural model? Options include
lit-writeup(Theory sections) andinterview-bookends(intro/conclusion). I can also review other skills if you prefer.” -
Ask about output:
“What should the new skill be named? (e.g.,
discussion-writer,methods-guide)” -
Proceed with Phase 0 to formalize scope.
Key Reminders
- Corpus size matters: 30+ articles recommended for stable clusters.
- Variation is the goal: A homogeneous corpus won’t reveal clusters.
- Human judgment required: Cluster boundaries and names need user input.
- Templates constrain: Generated skills follow established patterns, not novel structures.
- Test the output: The best validation is using the generated skill.
- Iteration expected: First-pass clusters often need refinement.