eval-guidance-actionability

📁 whitespectre/ai-assistant-evals 📅 8 days ago

Site rank: #57867

Install command

npx skills add https://github.com/whitespectre/ai-assistant-evals --skill eval-guidance-actionability

Agent install distribution

opencode 2

claude-code 2

cursor 2

mcpjam 1

openhands 1

zencoder 1

Skill documentation

Eval Guidance & Actionability

Use this skill to evaluate whether an assistant response provides clear, usable guidance the user can act on.

Inputs

Require:

The assistant response text to evaluate.
(Optional) The user's request or goal (helps judge whether the guidance matches what's needed).
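
As an illustration of the two inputs, here is a minimal Python sketch of how a caller might package them before invoking the skill. The names `EvalInput` and `render_input` are hypothetical and are not defined by the skill; the plain-text layout is likewise an assumption.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class EvalInput:
    """Hypothetical container for the skill's inputs (not part of the skill itself)."""

    response_text: str                  # required: the assistant response to evaluate
    user_request: Optional[str] = None  # optional: the user's request or goal

    def __post_init__(self) -> None:
        # The response text is the only required input, so reject empty values.
        if not self.response_text.strip():
            raise ValueError("response_text is required and must not be empty")


def render_input(item: EvalInput) -> str:
    """Format the inputs as plain text, omitting the request when it is absent."""
    parts = []
    if item.user_request:
        parts.append("User request:\n" + item.user_request.strip())
    parts.append("Assistant response:\n" + item.response_text.strip())
    return "\n\n".join(parts)
```

Passing the user request when it is available gives the evaluator the context it needs to judge whether the guidance fits the goal; without it, only the response text is rendered.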

Internal Rubric (1-5)

5 = Provides concrete, actionable steps; prioritized; includes key details/constraints; user could execute without guessing
4 = Mostly actionable; minor missing details or ordering, but still usable
3 = Some guidance, but generic; missing important steps/details; requires user to infer next actions
2 = Largely non-actionable; mostly high-level advice; lacks steps or specifics
1 = No usable guidance; purely vague, deflective, or irrelevant to "what to do next"

Workflow

Check whether the response includes specific next actions (steps, checklist, examples, decision points).
Check completeness (missing prerequisites, constraints, caveats).
Score on a 1-5 integer scale using the rubric only.
Write concise rationale tied directly to rubric criteria.
Suggest concrete edits that would make the response more actionable.

Output Contract

Return JSON only. Do not include markdown, backticks, prose, or extra keys.

Use exactly this schema:

{ "dimension": "guidance_actionability", "score": 1, "rationale": "…", "improvement_suggestions": [ "…" ] }
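
For illustration, a result in this shape could be assembled and serialized as in the Python sketch below. The helper name `build_result` and the sample values are hypothetical; only the four keys and the fixed dimension value come from the schema above.

```python
import json
from typing import List


def build_result(score: int, rationale: str, suggestions: List[str]) -> str:
    """Serialize an evaluation result using exactly the four schema keys."""
    result = {
        "dimension": "guidance_actionability",  # fixed value required by the schema
        "score": score,
        "rationale": rationale,
        "improvement_suggestions": suggestions,
    }
    # json.dumps emits plain JSON with straight quotes and no surrounding text.
    return json.dumps(result, ensure_ascii=False)


# Illustrative values only; they are not taken from a real evaluation.
print(build_result(
    3,
    "Gives general direction but omits the order of steps.",
    ["Number the steps and state the prerequisite before step 1."],
))
```

Serializing with a JSON library rather than hand-writing the string avoids typographic quotes, which would make the output invalid JSON.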

Hard Rules

dimension must always equal "guidance_actionability".
score must be an integer from 1 to 5.
rationale must be concise (max 3 sentences).
Do not include step-by-step reasoning.
improvement_suggestions must be a non-empty array of concrete edits.
Never output text outside the JSON object.
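
The mechanically checkable rules above can be enforced by the caller. The following Python sketch is one possible validator, not part of the skill: `validate_output` is a hypothetical name, the sentence count is a rough punctuation-based heuristic, and the rule against step-by-step reasoning is not checked because it cannot be verified reliably in code.

```python
import json
import re
from typing import List

EXPECTED_KEYS = {"dimension", "score", "rationale", "improvement_suggestions"}


def validate_output(raw: str) -> List[str]:
    """Return a list of rule violations; an empty list means the output passes."""
    try:
        # json.loads rejects markdown fences and any text around the object.
        data = json.loads(raw)
    except json.JSONDecodeError:
        return ["output must be a single JSON object with no surrounding text"]
    if not isinstance(data, dict):
        return ["output must be a JSON object"]

    errors = []
    if set(data) != EXPECTED_KEYS:
        errors.append("keys must be exactly: " + ", ".join(sorted(EXPECTED_KEYS)))
    if data.get("dimension") != "guidance_actionability":
        errors.append('dimension must equal "guidance_actionability"')

    score = data.get("score")
    # bool is a subclass of int in Python, so exclude it explicitly.
    if not isinstance(score, int) or isinstance(score, bool) or not 1 <= score <= 5:
        errors.append("score must be an integer from 1 to 5")

    rationale = data.get("rationale")
    if not isinstance(rationale, str) or not rationale.strip():
        errors.append("rationale must be a non-empty string")
    elif len(re.findall(r"[.!?]+(?:\s|$)", rationale.strip())) > 3:
        # Heuristic: counts sentence-ending punctuation; abbreviations may inflate it.
        errors.append("rationale must be at most 3 sentences")

    suggestions = data.get("improvement_suggestions")
    if (
        not isinstance(suggestions, list)
        or not suggestions
        or not all(isinstance(s, str) and s.strip() for s in suggestions)
    ):
        errors.append("improvement_suggestions must be a non-empty array of strings")
    return errors
```

Returning every violation at once, rather than stopping at the first, makes it easier to feed the full list back to the model when retrying.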
