deepeval-testing

📁 vanman2024/ai-dev-marketplace 📅 2 days ago
3
总安装量
1
周安装量
#55566
全站排名
安装命令
npx skills add https://github.com/vanman2024/ai-dev-marketplace --skill deepeval-testing

Agent 安装分布

amp 1
opencode 1
kimi-cli 1
codex 1
github-copilot 1
claude-code 1

Skill 文档

DeepEval Testing

Skill for pytest-style LLM evaluation with DeepEval.

Overview

DeepEval provides:

  • pytest-compatible LLM tests
  • Built-in metrics (faithfulness, relevance, toxicity)
  • Custom metric creation
  • Async test execution

Use When

This skill is automatically invoked when:

  • Creating pytest-style LLM tests
  • Evaluating RAG quality
  • Measuring faithfulness/relevance
  • Building custom metrics

Available Scripts

Script Description
scripts/init-deepeval.sh Initialize DeepEval project
scripts/run-tests.sh Run DeepEval tests

Available Templates

Template Description
templates/conftest.py pytest configuration
templates/test_basic.py Basic test structure
templates/test_rag.py RAG evaluation tests

Built-in Metrics

  • AnswerRelevancyMetric
  • FaithfulnessMetric
  • ContextualRelevancyMetric
  • HallucinationMetric
  • ToxicityMetric
  • BiasMetric