deepeval-testing
3
总安装量
1
周安装量
#55566
全站排名
安装命令
npx skills add https://github.com/vanman2024/ai-dev-marketplace --skill deepeval-testing
Agent 安装分布
amp
1
opencode
1
kimi-cli
1
codex
1
github-copilot
1
claude-code
1
Skill 文档
DeepEval Testing
Skill for pytest-style LLM evaluation with DeepEval.
Overview
DeepEval provides:
- pytest-compatible LLM tests
- Built-in metrics (faithfulness, relevance, toxicity)
- Custom metric creation
- Async test execution
Use When
This skill is automatically invoked when:
- Creating pytest-style LLM tests
- Evaluating RAG quality
- Measuring faithfulness/relevance
- Building custom metrics
Available Scripts
| Script | Description |
|---|---|
scripts/init-deepeval.sh |
Initialize DeepEval project |
scripts/run-tests.sh |
Run DeepEval tests |
Available Templates
| Template | Description |
|---|---|
templates/conftest.py |
pytest configuration |
templates/test_basic.py |
Basic test structure |
templates/test_rag.py |
RAG evaluation tests |
Built-in Metrics
AnswerRelevancyMetricFaithfulnessMetricContextualRelevancyMetricHallucinationMetricToxicityMetricBiasMetric