陌讯 Skills Aggregation Platform

deepeval-testing

📁 vanman2024/ai-dev-marketplace 📅 2 days ago

Total installs: 3

Weekly installs: 1

Site-wide rank: #55566

Install command

npx skills add https://github.com/vanman2024/ai-dev-marketplace --skill deepeval-testing

Installs by agent

amp 1

opencode 1

kimi-cli 1

codex 1

github-copilot 1

claude-code 1

Skill documentation

DeepEval Testing

Skill for pytest-style LLM evaluation with DeepEval.

Overview

DeepEval provides:

pytest-compatible LLM tests
Built-in metrics (faithfulness, relevance, toxicity)
Custom metric creation
Async test execution

Use When

This skill is automatically invoked when:

Creating pytest-style LLM tests
Evaluating RAG quality
Measuring faithfulness/relevance
Building custom metrics
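
For the custom-metric case, DeepEval's documented pattern is to subclass `BaseMetric` and implement `measure`, `a_measure`, `is_successful`, and a `__name__` property. The sketch below follows that pattern with a deterministic check so it needs no LLM judge. `MaxLengthMetric` and its `max_chars` parameter are hypothetical names invented for illustration, and the `ImportError` fallback exists only so the sketch runs where DeepEval is not installed.

```python
from dataclasses import dataclass

try:
    from deepeval.metrics import BaseMetric
    from deepeval.test_case import LLMTestCase
except ImportError:
    # Stand-ins so the sketch still runs without DeepEval installed.
    BaseMetric = object

    @dataclass
    class LLMTestCase:
        input: str
        actual_output: str


class MaxLengthMetric(BaseMetric):
    """Hypothetical deterministic metric: passes when the output is short enough."""

    def __init__(self, max_chars: int = 200):
        self.max_chars = max_chars
        self.threshold = 1.0  # the score must reach 1.0 to pass
        self.score = None
        self.success = None
        self.reason = None

    def measure(self, test_case, *args, **kwargs) -> float:
        length = len(test_case.actual_output)
        self.score = 1.0 if length <= self.max_chars else 0.0
        self.success = self.score >= self.threshold
        self.reason = f"{length} chars (limit {self.max_chars})"
        return self.score

    async def a_measure(self, test_case, *args, **kwargs) -> float:
        # No I/O is involved, so the async path reuses the sync logic.
        return self.measure(test_case)

    def is_successful(self) -> bool:
        return bool(self.success)

    @property
    def __name__(self):
        return "Max Length"
```

An instance can then be passed in a metrics list alongside built-in metrics, for example `assert_test(case, [MaxLengthMetric(max_chars=300)])`.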

Available Scripts

Script	Description
`scripts/init-deepeval.sh`	Initialize DeepEval project
`scripts/run-tests.sh`	Run DeepEval tests

Available Templates

Template	Description
`templates/conftest.py`	pytest configuration
`templates/test_basic.py`	Basic test structure
`templates/test_rag.py`	RAG evaluation tests

Built-in Metrics

AnswerRelevancyMetric
FaithfulnessMetric
ContextualRelevancyMetric
HallucinationMetric
ToxicityMetric
BiasMetric
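
The metrics above differ in which test-case fields they read and in how the threshold is interpreted. The lookup table below is a plain-Python summary based on my reading of the DeepEval documentation, not on this skill's files, so it should be checked against the installed version. `METRICS`, `passes`, and `missing_fields` are hypothetical helper names, and 0.5 is assumed as the default threshold.

```python
# Assumed from the DeepEval docs; verify against your installed version.
# direction "min": pass when score >= threshold (higher is better).
# direction "max": pass when score <= threshold (lower is better).
METRICS = {
    "AnswerRelevancyMetric": {
        "needs": ("input", "actual_output"), "direction": "min"},
    "FaithfulnessMetric": {
        "needs": ("input", "actual_output", "retrieval_context"), "direction": "min"},
    "ContextualRelevancyMetric": {
        "needs": ("input", "actual_output", "retrieval_context"), "direction": "min"},
    "HallucinationMetric": {
        "needs": ("input", "actual_output", "context"), "direction": "max"},
    "ToxicityMetric": {
        "needs": ("input", "actual_output"), "direction": "max"},
    "BiasMetric": {
        "needs": ("input", "actual_output"), "direction": "max"},
}


def passes(metric: str, score: float, threshold: float = 0.5) -> bool:
    """Apply the threshold in the direction assumed for this metric."""
    if METRICS[metric]["direction"] == "min":
        return score >= threshold
    return score <= threshold


def missing_fields(metric: str, case: dict) -> list:
    """List the fields a test case would still need for this metric."""
    return [name for name in METRICS[metric]["needs"] if name not in case]
```

For example, `missing_fields("HallucinationMetric", {"input": "q", "actual_output": "a"})` reports that `context` is absent, which is the kind of mismatch worth catching before spending judge-model calls.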
