mlops validation

📁 fmind/mlops-python-package 📅 Jan 1, 1970

Install command

npx skills add https://github.com/fmind/mlops-python-package --skill 'MLOps Validation'

Skill documentation

MLOps Validation

Goal

To ensure software quality, reliability, and security through automated validation layers. This skill enforces Strict Typing (ty), Unified Linting (ruff), Comprehensive Testing (pytest), and Structured Logging (loguru).

Prerequisites

Language: Python
Manager: uv
Context: Ensuring code quality before merge/deploy.

Instructions

1. Static Analysis (Typing & Linting)

Catch errors before they run.

Typing:
- Tool: ty.
- Rule: No Any (unless absolutely necessary). Fully typed function signatures.
- DataFrames: Use pandera schemas to validate DataFrame structures/types.
- Classes: Use pydantic for data modeling and runtime validation.
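
To illustrate the pydantic rule above, here is a minimal sketch that validates a job configuration at the boundary of a fully typed function. `TrainingConfig`, its fields, and `load_config` are hypothetical names chosen for illustration, and the example assumes pydantic v2.

```python
from pydantic import BaseModel, Field, ValidationError


class TrainingConfig(BaseModel):
    """Hypothetical job configuration, validated at runtime."""

    learning_rate: float = Field(gt=0)  # must be strictly positive
    n_estimators: int = Field(default=100, ge=1)  # at least one estimator
    target: str


def load_config(raw: dict[str, object]) -> TrainingConfig:
    """Fully typed signature: raw mapping in, validated model out."""
    return TrainingConfig.model_validate(raw)


# Valid input: the missing field falls back to its default.
config = load_config({"learning_rate": 0.1, "target": "price"})

# Invalid input: a negative learning rate is rejected before any work starts.
error_count = 0
try:
    load_config({"learning_rate": -1.0, "target": "price"})
except ValidationError as error:
    error_count = error.error_count()
```

Validating once at the edge lets the rest of the code rely on `TrainingConfig` attributes without re-checking them.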
Linting & Formatting:
- Tool: ruff (replaces black, isort, and flake8, and covers many pylint rules).
- Rule: Zero tolerance for linter errors. Use noqa sparingly and with justification.
- Config: Centralize in pyproject.toml.
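
One possible way to apply the noqa rule above: suppress a single named rule and state the reason next to it. The sketch assumes the flake8-print rules (`T20`) are enabled in the ruff configuration; `main` is a hypothetical entry point.

```python
def main() -> int:
    """Hypothetical CLI entry point that reports a result on stdout."""
    # Justification: printing is the intended behavior of this command,
    # so the print rule is suppressed on this line only.
    print("validation complete")  # noqa: T201
    return 0


exit_code = main()
```

A bare `# noqa` without a rule code would silence every rule on the line, which is harder to review.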

2. Testing Strategy

Verify behavior and prevent regressions.

Tool: pytest.
Structure: Mirror src/ in tests/.
```
src/pkg/mod.py -> tests/test_mod.py
```
Fixtures: Use tests/conftest.py for shared setup (mock data, temp paths).
Coverage: Aim for high coverage (>80%) on core business logic. Use pytest-cov.
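
A minimal sketch of what a shared `tests/conftest.py` might contain. The fixture names and the mock rows are assumptions made for illustration; `tmp_path` is pytest's built-in temporary directory fixture.

```python
from pathlib import Path

import pytest


def make_input_data() -> list[dict[str, float]]:
    """Build a small, deterministic mock dataset (hypothetical columns)."""
    return [
        {"feature": 1.0, "target": 0.0},
        {"feature": 2.0, "target": 1.0},
    ]


@pytest.fixture
def input_data() -> list[dict[str, float]]:
    """Mock data shared by every test that requests `input_data`."""
    return make_input_data()


@pytest.fixture
def output_path(tmp_path: Path) -> Path:
    """A per-test output location under pytest's temporary directory."""
    return tmp_path / "outputs"
```

Keeping the data builder as a plain function makes it reusable outside of fixtures, for example in parametrized tests.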

Pattern: Use Given-When-Then in comments.

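def test_pipeline_execution(input_data):
    # Given: Valid input data (supplied by the input_data fixture)
    # When: The pipeline processes the data
    output = run_pipeline(input_data)  # run_pipeline is a hypothetical function under test
    # Then: The output content matches expectations
    assert output is not None  # replace with assertions on the expected content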

3. Structured Logging

Enable observability and debugging.

Tool: loguru (replacing stdlib logging).
Format: Use structured logging (JSON) in production for queryability.
Levels:
- DEBUG: Low-level tracing (payloads, internal state).
- INFO: Key business events (Job started, Model saved).
- ERROR: Actionable failures (with stack traces).
Context: Include context (Job ID, Model Version) in logs.

4. Security

Protect the supply chain and runtime.

Dependencies: Use GitHub Dependabot to patch vulnerable packages.
Code Scanning: Run bandit to detect hardcoded secrets or unsafe patterns (e.g., eval, yaml.load).
Secrets: NEVER log secrets. Sanitize outputs.
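
As a rough sketch of the last two points, the helpers below mask secret-looking values before text is logged and parse literals without `eval`. The key list and function names are assumptions for illustration; a real project would tune the pattern to its own secret formats.

```python
import ast
import re

# Hypothetical list of key names whose values should never reach the logs.
SENSITIVE_KEYS = ("password", "token", "secret", "api_key")

_SECRET_PATTERN = re.compile(
    r"(" + "|".join(SENSITIVE_KEYS) + r")(\s*[=:]\s*)(\S+)",
    re.IGNORECASE,
)


def sanitize(text: str) -> str:
    """Replace the value of any sensitive key=value or key: value pair."""
    return _SECRET_PATTERN.sub(r"\1\2***", text)


def parse_literal(text: str) -> object:
    """Parse a Python literal without executing code, unlike eval()."""
    return ast.literal_eval(text)


safe_line = sanitize("connecting with token=abc123 user=alice")
values = parse_literal("[1, 2, 3]")
```

`ast.literal_eval` raises `ValueError` on anything other than a literal, so a string such as a function call is rejected instead of executed.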

Self-Correction Checklist

Type Safety: Does ty pass without errors?
Lint Cleanliness: Does ruff check pass?
Test Discovery: Does pytest discover the tests in tests/ and import the modules under src/?
Log Format: Are production logs serializing to JSON?
Security: Has bandit scanned the codebase?
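
The command-checkable items in this checklist could be automated roughly as follows. The exact commands are assumptions about how the tools are invoked through uv in a given project (for example, that the code lives under `src`); the log format item still needs a manual or integration check.

```python
import subprocess
from collections.abc import Callable

# Assumed invocations; adjust paths and flags to the project layout.
CHECKS: dict[str, list[str]] = {
    "Type Safety": ["uv", "run", "ty", "check"],
    "Lint Cleanliness": ["uv", "run", "ruff", "check"],
    "Test Discovery": ["uv", "run", "pytest", "--collect-only", "-q"],
    "Security": ["uv", "run", "bandit", "-r", "src"],
}


def run_command(command: list[str]) -> int:
    """Run one command and return its exit code."""
    return subprocess.run(command, check=False).returncode


def run_checks(run: Callable[[list[str]], int] = run_command) -> dict[str, bool]:
    """Map each checklist item to True when its command exits with 0."""
    return {name: run(command) == 0 for name, command in CHECKS.items()}
```

Calling `run_checks()` from a project root returns one boolean per item; passing a different `run` callable makes the function testable without the tools installed.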
