mlflow

📁 g1joshi/agent-skills 📅 3 days ago

总安装量

周安装量

#49699

全站排名

安装命令

npx skills add https://github.com/g1joshi/agent-skills --skill mlflow

Agent 安装分布

mcpjam 1

claude-code 1

replit 1

junie 1

zencoder 1

MLflow is the standard for tracking experiments. v3.0 (2025) pivots to GenAI, adding LLM Tracing, Prompt Management, and “LLM-as-a-Judge”.

Experiment Tracking: Logging hyperparameters (lr=0.01) and metrics (accuracy=0.98).
GenAI Tracing: Visualizing the full chain of a RAG application.
Model Registry: Versioning models (my-model/v3) for deployment.

Where logs are stored (local ./mlruns or remote http://mlflow-server).

mlflow.autolog() automatically captures params from Scikit-learn, PyTorch, etc.

OpenTelemetry-based tracing to debug prompt chains.

Do:

Use mlflow.evaluate(): To run “LLM-as-a-Judge” metrics on your RAG pipeline.
Use Prompt Engineering UI: MLflow 3.0 has a UI to iterate on prompts.

Don’t:

Don’t use it for data storage: Log artifacts (models), not datasets. Log metadata about datasets instead.