data-analytics-engineering
38
总安装量
38
周安装量
#5416
全站排名
安装命令
npx skills add https://github.com/vasilyu1983/ai-agents-public --skill data-analytics-engineering
Agent 安装分布
claude-code
31
cursor
25
gemini-cli
24
codex
24
opencode
23
Skill 文档
Data Analytics Engineering
Scope
- Define metrics, grains, and dimensional models.
- Build transformation layers and semantic models.
- Implement data quality tests and observability.
- Document datasets, lineage, and ownership.
- Align analytics outputs with BI and product needs.
Ask For Inputs
- Business metrics and decision use cases.
- Source systems, data freshness, and latency needs.
- Existing warehouse, tooling, and orchestration.
- Expected data volumes and change cadence.
- Governance requirements and access controls.
Workflow
- Define metric dictionary and grains.
- Design staging, intermediate, and mart layers.
- Model dimensions and facts with clear keys.
- Build semantic layer and metric definitions.
- Add tests for freshness, nulls, ranges, and duplicates.
- Document lineage, owners, and SLAs.
- Plan rollout, backfills, and validation checks.
Outputs
- Metric dictionary and semantic model.
- Data model with schema and grain definitions.
- Transformation plan and dbt or SQLMesh structure.
- Data quality test suite and alerting plan.
- Documentation and ownership map.
Quality Checks
- Keep metric definitions stable and versioned.
- Treat metrics as APIs: document changes, deprecate safely, and backfill deliberately.
- Define data contracts for core tables (schema, freshness, keys) to control downstream breakage.
- Avoid mixed grains in a single model.
- Ensure tests cover critical joins and aggregates.
- Validate against source of truth and historical baselines.
Templates
assets/metric-dictionary.mdfor metric definitions and owners.assets/semantic-layer-spec.mdfor entities, measures, and dimensions.assets/data-quality-test-plan.mdfor test coverage planning.
Resources
references/modeling-patterns.mdfor modeling guidance and data quality patterns.references/tool-comparison-2026.mdfor dbt vs SQLMesh vs Coalesce decision matrix.data/sources.jsonfor curated vendor docs and trend-tracking sources (use as a WebSearch seed list).
Related Skills
- Use data-lake-platform for platform architecture.
- Use data-sql-optimization for query tuning.
- Use ai-ml-data-science for modeling and experiments.
Trend Awareness Protocol
IMPORTANT: When users ask recommendation questions about analytics engineering, data modeling, or BI, you MUST use WebSearch to check current trends before answering. If WebSearch is unavailable, use data/sources.json + web browsing and state what you verified vs assumed.
Trigger Conditions
- “What’s the best tool for [analytics engineering/data modeling/BI]?”
- “What should I use for [transformation/semantic layer/metrics]?”
- “What’s the latest in analytics engineering?”
- “Current best practices for [dbt/metrics layers/data quality]?”
- “Is [tool/approach] still relevant in 2026?”
- “[dbt] vs [SQLMesh] vs [other]?”
- “Best BI tool for [use case]?”
- “SQLMesh acquisition” or “Fivetran transformation”
- “Agentic analytics” or “AI data workflows”
- “Metric debt” or “metric governance”
Required Searches
- Search:
"analytics engineering best practices 2026" - Search:
"[dbt/SQLMesh/semantic layer] vs alternatives 2026" - Search:
"analytics engineering trends January 2026" - Search:
"[specific tool] new releases 2026" - Search:
"agentic analytics AI data 2026"(for AI-related queries)
What to Report
After searching, provide:
- Current landscape: What analytics tools/patterns are popular NOW
- Emerging trends: New tools, patterns, or standards gaining traction
- Deprecated/declining: Tools/approaches losing relevance or support
- Recommendation: Based on fresh data, not just static knowledge
Example Topics (verify with fresh search)
- Transformation tools (dbt, SQLMesh, Coalesce)
- Semantic layers (dbt Semantic Layer, Cube, AtScale, warehouse-native)
- Metrics stores and headless BI
- Data quality tools (dbt tests, Elementary, dbt-expectations/Metaplane)
- BI platforms (Metabase, Superset, Lightdash, Hex)
- Data modeling patterns (dimensional, wide tables, activity schema)
- Analytics engineering workflows and CI/CD
- Agentic AI workflows for analytics
- Data mesh and domain-owned data products