troubleshooting

📁 jediv/dataiku-chat-control 📅 Today

总安装量

周安装量

#69043

全站排名

安装命令

npx skills add https://github.com/jediv/dataiku-chat-control --skill troubleshooting

Agent 安装分布

amp 1

cline 1

opencode 1

cursor 1

kimi-cli 1

codex 1

Skill 文档

Dataiku Troubleshooting Guide

Debugging Checklist

Environment activated? which python should show dataiku-env
Variables set? echo $DSS_URL
Can connect? Run scripts/bootstrap.py
Recipe saved? Check for settings.save()
Job ran? Check for recipe.run()
Job succeeded? Check job.get_status()
Schema correct? Run autodetect_settings()

Top-10 Error Quick Reference

Error	Cause	Solution
`Connection refused`	Wrong DSS_URL or instance down	Verify URL, check instance status
`401 Unauthorized`	Invalid or expired API key	Regenerate key in Dataiku UI
`Project not found`	Wrong project key or no access	`client.list_project_keys()` to verify
Settings not saved	Missing `settings.save()`	Always call `settings.save()` after changes
Recipe ran but no data	Filter/join removed all rows	Check inputs, join keys, filters
Job failed	Schema mismatch, missing inputs	Inspect job status and logs
`invalid identifier` (quoted)	Lowercase column names in SQL schema	Normalize schema to UPPERCASE
`table does not exist`	Upstream dataset not built	Build datasets in dependency order
`Insert value list mismatch`	Output schema doesn’t match recipe output	Run `recipe.compute_schema_updates()` and apply
`ModuleNotFoundError: dataikuapi`	Virtual environment not activated	`source ~/dataiku-env/bin/activate`

Job Failure Investigation Pattern

# Get the most recent job and extract error details
jobs = project.list_jobs()
job = project.get_job(jobs[0]['def']['id'])
status = job.get_status()
state = status.get("baseStatus", {}).get("state")  # "DONE" or "FAILED"

if state == "FAILED":
    activities = status.get("baseStatus", {}).get("activities", {})
    for name, info in activities.items():
        if info.get("firstFailure"):
            print(f"Error: {info['firstFailure'].get('message')}")

    # Or get full log
    print(job.get_log())

Important: recipe.run() already waits for completion internally. Use recipe.run(no_fail=True) to prevent exceptions on failure, then inspect the returned job object.

Detailed Error References

For full details on each error category including causes, code examples, and solutions:

references/connection-errors.md â Connection refused, 401 Unauthorized, Project not found
references/recipe-errors.md â Settings not saved, empty output, job failures, job API usage patterns
references/sql-errors.md â Invalid identifier (quoted/general), table does not exist, pre-join computed columns, insert value list mismatch
references/environment-errors.md â ModuleNotFoundError, missing env vars, getting more help

Scripts

scripts/debug_job.py â Standalone script to debug the most recent failed job

GitHub 仓库 ↗ ← 返回陌讯 Skills 聚合平台