troubleshooting
2
总安装量
1
周安装量
#69043
全站排名
安装命令
npx skills add https://github.com/jediv/dataiku-chat-control --skill troubleshooting
Agent 安装分布
amp
1
cline
1
opencode
1
cursor
1
kimi-cli
1
codex
1
Skill 文档
Dataiku Troubleshooting Guide
Debugging Checklist
- Environment activated?
which pythonshould show dataiku-env - Variables set?
echo $DSS_URL - Can connect? Run
scripts/bootstrap.py - Recipe saved? Check for
settings.save() - Job ran? Check for
recipe.run() - Job succeeded? Check
job.get_status() - Schema correct? Run
autodetect_settings()
Top-10 Error Quick Reference
| Error | Cause | Solution |
|---|---|---|
Connection refused |
Wrong DSS_URL or instance down | Verify URL, check instance status |
401 Unauthorized |
Invalid or expired API key | Regenerate key in Dataiku UI |
Project not found |
Wrong project key or no access | client.list_project_keys() to verify |
| Settings not saved | Missing settings.save() |
Always call settings.save() after changes |
| Recipe ran but no data | Filter/join removed all rows | Check inputs, join keys, filters |
| Job failed | Schema mismatch, missing inputs | Inspect job status and logs |
invalid identifier (quoted) |
Lowercase column names in SQL schema | Normalize schema to UPPERCASE |
table does not exist |
Upstream dataset not built | Build datasets in dependency order |
Insert value list mismatch |
Output schema doesn’t match recipe output | Run recipe.compute_schema_updates() and apply |
ModuleNotFoundError: dataikuapi |
Virtual environment not activated | source ~/dataiku-env/bin/activate |
Job Failure Investigation Pattern
# Get the most recent job and extract error details
jobs = project.list_jobs()
job = project.get_job(jobs[0]['def']['id'])
status = job.get_status()
state = status.get("baseStatus", {}).get("state") # "DONE" or "FAILED"
if state == "FAILED":
activities = status.get("baseStatus", {}).get("activities", {})
for name, info in activities.items():
if info.get("firstFailure"):
print(f"Error: {info['firstFailure'].get('message')}")
# Or get full log
print(job.get_log())
Important:
recipe.run()already waits for completion internally. Userecipe.run(no_fail=True)to prevent exceptions on failure, then inspect the returned job object.
Detailed Error References
For full details on each error category including causes, code examples, and solutions:
- references/connection-errors.md â Connection refused, 401 Unauthorized, Project not found
- references/recipe-errors.md â Settings not saved, empty output, job failures, job API usage patterns
- references/sql-errors.md â Invalid identifier (quoted/general), table does not exist, pre-join computed columns, insert value list mismatch
- references/environment-errors.md â ModuleNotFoundError, missing env vars, getting more help
Scripts
- scripts/debug_job.py â Standalone script to debug the most recent failed job