numerai-research

📁 numerai/example-scripts 📅 11 days ago

总安装量

周安装量

#54277

全站排名

安装命令

npx skills add https://github.com/numerai/example-scripts --skill numerai-research

Agent 安装分布

cursor 2

claude-code 2

codex 1

Skill 文档

Numerai Research

Overview

This skill is a âmeta-workflowâ that sequences existing Numerai skills so research requests reliably produce: (1) runnable configs, (2) executed experiments, (3) a full written report + plots, and (4) a deployable pickle when requested.

Workflow (always follow this order)

1) Design the experiment (use numerai-experiment-design)

Follow the numerai-experiment-design skill to:
- clarify the idea (or run quick scout interpretations if ambiguous)
- choose baseline + feature set alignment (default ender20 baseline)
- create an experiment folder under numerai/agents/experiments/<experiment_name>/
- write configs in configs/
- run training via PYTHONPATH=numerai python3 -m agents.code.modeling --config <config> --output-dir <experiment_dir>
- track metrics with BMC as primary (bmc_mean, bmc_last_200_eras)
- iterate in rounds (typically 4â5 configs per round), and keep going until you hit a plateau (per the experiment-design skill)
- scale winners (bigger feature set and/or full data) before finalizing the best model

2) Implement new model types if needed (use numerai-model-implementation)

Only if the idea requires new code (new model wrapper, new fit/predict behavior, etc.):

Follow the numerai-model-implementation skill to add the model type and register it.
Add at least one smoke-test config and verify the pipeline runs.

3) Report the research (use report-research)

After you have iterated through multiple rounds and stopped finding improvements (plateau), and after any confirmatory scale runs:

Follow the report-research skill to:
- write a full experiment.md (abstract + methods + results + decisions + next steps)
- generate the standard show_experiment plot(s)
- link plots and artifacts in the report

4) Package and upload (use numerai-model-upload)

If (and only if) the user wants deployment:

Follow the numerai-model-upload skill to create a Numerai-compatible pickle and upload it via the Numerai MCP.
Remember: only Classic (tournament 8) supports pickle uploads.

Defaults (unless user specifies otherwise)

Scout first on downsampled data; scale only winners.
Run experiments in rounds (4â5 configs per round) and stop only after a plateau + confirmatory scale step.
Benchmark reference: v52_lgbm_ender20.
Always record corr + BMC metrics and include the standard plot in the report.

GitHub 仓库 ↗ ← 返回陌讯 Skills 聚合平台