The DecimalAI CLI provides command-line access to your workspace.
This page covers the most-used commands. The CLI also ships an evaluators group, the full skills lifecycle (status, pull, install, benchmark, push, verify, attest, spotcheck), compat-check, and regression-check (the CI command — see Regression Check). Run decimalai --help or decimalai <group> --help for the complete list.
Command overview
| Command | Group | What it does |
|---|
init | Setup | Verify your API key, test connectivity, and send a test trace. |
demo regression | Demo | Seed the “Your agent changed” demo and run the regression check. |
demo skills | Demo | Seed the “Find skills that work” demo and recompute registry stats. |
demo reset | Demo | Remove all [Demo] -prefixed data from the workspace. |
traces list | Traces | List recent traces. |
traces show | Traces | Show full detail for one trace as JSON. |
traces stats | Traces | Show trace statistics for the workspace. |
traces import | Traces | Import traces from a JSON or JSONL file. |
eval push | Evaluations | Push named evaluation scores to a trace. |
skills list | Skills | List skills in the workspace. |
skills sync | Skills | Sync local SKILL.md files to the platform. |
skills pull | Skills | Read-only download of a public skill (no fork). |
skills install | Skills | Fork a skill and write SKILL.md to disk. |
skills push / benchmark / verify / attest / spotcheck / status | Skills | Full publish + effectiveness lifecycle. |
manifests list | Manifests | List manifests for an agent. |
datasets list | Datasets | List datasets with row counts and version info. |
datasets show | Datasets | Show version history for a dataset. |
datasets pull | Datasets | Pull a dataset version to a local file. |
datasets export | Datasets | Export a dataset version to stdout or a file. |
datasets build | Datasets | Build a new dataset version from traces. |
datasets push-to-hub | Datasets | Push a dataset to HuggingFace Hub. |
replay run | Replay | Execute a replay batch through your updated agent. |
evaluators ... | Evaluators | Manage custom evaluators. |
compat-check | Compatibility | Diff a candidate manifest against a baseline. |
regression-check | CI | The CI command — see Regression Check. |
Installation
The CLI is included with the Python SDK:
Authentication
All commands accept --api-key or the DECIMAL_API_KEY environment variable:
# Via environment variable (recommended)
export DECIMAL_API_KEY="dai_sk_..."
# Or via flag
decimalai traces list --api-key dai_sk_...
Setup
decimalai init
Verify your API key, test connectivity, and send a test trace.
$ export DECIMAL_API_KEY="dai_sk_..."
$ decimalai init
Output:
✓ API key: dai_sk_tes...abc1
✓ Connected to workspace: my-team (scope: workspace)
✓ Test trace sent successfully
See it work in 2 minutes — seeds a demo into your workspace:
decimalai demo regression # what your next agent change would break
decimalai demo skills # the registry, ranked by real effectiveness
→ Open dashboard: https://app.decimal.ai/traces
→ Docs: https://docs.decimal.ai/quickstart
| Option | Default | Description |
|---|
--api-key | $DECIMAL_API_KEY | API key |
--base-url | https://api.decimal.ai | Platform URL |
--no-test-trace | (sends by default) | Skip sending a test trace |
Demo sandbox
One-command guided demos that seed realistic data into your workspace — see the headline workflows before instrumenting anything. All demo rows are prefixed [Demo] and removable with demo reset.
decimalai demo regression
Seed the “Your agent changed” demo: a v1→v2 agent (model swap, tool rename/removal, prompt rewrite) plus a trace corpus, then run the regression check and print a deep link to the impact report.
$ decimalai demo regression
Output:
Demo A — "Your agent changed"
Seeding the demo agent (v1 → v2 + traces)…
agent: [Demo] support-agent · traces: 10 · v1 c7664af1 → v2 1d163437
Running the regression check (v2 vs auto-resolved v1)…
Verdict: high_risk — 2 traces will break. Review before merging.
Traces analyzed: 110 (high 2 / med 108 / low 0)
Open the impact report (keep / repair / replay / drop fan-out):
https://app.decimal.ai/agents/.../impact-reports/<id>
decimalai demo skills
Seed the “Find skills that work” demo: three public skills with deliberately varied effectiveness, plus the stats recompute — so the registry ranks them by real (seeded) data.
decimalai demo reset
Remove all demo data for the workspace (both demos). Exact-prefix matched on [Demo] , so your own agents and skills are never touched.
| Option (regression / skills) | Default | Description |
|---|
--reset / --no-reset | --reset | Wipe existing demo data before seeding, so you always land in a clean state |
--web | (derived from --base-url) | Frontend base URL for the printed links |
Traces
decimalai traces list
List recent traces.
decimalai traces list --limit 20 --status success
| Option | Default | Description |
|---|
--limit | 20 | Max results to return |
--status | all | Filter by success or error |
decimalai traces show
Show full detail for a specific trace as JSON.
decimalai traces show trc_abc123
decimalai traces stats
Show trace statistics for your workspace.
decimalai traces stats --agent-name my-agent
| Option | Default | Description |
|---|
--agent-name | all | Filter by agent name |
decimalai traces import
Import traces from a JSON or JSONL file.
decimalai traces import traces.json
decimalai traces import traces.jsonl --format jsonl
| Option | Default | Description |
|---|
--format | auto | Import format: auto, json, jsonl |
Evaluations
decimalai eval push
Push evaluation scores to a trace.
decimalai eval push trc_abc123 --score quality=0.9 --score relevance=0.85
| Option | Required | Description |
|---|
--score | Yes (repeatable) | Score in name=value format |
--source | No (default: cli) | Source label for the scores |
Skills
decimalai skills list
List all skills in your workspace.
decimalai skills list --limit 50
decimalai skills sync
Sync local SKILL.md files to the platform.
decimalai skills sync ./agents/skills
The skills directory is a positional argument (default ./skills).
| Option | Default | Description |
|---|
SKILLS_DIR (positional) | ./skills | Path to the skills directory |
--dry-run | off | Walk and hash but don’t POST — preview what would change |
--apply-pulls / --no-apply-pulls | --apply-pulls | Write backend-newer skills back to disk |
Manifests
decimalai manifests list
List manifests for an agent.
decimalai manifests list --agent-name my-agent
Datasets
decimalai datasets list
List all datasets in the workspace with row counts and version info.
decimalai datasets show
Show version history for a specific dataset.
decimalai datasets show ds_abc123
decimalai datasets pull
Pull a dataset version to a local file. The primary way to get training data onto disk.
# Pull latest version
decimalai datasets pull ds_abc123 -o ./training_data.jsonl
# Pull specific version
decimalai datasets pull ds_abc123 -o ./data.jsonl --version v2
# Pull as Parquet
decimalai datasets pull ds_abc123 -o ./data.parquet
| Option | Default | Description |
|---|
--output / -o | (required) | Output file path. Format inferred from extension. |
--version / -v | latest | Version: latest, v3, 3, or UUID |
--format | auto | Override: jsonl or parquet |
decimalai datasets export
Export a dataset version to stdout (for piping) or a file.
# Export to stdout (for piping)
decimalai datasets export ds_abc123 | head -5
# Export specific version to file
decimalai datasets export ds_abc123 --version v2 -o training.jsonl
| Option | Default | Description |
|---|
--version / -v | latest | Version specifier |
--format | jsonl | Format: jsonl or parquet |
--output / -o | stdout | Output file path |
decimalai datasets build
Build a new dataset version from traces.
decimalai datasets build ds_abc123 --verdict keep
| Option | Default | Description |
|---|
--verdict | all | Filter by eval verdict: keep, repair |
decimalai datasets push-to-hub
Push a dataset to HuggingFace Hub, making it loadable by Axolotl, Unsloth, TRL, and any tool supporting load_dataset().
# Push latest version (private repo)
decimalai datasets push-to-hub ds_abc123 my-org/support-agent-sft
# Push specific version as public
decimalai datasets push-to-hub ds_abc123 my-org/my-dataset --version v2 --public
| Argument / Option | Default | Description |
|---|
DATASET_ID | (required) | The dataset ID |
REPO_ID | (required) | HuggingFace repo: org/dataset-name |
--version / -v | latest | Version specifier |
--token | $HF_TOKEN | HuggingFace API token |
--public / --private | --private | Repo visibility |
--split | train | Dataset split name |
After pushing, use in training:
# Python (TRL, Unsloth)
from datasets import load_dataset
ds = load_dataset("my-org/support-agent-sft")
# Axolotl config
datasets:
- path: my-org/support-agent-sft
type: chat_template
Requires pip install huggingface_hub datasets.
Replay
decimalai replay run
Execute a replay batch — re-run stale prompts through your updated agent.
decimalai replay run batch_abc123 --agent-fn my_app.agent:run
| Argument / Option | Required | Description |
|---|
BATCH_ID | Yes | The replay batch to execute |
--agent-fn | Yes | Python path to your agent function (module:function) |
--dry-run | No | Show what would be done without executing |
--no-skip-failed | No | Stop on first failure instead of continuing |
Output:
✓ Loaded agent function: my_app.agent:run
✓ [1/50] abc12345: completed
✓ [2/50] def67890: completed
✗ [3/50] ghi11111: failed
...
========================================
Replay Summary
========================================
Total: 50
Completed: 48
Passed: 45
Failed: 3
Skipped: 2
Pass rate: 94%
The CLI exits with code 1 if any replays fail, making it suitable for CI/CD pipelines.
Global Options
All commands accept these options:
| Option | Default | Description |
|---|
--api-key | $DECIMAL_API_KEY | API key |
--base-url | https://api.decimal.ai | Platform URL |
--project | default | Project name |
Environment Variables
| Variable | Description |
|---|
DECIMAL_API_KEY | API key (used by all commands) |
DECIMALAI_API_KEY | Alias (also accepted) |
DECIMAL_BASE_URL | Platform API URL |
DECIMAL_AUTO_TRACE | Auto-instrument on import. Accepts any framework or provider flag name: langchain, openai-agents, adk, llamaindex, crewai, autogen, otel, or a direct provider SDK (openai, anthropic, google). |