Skip to main content
The DecimalAI CLI provides command-line access to your workspace.
This page covers the most-used commands. The CLI also ships an evaluators group, the full skills lifecycle (status, pull, install, benchmark, push, verify, attest, spotcheck), compat-check, and regression-check (the CI command — see Regression Check). Run decimalai --help or decimalai <group> --help for the complete list.

Command overview

CommandGroupWhat it does
initSetupVerify your API key, test connectivity, and send a test trace.
demo regressionDemoSeed the “Your agent changed” demo and run the regression check.
demo skillsDemoSeed the “Find skills that work” demo and recompute registry stats.
demo resetDemoRemove all [Demo] -prefixed data from the workspace.
traces listTracesList recent traces.
traces showTracesShow full detail for one trace as JSON.
traces statsTracesShow trace statistics for the workspace.
traces importTracesImport traces from a JSON or JSONL file.
eval pushEvaluationsPush named evaluation scores to a trace.
skills listSkillsList skills in the workspace.
skills syncSkillsSync local SKILL.md files to the platform.
skills pullSkillsRead-only download of a public skill (no fork).
skills installSkillsFork a skill and write SKILL.md to disk.
skills push / benchmark / verify / attest / spotcheck / statusSkillsFull publish + effectiveness lifecycle.
manifests listManifestsList manifests for an agent.
datasets listDatasetsList datasets with row counts and version info.
datasets showDatasetsShow version history for a dataset.
datasets pullDatasetsPull a dataset version to a local file.
datasets exportDatasetsExport a dataset version to stdout or a file.
datasets buildDatasetsBuild a new dataset version from traces.
datasets push-to-hubDatasetsPush a dataset to HuggingFace Hub.
replay runReplayExecute a replay batch through your updated agent.
evaluators ...EvaluatorsManage custom evaluators.
compat-checkCompatibilityDiff a candidate manifest against a baseline.
regression-checkCIThe CI command — see Regression Check.

Installation

The CLI is included with the Python SDK:
pip install decimalai

Authentication

All commands accept --api-key or the DECIMAL_API_KEY environment variable:
# Via environment variable (recommended)
export DECIMAL_API_KEY="dai_sk_..."

# Or via flag
decimalai traces list --api-key dai_sk_...

Setup

decimalai init

Verify your API key, test connectivity, and send a test trace.
$ export DECIMAL_API_KEY="dai_sk_..."
$ decimalai init
Output:
  ✓ API key: dai_sk_tes...abc1
  ✓ Connected to workspace: my-team (scope: workspace)
  ✓ Test trace sent successfully

  See it work in 2 minutes — seeds a demo into your workspace:
    decimalai demo regression   # what your next agent change would break
    decimalai demo skills       # the registry, ranked by real effectiveness

  → Open dashboard: https://app.decimal.ai/traces
  → Docs: https://docs.decimal.ai/quickstart
OptionDefaultDescription
--api-key$DECIMAL_API_KEYAPI key
--base-urlhttps://api.decimal.aiPlatform URL
--no-test-trace(sends by default)Skip sending a test trace

Demo sandbox

One-command guided demos that seed realistic data into your workspace — see the headline workflows before instrumenting anything. All demo rows are prefixed [Demo] and removable with demo reset.

decimalai demo regression

Seed the “Your agent changed” demo: a v1→v2 agent (model swap, tool rename/removal, prompt rewrite) plus a trace corpus, then run the regression check and print a deep link to the impact report.
$ decimalai demo regression
Output:
  Demo A — "Your agent changed"
  Seeding the demo agent (v1 → v2 + traces)…
    agent: [Demo] support-agent  ·  traces: 10  ·  v1 c7664af1 → v2 1d163437
  Running the regression check (v2 vs auto-resolved v1)…

  Verdict: high_risk — 2 traces will break. Review before merging.
  Traces analyzed: 110 (high 2 / med 108 / low 0)

  Open the impact report (keep / repair / replay / drop fan-out):
    https://app.decimal.ai/agents/.../impact-reports/<id>

decimalai demo skills

Seed the “Find skills that work” demo: three public skills with deliberately varied effectiveness, plus the stats recompute — so the registry ranks them by real (seeded) data.
$ decimalai demo skills

decimalai demo reset

Remove all demo data for the workspace (both demos). Exact-prefix matched on [Demo] , so your own agents and skills are never touched.
$ decimalai demo reset
Option (regression / skills)DefaultDescription
--reset / --no-reset--resetWipe existing demo data before seeding, so you always land in a clean state
--web(derived from --base-url)Frontend base URL for the printed links

Traces

decimalai traces list

List recent traces.
decimalai traces list --limit 20 --status success
OptionDefaultDescription
--limit20Max results to return
--statusallFilter by success or error

decimalai traces show

Show full detail for a specific trace as JSON.
decimalai traces show trc_abc123

decimalai traces stats

Show trace statistics for your workspace.
decimalai traces stats --agent-name my-agent
OptionDefaultDescription
--agent-nameallFilter by agent name

decimalai traces import

Import traces from a JSON or JSONL file.
decimalai traces import traces.json
decimalai traces import traces.jsonl --format jsonl
OptionDefaultDescription
--formatautoImport format: auto, json, jsonl

Evaluations

decimalai eval push

Push evaluation scores to a trace.
decimalai eval push trc_abc123 --score quality=0.9 --score relevance=0.85
OptionRequiredDescription
--scoreYes (repeatable)Score in name=value format
--sourceNo (default: cli)Source label for the scores

Skills

decimalai skills list

List all skills in your workspace.
decimalai skills list --limit 50

decimalai skills sync

Sync local SKILL.md files to the platform.
decimalai skills sync ./agents/skills
The skills directory is a positional argument (default ./skills).
OptionDefaultDescription
SKILLS_DIR (positional)./skillsPath to the skills directory
--dry-runoffWalk and hash but don’t POST — preview what would change
--apply-pulls / --no-apply-pulls--apply-pullsWrite backend-newer skills back to disk

Manifests

decimalai manifests list

List manifests for an agent.
decimalai manifests list --agent-name my-agent

Datasets

decimalai datasets list

List all datasets in the workspace with row counts and version info.
decimalai datasets list

decimalai datasets show

Show version history for a specific dataset.
decimalai datasets show ds_abc123

decimalai datasets pull

Pull a dataset version to a local file. The primary way to get training data onto disk.
# Pull latest version
decimalai datasets pull ds_abc123 -o ./training_data.jsonl

# Pull specific version
decimalai datasets pull ds_abc123 -o ./data.jsonl --version v2

# Pull as Parquet
decimalai datasets pull ds_abc123 -o ./data.parquet
OptionDefaultDescription
--output / -o(required)Output file path. Format inferred from extension.
--version / -vlatestVersion: latest, v3, 3, or UUID
--formatautoOverride: jsonl or parquet

decimalai datasets export

Export a dataset version to stdout (for piping) or a file.
# Export to stdout (for piping)
decimalai datasets export ds_abc123 | head -5

# Export specific version to file
decimalai datasets export ds_abc123 --version v2 -o training.jsonl
OptionDefaultDescription
--version / -vlatestVersion specifier
--formatjsonlFormat: jsonl or parquet
--output / -ostdoutOutput file path

decimalai datasets build

Build a new dataset version from traces.
decimalai datasets build ds_abc123 --verdict keep
OptionDefaultDescription
--verdictallFilter by eval verdict: keep, repair

decimalai datasets push-to-hub

Push a dataset to HuggingFace Hub, making it loadable by Axolotl, Unsloth, TRL, and any tool supporting load_dataset().
# Push latest version (private repo)
decimalai datasets push-to-hub ds_abc123 my-org/support-agent-sft

# Push specific version as public
decimalai datasets push-to-hub ds_abc123 my-org/my-dataset --version v2 --public
Argument / OptionDefaultDescription
DATASET_ID(required)The dataset ID
REPO_ID(required)HuggingFace repo: org/dataset-name
--version / -vlatestVersion specifier
--token$HF_TOKENHuggingFace API token
--public / --private--privateRepo visibility
--splittrainDataset split name
After pushing, use in training:
# Python (TRL, Unsloth)
from datasets import load_dataset
ds = load_dataset("my-org/support-agent-sft")
# Axolotl config
datasets:
  - path: my-org/support-agent-sft
    type: chat_template
Requires pip install huggingface_hub datasets.

Replay

decimalai replay run

Execute a replay batch — re-run stale prompts through your updated agent.
decimalai replay run batch_abc123 --agent-fn my_app.agent:run
Argument / OptionRequiredDescription
BATCH_IDYesThe replay batch to execute
--agent-fnYesPython path to your agent function (module:function)
--dry-runNoShow what would be done without executing
--no-skip-failedNoStop on first failure instead of continuing
Output:
✓ Loaded agent function: my_app.agent:run
  ✓ [1/50] abc12345: completed
  ✓ [2/50] def67890: completed
  ✗ [3/50] ghi11111: failed
  ...
========================================
Replay Summary
========================================
  Total:      50
  Completed:  48
  Passed:     45
  Failed:     3
  Skipped:    2
  Pass rate:  94%
The CLI exits with code 1 if any replays fail, making it suitable for CI/CD pipelines.

Global Options

All commands accept these options:
OptionDefaultDescription
--api-key$DECIMAL_API_KEYAPI key
--base-urlhttps://api.decimal.aiPlatform URL
--projectdefaultProject name

Environment Variables

VariableDescription
DECIMAL_API_KEYAPI key (used by all commands)
DECIMALAI_API_KEYAlias (also accepted)
DECIMAL_BASE_URLPlatform API URL
DECIMAL_AUTO_TRACEAuto-instrument on import. Accepts any framework or provider flag name: langchain, openai-agents, adk, llamaindex, crewai, autogen, otel, or a direct provider SDK (openai, anthropic, google).