Lifecycle
Common patterns
Build from keep + pass traces
POST /datasets/{id}/build with allowed_verdicts=["keep"] and min_eval_score=0.7 is the canonical SFT recipe.Export to JSONL or Parquet
GET /datasets/{id}/versions/{v}/export?format=jsonl returns the rows ready for OpenAI fine-tuning or HuggingFace.Pull as HuggingFace Dataset
decimalai.load_hf_dataset(...) returns a materialized datasets.Dataset object. Compatible with every open-source trainer. (Use decimalai.pull_dataset(dataset_id, path) instead to download to a local JSONL/Parquet file.)Compare versions
Versions are immutable. To see what’s changed between v1 and v2, fetch both and diff the row counts + source breakdowns.
Formats
| Format | Full Name | Use Case |
|---|---|---|
| SFT | Supervised Fine-Tuning | One input→output row per LLM call. Imitation learning. Most common. |
| DPO | Direct Preference Optimization | One row per (input, chosen, rejected) triple. Replay-driven — when v2 outperforms v1 on the same input, v2 = chosen. |
Quick start
Related
- Datasets Guide — filter strategies and recipes
- Training Pipeline Tutorial — end-to-end: trace → eval → fine-tune
- Skills & Data Pipeline — SFT vs DPO, repair, replay