Use inf dataset to upload JSONL inference data and to manage datasets created from captured traffic, existing uploads, or JSONL files on disk. Materialized datasets feed into inf eval run for evals and into training jobs.
Alias: inf datasets
inf dataset upload
Import a JSONL file into the active project as an upload entry. An upload is the raw material you can then materialize into an eval or training dataset. The CLI validates the file locally, uploads it in parts, waits for processing to finish, and prints the detected format plus the processed line count.
Arguments
| Argument | Required | Description |
|---|---|---|
| file | Yes | Path to the JSONL file to upload |
Options
| Flag | Required | Description | Default |
|---|---|---|---|
| -n, --name <name> | No | Upload name shown in Catalyst | Filename without extension |
| --no-wait | No | Return after the transfer finishes instead of polling processing | Off |
Once the upload has been processed, materialize it with inf dataset create --upload-id below or in the dashboard.
Examples
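A minimal sketch of scripted uploads, using only the flags listed in the table above. The function names and the INF_BIN variable are hypothetical conveniences of this sketch, not part of the CLI; INF_BIN exists only so the commands can be dry-run with INF_BIN=echo. The sketch also assumes flags may follow the positional file argument.

```shell
# Hypothetical helpers, not part of the CLI.
# INF_BIN is an assumed override so the sketch can be dry-run (INF_BIN=echo).

# Upload a JSONL file and wait for processing.
# $1: path to the JSONL file; $2: optional upload name.
upload_jsonl() {
  file=$1
  name=${2:-}
  if [ -n "$name" ]; then
    "${INF_BIN:-inf}" dataset upload "$file" --name "$name"
  else
    # Without --name, the CLI uses the filename without its extension.
    "${INF_BIN:-inf}" dataset upload "$file"
  fi
}

# Return as soon as the transfer finishes instead of polling processing.
upload_jsonl_nowait() {
  "${INF_BIN:-inf}" dataset upload "$1" --no-wait
}
```

With these definitions, upload_jsonl ./inferences.jsonl support-chats would run inf dataset upload ./inferences.jsonl --name support-chats; the path and name are placeholders.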
inf dataset create
Materialize an eval or training dataset from captured traffic, an existing upload, or a JSONL file on disk. The file-backed path uploads, waits for processing, and materializes in one command.
Options
| Flag | Required | Description | Default |
|---|---|---|---|
| -n, --name <name> | Yes | Dataset name | None |
| -t, --type <type> | Yes | eval or training | None |
| -f, --file <path> | No | JSONL file on disk; the CLI uploads it, waits for processing, then materializes from that upload | None |
| --upload-id <id> | No | Materialize from an existing upload | None |
| --task <taskId> | No | Filter captured traffic by task ID | None |
| --model <modelId> | No | Filter captured traffic by model ID | None |
| --since <date> | No | Start of the time window for traffic filters (ISO 8601 or YYYY-MM-DD HH:MM:SS) | 30 days ago |
| --until <date> | No | End of the time window for traffic filters (same formats as --since) | 1 minute from now |
| --limit <n> | No | Cap on the number of inferences included | None |
| --status <status> | No | Status filter: success, 2xx, or a specific code like 200. Datasets reject non-success traffic unless you override this filter | success |
| --description <text> | No | Free-text dataset description | None |
--file and --upload-id are mutually exclusive — --file creates a new upload automatically. Date values accept ISO 8601 (2026-04-01T00:00:00Z) or ClickHouse format (2026-04-01 00:00:00).
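The time window for traffic filters can be computed rather than typed by hand. The following is a sketch under stated assumptions: traffic_window is a hypothetical helper, and it relies on a date command that accepts -d "@<epoch>" (GNU coreutils and BusyBox do; BSD and macOS date use -r instead). It emits timestamps in the ISO 8601 form shown above.

```shell
# Hypothetical helper: print --since/--until flags covering the last N days (UTC).
# Assumes `date -d "@<epoch>"` is supported (GNU coreutils, BusyBox).
traffic_window() {
  days=$1
  now=$(date -u +%s)                       # current time as epoch seconds
  start=$(( now - days * 86400 ))          # N days earlier
  since=$(date -u -d "@$start" +%Y-%m-%dT%H:%M:%SZ)
  until_ts=$(date -u -d "@$now" +%Y-%m-%dT%H:%M:%SZ)
  echo "--since $since --until $until_ts"
}
```

A call such as inf dataset create --name weekly-eval --type eval --model my-model $(traffic_window 7) would then pass both flags; the dataset name and model ID are placeholders, and the substitution is left unquoted on purpose so the shell splits it into four arguments.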
Dataset materialization runs asynchronously. The command prints the dataset ID and points at inf dataset get <id> to check progress.
Examples
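A minimal sketch of the three dataset sources described above (file on disk, existing upload, captured traffic), with one source per call so that --file and --upload-id are never combined. create_dataset and INF_BIN are hypothetical names introduced for this sketch; INF_BIN only allows a dry run with INF_BIN=echo. The traffic variant filters by task ID; other traffic filters from the table could be added the same way.

```shell
# Hypothetical helper, not part of the CLI.
# usage: create_dataset <name> <eval|training> <file|upload|traffic> <value>
create_dataset() {
  name=$1
  type=$2
  source_kind=$3
  value=${4:-}
  case "$type" in
    eval|training) ;;
    *) echo "type must be eval or training" >&2; return 2 ;;
  esac
  case "$source_kind" in
    # JSONL file on disk: upload, wait for processing, then materialize.
    file)    "${INF_BIN:-inf}" dataset create --name "$name" --type "$type" --file "$value" ;;
    # Existing upload, referenced by its ID.
    upload)  "${INF_BIN:-inf}" dataset create --name "$name" --type "$type" --upload-id "$value" ;;
    # Captured traffic, filtered by task ID.
    traffic) "${INF_BIN:-inf}" dataset create --name "$name" --type "$type" --task "$value" ;;
    *) echo "source must be file, upload, or traffic" >&2; return 2 ;;
  esac
}
```

For instance, create_dataset refunds-eval eval file ./inferences.jsonl would run inf dataset create --name refunds-eval --type eval --file ./inferences.jsonl; all values here are placeholders.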
inf dataset list
Display datasets in the active project.
Alias: inf dataset ls
Options
| Flag | Required | Description | Default |
|---|---|---|---|
| -l, --limit <n> | No | Maximum number of results | 20 |
Use --json to get full UUIDs for scripting.
Examples
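A short sketch of listing datasets from a script. It uses --limit from the table and the --json flag mentioned in the note above; the shape of the JSON output is not described in this section, so the sketch does not parse it. list_datasets_json and INF_BIN are hypothetical names, with INF_BIN present only to allow a dry run (INF_BIN=echo).

```shell
# Hypothetical helper, not part of the CLI.
# $1: optional maximum number of results (20 is the documented default).
list_datasets_json() {
  limit=${1:-20}
  case "$limit" in
    # Reject anything that is not made up entirely of digits.
    *[!0-9]*) echo "limit must be a whole number" >&2; return 2 ;;
  esac
  "${INF_BIN:-inf}" dataset list --limit "$limit" --json
}
```

Calling list_datasets_json 50 would run inf dataset list --limit 50 --json.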
inf dataset get
View detailed information about a specific dataset — ID, name, type, inference count, export status, source project, and creation date.
Arguments
| Argument | Required | Description |
|---|---|---|
| id | Yes | Dataset ID, UUID prefix (4+ chars), or exact name |
inf dataset download
Download a dataset as a JSONL file. If the server-side export isn’t ready yet, the CLI requests it and polls until it’s ready before downloading.
Arguments
| Argument | Required | Description |
|---|---|---|
| id | No | Dataset ID, UUID prefix (4+ chars), or exact name. If omitted in an interactive terminal, the CLI prompts you to choose. |
Options
| Flag | Required | Description | Default |
|---|---|---|---|
| -o, --output <path> | No | Output file path | <dataset-name>.jsonl for Hugging Face, <dataset-name>.source-backed.jsonl for source-backed |
| -f, --format <format> | No | Download format: huggingface or source-backed | Prompted in a TTY; otherwise huggingface |
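In a script it can help to pass the ID, format, and output path explicitly so nothing depends on interactive prompts. The sketch below does that and mirrors the default file-name pattern from the table, with one difference that is an assumption of this sketch: it names the file after the identifier passed in rather than the dataset name. download_dataset and INF_BIN are hypothetical; INF_BIN only enables a dry run with INF_BIN=echo.

```shell
# Hypothetical helper, not part of the CLI.
# $1: dataset ID, UUID prefix, or exact name; $2: optional format.
download_dataset() {
  id=$1
  format=${2:-huggingface}
  case "$format" in
    huggingface)   out="$id.jsonl" ;;
    source-backed) out="$id.source-backed.jsonl" ;;
    *) echo "format must be huggingface or source-backed" >&2; return 2 ;;
  esac
  # Explicit flags avoid the format prompt shown in a TTY.
  "${INF_BIN:-inf}" dataset download "$id" --format "$format" --output "$out"
}
```

For example, download_dataset my-evals source-backed would run inf dataset download my-evals --format source-backed --output my-evals.source-backed.jsonl, where my-evals is a placeholder dataset name.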