If you already have curated data from annotation pipelines, synthetic generation, or another platform, you can upload it as JSONL instead of building a dataset from captured traffic. Uploads and datasets are separate objects in Catalyst:Documentation Index
Fetch the complete documentation index at: https://docs.inference.net/llms.txt
Use this file to discover all available pages before exploring further.
- An upload is the imported JSONL file plus its validation and processing status.
- A dataset is the stable collection you use for evals, training, and download.
How to upload
- Dashboard
- CLI
- Go to Datasets in the dashboard
- Click Upload Data
- Select your
.jsonlfile - Give the upload a name and start the import
The upload command does not ask whether the data is for evals or training. You choose eval vs training when you create a dataset from the completed upload.
After upload
- Wait for the upload to finish processing in Datasets > Uploads.
- Open the dataset creation flow and select the upload as your source.
- Choose whether the resulting dataset is eval or training.
Supported formats
Two JSONL formats are supported. See Dataset Formats for full schemas, required fields, and validation rules.| Format | Structure | Best for |
|---|---|---|
| Source-backed | { request, response } per line | Round-tripping data captured from providers |
| Hugging Face | { messages } per line | Standard training/eval format, easy to create |
Validation behavior
- Invalid rows are reported with line numbers in the upload status details.
- Uploads can complete with some failed rows if at least one row imports successfully.
- Mixed-format files are treated as a fatal error and fail the upload.
- Source-backed rows must include a usable model value in the request.
Upload limits
| Limit | Value |
|---|---|
| Maximum file size | 10 GB |
| Maximum line count | 1,000,000 |
Next steps
Build from traffic instead
Pull datasets directly from your captured production traffic.
CLI Command Reference
Upload from the terminal with
inf dataset upload.Dataset formats reference
Full schema details and validation rules.