Dataset Uploads

Upload JSONL files containing inference request/response pairs directly through the dashboard. Uploaded data flows into the same analytics pipeline as live inferences, making it available for training dataset curation, evaluation, and analysis.

File Format

Only JSONL (JSON Lines) files with a .jsonl extension are supported. Each line must be a valid JSON object with the following fields:

Field	Type	Required	Description
`request`	object	Yes	The raw provider request body (e.g., OpenAI or Anthropic format)
`response`	object	No	The raw provider response body (nullable)
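
If you want to check records before uploading, the field rules in the table can be sketched locally in Python. This is a minimal sketch, not the validation the service itself runs: the function name `validate_line` is hypothetical, and the server-side checks may be stricter.

```python
import json


def validate_line(line: str) -> list[str]:
    """Return a list of problems found in one JSONL line (empty if none).

    Hypothetical local check; it mirrors only the field rules in the table.
    """
    try:
        record = json.loads(line)
    except json.JSONDecodeError as exc:
        return [f"invalid JSON: {exc.msg}"]

    if not isinstance(record, dict):
        return ["line is not a JSON object"]

    problems = []
    # 'request' is required and must be an object.
    if not isinstance(record.get("request"), dict):
        problems.append("'request' is required and must be an object")
    # 'response' may be absent or null; otherwise it must be an object.
    response = record.get("response")
    if response is not None and not isinstance(response, dict):
        problems.append("'response' must be an object or null")
    return problems
```

An empty list means the line passed these local checks; it does not guarantee that the line will be accepted after upload.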

The system auto-detects the provider format (OpenAI or Anthropic) from the structure of the response body.
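
The detection rules themselves are not documented here. As an illustration only, a shape-based check could look like the sketch below. It assumes the publicly documented response shapes (an OpenAI chat completion has a top-level `choices` list; an Anthropic Messages response has `"type": "message"` and a `content` list). The function name `detect_provider` is hypothetical, and the platform's actual logic may differ.

```python
from typing import Optional


def detect_provider(response: Optional[dict]) -> str:
    """Guess the provider from the shape of a response body.

    Heuristic sketch only; not the platform's detection logic.
    """
    if not isinstance(response, dict):
        return "unknown"
    # OpenAI chat completions carry a top-level "choices" list.
    if isinstance(response.get("choices"), list):
        return "openai"
    # Anthropic Messages responses carry type "message" and a "content" list.
    if response.get("type") == "message" and isinstance(response.get("content"), list):
        return "anthropic"
    return "unknown"
```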

Example JSONL Content

{"request": {"model": "gpt-4o", "messages": [{"role": "system", "content": "You are a helpful assistant."}, {"role": "user", "content": "What is the capital of France?"}]}, "response": {"id": "chatcmpl-abc123", "object": "chat.completion", "choices": [{"index": 0, "message": {"role": "assistant", "content": "The capital of France is Paris."}, "finish_reason": "stop"}], "usage": {"prompt_tokens": 24, "completion_tokens": 8, "total_tokens": 32}}}
{"request": {"model": "gpt-4o", "messages": [{"role": "user", "content": "Translate 'hello' to Spanish."}]}, "response": {"id": "chatcmpl-def456", "object": "chat.completion", "choices": [{"index": 0, "message": {"role": "assistant", "content": "Hola"}, "finish_reason": "stop"}], "usage": {"prompt_tokens": 12, "completion_tokens": 2, "total_tokens": 14}}}

The response field is optional. You can upload request-only data if you don’t have the corresponding responses — this is useful for curating prompt datasets.
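
For example, a request-only file can be generated from a list of prompts. The sketch below is one possible approach; the helper name `write_jsonl` and the sample prompts are illustrative assumptions, not part of the product.

```python
import json


def write_jsonl(records, path):
    """Write an iterable of dicts to `path`, one JSON object per line.

    Returns the number of records written.
    """
    count = 0
    with open(path, "w", encoding="utf-8") as f:
        for record in records:
            # json.dumps emits a single line; newlines inside strings are escaped.
            f.write(json.dumps(record, ensure_ascii=False) + "\n")
            count += 1
    return count


# Request-only records: the "response" key is simply left out.
prompts = ["What is the capital of France?", "Translate 'hello' to Spanish."]
records = [
    {"request": {"model": "gpt-4o", "messages": [{"role": "user", "content": p}]}}
    for p in prompts
]
```

Calling `write_jsonl(records, "prompts.jsonl")` would then produce a two-line file ready for upload.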

Upload Limits

Limit	Value
Maximum file size	100 MB
Maximum line count	1,000,000 lines

These limits are enforced both client-side (in the upload dialog) and server-side. Files exceeding either limit will be rejected before processing begins.
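
To catch an oversized file before opening the upload dialog, you can run a local pre-flight check along these lines. This is a sketch under assumptions: it treats 100 MB as 100 × 1024 × 1024 bytes (the service may instead count 100,000,000 bytes), it counts every line including blank ones, and the function name `check_limits` is hypothetical.

```python
import os

MAX_BYTES = 100 * 1024 * 1024  # assumption: binary megabytes
MAX_LINES = 1_000_000


def check_limits(path, max_bytes=MAX_BYTES, max_lines=MAX_LINES):
    """Return a list of limit violations for the file at `path` (empty if none)."""
    problems = []

    size = os.path.getsize(path)
    if size > max_bytes:
        problems.append(f"file is {size} bytes; limit is {max_bytes}")

    # Stream the file in binary mode so large files are never fully loaded.
    line_count = 0
    with open(path, "rb") as f:
        for _ in f:
            line_count += 1
    if line_count > max_lines:
        problems.append(f"file has {line_count} lines; limit is {max_lines}")

    return problems
```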

How to Upload

1. Navigate to the Datasets page in your project’s Observability dashboard.
2. Click the “Upload Data” button.
3. Enter a name for the upload (e.g., “GPT-4o production samples”).
4. Select or drag-and-drop a .jsonl file.
5. The UI validates file type, size, and line count before uploading.
6. Click Upload to start the upload.

The file picker only accepts .jsonl files. If your data is in another format (CSV, JSON array, etc.), convert it to JSONL first — one JSON object per line, no trailing commas.
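
As one possible conversion, a file containing a single top-level JSON array can be rewritten as JSONL with the Python standard library. The function name `json_array_to_jsonl` is hypothetical, and the sketch loads the whole array into memory, so it suits files that fit comfortably in RAM. Each array element still needs the `request` field described under File Format.

```python
import json


def json_array_to_jsonl(src_path, dst_path):
    """Convert a file holding one JSON array into JSONL.

    Returns the number of lines written.
    """
    with open(src_path, encoding="utf-8") as src:
        items = json.load(src)
    if not isinstance(items, list):
        raise ValueError("expected a top-level JSON array")

    with open(dst_path, "w", encoding="utf-8") as dst:
        for item in items:
            # One compact JSON value per line, with no separators between lines.
            dst.write(json.dumps(item, ensure_ascii=False) + "\n")
    return len(items)
```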

How Upload Processing Works

After the file is uploaded, a background job picks it up for processing. Here’s what happens:

Stage	Status	Description
Upload received	pending	File is stored and queued for processing
Parsing & validation	processing	Each line is parsed and validated, and its provider format is auto-detected
Complete	completed	All lines have been processed and inserted into the analytics database
Error	failed	Processing encountered a fatal error

During processing:

Lines are validated individually — a single malformed line won’t fail the entire upload.
Rows are inserted into the analytics database in batches of 1,000 for efficiency.
Lines that fail validation are recorded, with up to 100 error details stored per upload.
The dashboard polls every 5 seconds while any upload is in progress, so status changes appear within a few seconds.
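
The behavior described in this list (per-line validation, batches of 1,000, and at most 100 stored error details) can be pictured with the following sketch. It illustrates the pattern only and is not the service's implementation: the function `process_lines`, the `insert_batch` callback, and the shape of the returned summary are all assumptions.

```python
import json
from typing import Callable, Iterable

BATCH_SIZE = 1_000        # rows per insert, as described above
MAX_ERROR_DETAILS = 100   # error details kept per upload, as described above


def process_lines(lines: Iterable[str], insert_batch: Callable[[list], None]) -> dict:
    """Validate lines one by one, insert valid rows in batches, cap stored errors."""
    batch, errors = [], []
    processed = failed = 0

    for line_number, line in enumerate(lines, start=1):
        try:
            record = json.loads(line)
            if not isinstance(record, dict) or not isinstance(record.get("request"), dict):
                raise ValueError("missing or invalid 'request' object")
        except ValueError as exc:  # json.JSONDecodeError is a ValueError subclass
            # A bad line is counted and skipped; it does not abort the upload.
            failed += 1
            if len(errors) < MAX_ERROR_DETAILS:
                errors.append({"line": line_number, "error": str(exc)})
            continue

        batch.append(record)
        processed += 1
        if len(batch) == BATCH_SIZE:
            insert_batch(batch)
            batch = []

    if batch:  # flush the final partial batch
        insert_batch(batch)
    return {"processed": processed, "failed": failed, "errors": errors}
```

Note that the failure count keeps increasing after the error list is full, so the number of failed lines can exceed the number of stored error details.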

Viewing Uploaded Data

Uploads are listed in the Uploads section of the Datasets page. Each upload displays:

Name — the label you provided during upload
Status badge — pending, processing, completed, or failed
Line counts — processed lines vs. total lines
Failed lines — number of lines that failed validation
Creation date — when the upload was initiated

Inspecting Results

Expand a completed upload to see error details for any lines that failed validation.
Click the external link icon to jump to the Inferences page, pre-filtered to show only inferences from that upload.
Click Download to retrieve the original uploaded JSONL file.

Use the Inferences page filter to compare uploaded historical data against live production inferences — helpful for spotting distribution drift or quality regressions.

Use Cases

Scenario	Description
Historical data import	Upload inference logs from other providers (OpenAI, Anthropic) to analyze them alongside your Inference.net data.
Training dataset curation	Import production data, then use the dashboard to filter, review, and curate datasets for fine-tuning.
Multi-source consolidation	Combine inference logs from multiple environments or providers into a single analytics view.
