Deploy a Trained Model

What you’ll have when you finish
Before you start
Step 1: review the completed training job
Step 2: create the deployment
Step 3: copy the public model identifier
Step 4: send a smoke test
Step 5: watch the deployed traffic
Verify it worked
What to do next

This guide is the handoff from model improvement to production validation.

What you’ll have when you finish

one deployment for the trained model
one public model identifier
one successful smoke test against the deployment endpoint

Before you start

complete a training run with the E2E Fine-tuning guide
confirm the trained model still beats or matches the baseline on the eval you trust

Step 1: review the completed training job

Before you create a deployment, inspect the training job detail page for:

final status
external job ID
base model
current or final loss
checkpoint evals and average scores
final model reference / weights

Do not promote a model you cannot explain.

Step 2: create the deployment

In the deployment create flow, you choose:

deployment name
model
speed target
instance count

The dashboard also generates a public deployment identifier in the shape teamSlug/name-randomId unless you override it.

Step 3: copy the public model identifier

Once the deployment exists, copy the deployment’s public model identifier from the overview page. You will use that as the model value in a normal API request.

Step 4: send a smoke test

Use the deployment’s public model identifier with the standard API shape and verify that:

the request completes successfully
the output is correct enough for the workflow
the request shows up in the deployment inferences view

Step 5: watch the deployed traffic

After rollout, inspect:

deployment overview
instances
recent deployment inferences
Observe analytics for the surrounding workflow

The goal is not just “deployment succeeded.” The goal is “the model behaves correctly under real traffic.”

Verify it worked

You should now have:

one live deployment
one public model identifier
one successful deployment request visible in the deployment inferences tab

What to do next

Observe

Keep routing real traffic through Catalyst so the next eval and training cycle stays grounded in production behavior.

Manage and Monitor

Learn more about deployment operations, billing, and scaling.

⌘I

Get Started

Observe

Datasets

Eval

Train

Deploy

Platform

What you’ll have when you finish

Before you start

Step 1: review the completed training job

Step 2: create the deployment

Step 3: copy the public model identifier

Step 4: send a smoke test

Step 5: watch the deployed traffic

Verify it worked

What to do next

Observe

Manage and Monitor

Get Started

Observe

Datasets

Eval

Train

Deploy

Platform

Documentation Index

​What you’ll have when you finish

​Before you start

​Step 1: review the completed training job

​Step 2: create the deployment

​Step 3: copy the public model identifier

​Step 4: send a smoke test

​Step 5: watch the deployed traffic

​Verify it worked

​What to do next

Observe

Manage and Monitor

What you’ll have when you finish

Before you start

Step 1: review the completed training job

Step 2: create the deployment

Step 3: copy the public model identifier

Step 4: send a smoke test

Step 5: watch the deployed traffic

Verify it worked

What to do next