This feature is on the roadmap. Today, only models trained on the platform can be deployed.

What’s planned

  • Deploy off-the-shelf OSS models: run popular open source models on dedicated GPUs without training them on the platform first. Pick a model, select your instance, and deploy (see the sketch after this list).
  • Bring your own trained models: deploy models you’ve already fine-tuned outside the platform. Upload your weights and serve them on the same infrastructure.
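
Neither capability has a public API yet, so any example here is speculative. Purely to make the planned workflow concrete, below is a minimal sketch of what a dedicated-deployment request could look like; the endpoint path, payload fields, instance name, and INFERENCE_API_KEY environment variable are all assumptions for illustration, not documented platform behavior.

```python
import os

import requests

# Hypothetical sketch only: this endpoint and payload are assumptions,
# not the platform's real API. The feature described above is not yet live.
API_BASE = "https://api.inference.net/v1"   # assumed base URL
API_KEY = os.environ["INFERENCE_API_KEY"]   # assumed env var name

# Request a dedicated deployment of an off-the-shelf OSS model.
payload = {
    "model": "meta-llama/Llama-3.1-8B-Instruct",  # example OSS model
    "instance_type": "a100-80gb",                 # assumed instance name
    "replicas": 1,
}

resp = requests.post(
    f"{API_BASE}/deployments",
    headers={"Authorization": f"Bearer {API_KEY}"},
    json=payload,
    timeout=30,
)
resp.raise_for_status()
print(resp.json())  # e.g. a deployment id and status to poll
```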

Why this matters

Not every deployment starts with training on the Inference platform. Some teams want dedicated GPU serving for an existing open source model, or they’ve already fine-tuned a model elsewhere and want to host it. This feature will bring all of that into the same deployment workflow — same API, same monitoring, same scaling options.

Want early access?

Talk to our team if you’d like to be notified when open source and custom model deployments are available.