> ## Documentation Index
> Fetch the complete documentation index at: https://docs.inference.net/llms.txt
> Use this file to discover all available pages before exploring further.

# Manage and Monitor

> Start, stop, and delete deployments. Monitor production performance and scale when you need to.

<Frame>
  <img src="https://mintcdn.com/kuzco/zhciHP1--S5gDjt6/images/deploy/deploy-overview.png?fit=max&auto=format&n=zhciHP1--S5gDjt6&q=85&s=0847ad1d13f3ba31ce24a157e96ce40b" alt="Deployments list with per-deployment metrics" width="2062" height="484" data-path="images/deploy/deploy-overview.png" />
</Frame>

## Lifecycle operations

* **Start** — bring a stopped deployment back online
* **Stop** — take the deployment offline
* **Delete** — remove the deployment entirely

## Scaling

By default, your model deploys on a single dedicated GPU. If you need additional GPUs or auto-scaling, [reach out to our team](https://inference.net/meet-with-us/).

## Monitoring

Once your deployment is live, click into it from the **Deployments** page to see metrics and individual inference calls. You get the same [Gateway](/platform/gateway/overview) experience — latency, error rates, token usage, and full request/response payloads — scoped to that deployment.

<Frame caption="Deployment detail page showing metrics and inference calls.">
  <img src="https://mintcdn.com/kuzco/zhciHP1--S5gDjt6/images/deploy/deploy-metrics.png?fit=max&auto=format&n=zhciHP1--S5gDjt6&q=85&s=c05d79f298b0534681d1a441ea0141d3" alt="Deployment metrics and inference calls" width="2058" height="1210" data-path="images/deploy/deploy-metrics.png" />
</Frame>

## The loop continues

Your custom model is live. Use [Gateway](/platform/gateway/overview) to watch its production performance, run [evals](/platform/eval/overview) to catch regressions, and [train](/platform/train/overview) the next version when you're ready.

<CardGroup cols={3}>
  <Card title="Gateway" icon="satellite-dish" href="/platform/gateway/overview">
    Monitor production traffic.
  </Card>

  <Card title="Eval" icon="flask" href="/platform/eval/overview">
    Catch quality regressions.
  </Card>

  <Card title="Train" icon="brain" href="/platform/train/overview">
    Build the next version.
  </Card>
</CardGroup>
