- OpenAI-compatible Vertex endpoint: use the OpenAI SDK and set
x-inference-provider-urlto your Vertex/endpoints/openapiURL. - Native Vertex APIs: call the Vertex model operation path through the Catalyst gateway and set
x-inference-provider-urlto the global or regionalaiplatform.googleapis.combase URL.
x-inference-provider: vertex-ai for both modes so Catalyst applies Vertex-specific URL and authentication handling.
This guide is for Gemini and Anthropic through Google Cloud Vertex AI. For
direct Gemini API calls using
generativelanguage.googleapis.com, see the
Google Gemini guide.Supported Vertex endpoints
| API shape | Catalyst path example | Streaming |
|---|---|---|
| OpenAI-compatible Gemini | /v1/chat/completions with x-inference-provider-url: .../endpoints/openapi | Yes |
| Native Gemini | /v1/projects/{project}/locations/{location}/publishers/google/models/{model}:generateContent | No |
| Native Gemini stream | /v1/projects/{project}/locations/{location}/publishers/google/models/{model}:streamGenerateContent | Yes |
| Anthropic on Vertex | /v1/projects/{project}/locations/{location}/publishers/anthropic/models/{model}:rawPredict | No |
| Anthropic on Vertex stream | /v1/projects/{project}/locations/{location}/publishers/anthropic/models/{model}:streamRawPredict | Yes |
| Native Gemini with API key | /v1/publishers/google/models/{model}:generateContent with x-inference-provider-api-key: $GEMINI_VERTEX_API_KEY | No |
Environment
global location, use:
OpenAI-compatible Vertex
Use this path when you want to keep the OpenAI SDK shape for Vertex Gemini models.Native Gemini on Vertex
Use native Gemini paths when you need Vertex’sgenerateContent or streamGenerateContent request format.
GEMINI_VERTEX_API_KEY as x-inference-provider-api-key. Catalyst forwards Google API keys to Vertex as the key query parameter.
:streamGenerateContent. Add ?alt=sse if you want Vertex to return server-sent events.
Anthropic on Vertex
Anthropic models on Vertex use Vertex operation paths and Anthropic’s Vertex payload shape. Use a Google Cloud OAuth access token or service-account-minted access token asx-inference-provider-api-key.
:streamRawPredict and include "stream": true in the body.
Header summary
| Header | Required | Notes |
|---|---|---|
Authorization | Yes | Your Catalyst project API key. |
x-inference-provider | Yes | Set to vertex-ai. |
x-inference-provider-api-key | Yes | Google API key for native Gemini, or OAuth2 access token for Vertex. |
x-inference-provider-url | Yes | Vertex host or /endpoints/openapi URL, depending on the API shape. |
x-inference-environment | No | Dashboard environment tag. |
x-inference-task-id | No | Dashboard task grouping. |