Skip to main content

Direct API defaults

Current baseline limits for the direct API are:
  • Language models: 500 requests per minute
  • Image models: 100 requests per minute
These are the fastest numbers to reason about for the shared realtime API.

When limits become the wrong tool

If you are hitting rate limits regularly, the answer is often to change the execution mode rather than just ask for a larger number. Consider:

Need a higher limit?

If you need more headroom on the direct API, meet with our team or contact [email protected].