Rate Limits
Rate limits for the Inference.net API
API Rate Limits
Please contact us or use the support chat to request a higher rate limit.
Language Models: 500 requests per minute
Image Models: 100 requests per minute
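One way to stay under these limits is to throttle requests on the client before they are sent. The following is a minimal sketch, not part of any Inference.net SDK: the `SlidingWindowLimiter` class and its parameters are hypothetical names chosen for illustration, and it assumes the limits are counted over a rolling 60-second window, which this page does not specify.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Allow at most `max_calls` calls in any `period`-second window.

    Hypothetical helper for illustration; not an Inference.net API.
    """

    def __init__(self, max_calls, period=60.0,
                 clock=time.monotonic, sleep=time.sleep):
        self.max_calls = max_calls
        self.period = period
        self._clock = clock
        self._sleep = sleep
        self._sent = deque()  # timestamps of calls still inside the window

    def acquire(self):
        """Block until another call fits in the window, then record it."""
        while True:
            now = self._clock()
            # Drop timestamps that have left the window.
            while self._sent and now - self._sent[0] >= self.period:
                self._sent.popleft()
            if len(self._sent) < self.max_calls:
                self._sent.append(now)
                return
            # Window is full: wait until the oldest recorded call expires.
            self._sleep(self.period - (now - self._sent[0]))


# Limits taken from this page; the rolling-window behavior is an assumption.
language_limiter = SlidingWindowLimiter(max_calls=500)
image_limiter = SlidingWindowLimiter(max_calls=100)
```

Calling `language_limiter.acquire()` immediately before each language model request (and `image_limiter.acquire()` before each image model request) keeps a single process within the limits listed above. It does not coordinate across multiple processes or machines that share one account.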