Quick reference for webhook support in asynchronous inference
Header | Description | Example |
---|---|---|
X-Inference-Event | Event type | generation.completed, async-embedding.completed, or slow-group.completed |
X-Inference-Webhook-ID | Webhook identifier | AhALzdz8S |
X-Inference-Generation-ID | Generation ID (if applicable) | XBKcs7F1s2oJ_AHiLMbF4 |
X-Inference-Group-ID | Group ID (for group events) | GRP_XYZ123 |
User-Agent | inference.net webhook agent | Kuzco-Webhook/1.0 |
Content-Type | Always application/json | application/json |
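
The headers above can be read on the receiving side roughly as follows. This is a minimal sketch, not an official client: `WebhookMeta` and `parse_webhook_headers` are hypothetical names, and treating `X-Inference-Event` and `X-Inference-Webhook-ID` as always present is an assumption (the table only marks the generation and group IDs as conditional).

```python
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass
class WebhookMeta:
    """Metadata carried in the documented X-Inference-* headers."""
    event: str
    webhook_id: str
    generation_id: Optional[str]  # only sent when applicable
    group_id: Optional[str]       # only sent for group events


def parse_webhook_headers(headers: Mapping[str, str]) -> WebhookMeta:
    # HTTP header names are case-insensitive, so normalise before lookup.
    lowered = {name.lower(): value for name, value in headers.items()}
    event = lowered.get("x-inference-event")
    webhook_id = lowered.get("x-inference-webhook-id")
    # Assumption: these two headers accompany every delivery.
    if event is None or webhook_id is None:
        raise ValueError("missing X-Inference-Event or X-Inference-Webhook-ID")
    return WebhookMeta(
        event=event,
        webhook_id=webhook_id,
        generation_id=lowered.get("x-inference-generation-id"),
        group_id=lowered.get("x-inference-group-id"),
    )
```

Most web frameworks expose request headers as a mapping, so the function can usually be called with that object directly.
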

Metric | Value | Notes |
---|---|---|
Response timeout | 30 seconds | Your endpoint must respond within this time |
Retry attempts | 3 | With exponential backoff |
Max payload size | 10 MB | Typical: 5-50 KB |
Delivery time | Under 60 seconds | From completion of the request to webhook delivery |
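
A receiver may want to reject bodies larger than the documented maximum before reading them. The check below is a sketch under assumptions: `MAX_PAYLOAD_BYTES` and `payload_size_ok` are hypothetical names, "10 MB" is interpreted as 10 MiB (the more lenient reading), and the sender is assumed to include a `Content-Length` header.

```python
from typing import Optional

# Assumption: the documented 10 MB limit is read as 10 MiB.
MAX_PAYLOAD_BYTES = 10 * 1024 * 1024


def payload_size_ok(content_length: Optional[str]) -> bool:
    """Return True if the declared body size is within the documented limit."""
    if content_length is None:
        # No declared length: this sketch chooses to refuse rather than guess.
        return False
    try:
        size = int(content_length)
    except ValueError:
        return False
    return 0 <= size <= MAX_PAYLOAD_BYTES
```
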

Code | Meaning | Retry? |
---|---|---|
200-299 | Success | No |
400-499 | Client error | No |
500-599 | Server error | Yes |
Timeout | No response within 30 seconds | Yes |
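
The retry rules in this table can be encoded as a small helper, for example to assert in a test suite that an endpoint never answers with a status that triggers redelivery. `will_be_retried` is a hypothetical name, and representing a timeout as `None` is a convention of this sketch. Status codes outside the ranges listed above are not covered by the table, so the sketch raises an error for them rather than guessing.

```python
from typing import Optional


def will_be_retried(status_code: Optional[int]) -> bool:
    """Apply the documented retry rules; None stands for a timeout."""
    if status_code is None:
        return True   # no response within 30 seconds: retried
    if 200 <= status_code <= 299:
        return False  # success
    if 400 <= status_code <= 499:
        return False  # client error: not retried
    if 500 <= status_code <= 599:
        return True   # server error: retried
    # 1xx and 3xx responses are not listed in the table.
    raise ValueError(f"retry behaviour not documented for status {status_code}")
```
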

Issue | Solution |
---|---|
Not receiving webhooks | Check that the webhook is not disabled in the dashboard, test connectivity, and verify that the URL uses HTTPS |
Duplicate webhooks | Make your handler idempotent and ensure it returns 200 OK |
Webhooks timing out | Respond immediately, process asynchronously |
Invalid payload | Validate against documented schema |
Test webhook fails | Check that the endpoint is publicly accessible and returns 200 OK |
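
Two of the fixes above, idempotent handling and responding immediately while processing asynchronously, can be combined in one pattern. The following is a framework-agnostic sketch, not the service's recommended implementation: `WebhookInbox`, `accept`, and `drain` are hypothetical names. The choice of deduplication key is also an assumption, since the source does not say which value uniquely identifies a delivery (a combination of the generation ID and the event type is one candidate). The in-memory set of seen keys does not survive restarts, so a real deployment would likely use persistent storage.

```python
import json
import logging
import queue
import threading
from typing import Any, Callable, Set


class WebhookInbox:
    """Acknowledge deliveries at once and process them on a worker thread."""

    def __init__(self, handler: Callable[[Any], None]) -> None:
        self._handler = handler
        self._queue: "queue.Queue[Any]" = queue.Queue()
        self._seen: Set[str] = set()
        self._lock = threading.Lock()
        threading.Thread(target=self._work, daemon=True).start()

    def accept(self, dedup_key: str, raw_body: bytes) -> int:
        """Return the HTTP status to send; never waits for processing."""
        try:
            payload = json.loads(raw_body)
        except ValueError:
            return 400  # malformed body; 4xx responses are not retried
        with self._lock:
            if dedup_key in self._seen:
                return 200  # duplicate: acknowledge without reprocessing
            self._seen.add(dedup_key)
        self._queue.put(payload)
        return 200

    def _work(self) -> None:
        while True:
            payload = self._queue.get()
            try:
                self._handler(payload)
            except Exception:
                logging.exception("webhook processing failed")
            finally:
                self._queue.task_done()

    def drain(self) -> None:
        """Block until all queued payloads are processed (useful in tests)."""
        self._queue.join()
```

An HTTP handler would call `accept` with the request body and send back the returned status code, so the response time stays independent of how long the slow work takes.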