Submit multiple asynchronous inference requests as a single group for easier tracking and webhook notifications.
/v1/slow/group/chat/completions
and /v1/slow/group/completions
endpoints.Feature | Group API | Batch API |
---|---|---|
Maximum requests | 50 | 1,000,000 |
Input format | JSON array in request body | JSONL file upload |
File management | Not required | Required |
Use case | Small batches, quick implementation | Large-scale processing |
Webhook support | Yes | Yes |
Completion time | 1-72 hours | 1-72 hours |
/v1/slow/group/chat/completions
/v1/slow/group/completions
webhook_id
(correct)webhook_url
(incorrect)webhook_idd
(typo)