Rate limits

Per-key request and concurrency budgets.

Limits apply per API key. They derive from the owner's plan unless overridden by an admin.

PlanRPSConcurrent jobs
Pro55
Max2525

Headers

Every response includes:

  • X-RateLimit-Limit - requests allowed in the current 60-second window
  • X-RateLimit-Remaining - remaining quota
  • X-RateLimit-Reset - seconds until the window rolls over

429 responses

Both RPS exhaustion and concurrent-job exhaustion return HTTP 429.

{
  "error": {
    "code": "rate_limited",
    "message": "Rate limit exceeded (5 req/s). Retry after 7s."
  }
}

Always honour Retry-After if present; back off and retry. Webhook deliveries do this for you automatically.

Spend caps

Per-key daily and monthly credit caps are independent of rate limits. When a cap is reached, requests return HTTP 402 with code spend_cap_reached.