Logo SVG copied to clipboard
Early Access Build agents that never fail Learn more
OpenAI HTTP 429 Rate limit

OpenAI 429 — Rate limit exceeded

68teams hit this · 30d
540occurrences · 30d
70%auto-recovered by Manifest
Occurrences across all Manifest teams, last 14 days

What this error means

OpenAI throttles by requests-per-minute and tokens-per-minute per model. Parallel agent calls trip it in bursts.

How to fix it

  • Honor the Retry-After header before retrying
  • Add client-side concurrency limits
  • Request a higher tier or split traffic across models
Example error message
{
  "error": {
    "message": "Rate limit reached for requests. Limit 3500 per min.",
    "type": "requests",
    "code": "rate_limit_exceeded"
  }
}

Frequently asked

RPM or TPM — which limit am I hitting?

The error names it: "requests" is per-minute request cap, "tokens" is per-minute token cap.

Don't let your requests fail again and again

Manifest fixes your bad LLM requests on the fly so they return successful responses before it reaches your agent. No downtime.

  • Deprecated / Not-found models
  • Wrong parameters
  • Malformed requests
  • Exceeded context windows

Join the waitlist to get early access with a free month.