What this error means
A 503 UNAVAILABLE is a server-side error: Gemini is temporarily short of capacity. The condition is transient, but these errors tend to cluster during peak hours.
How to fix it
- Retry after a short delay with backoff
- Fail over to a second model on repeated 503s
- Avoid hammering the same region during a spike
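
The first two steps above can be sketched as a small wrapper. This is a minimal illustration, not an official client: `ServiceUnavailable`, `call_with_backoff`, and the `call_model` callable are hypothetical names standing in for whatever your SDK or HTTP layer raises and calls on a 503, and the delay values are assumptions to tune.

```python
import random
import time


class ServiceUnavailable(Exception):
    """Hypothetical exception standing in for a 503 UNAVAILABLE response."""


def call_with_backoff(call_model, models, max_attempts=3,
                      base_delay=1.0, max_delay=30.0, sleep=time.sleep):
    """Try each model in order, retrying 503s with exponential backoff.

    call_model: hypothetical callable that takes a model name and either
                returns a response or raises ServiceUnavailable.
    models:     model names in order of preference (primary first).
    """
    if not models:
        raise ValueError("at least one model is required")

    last_error = None
    for model in models:
        for attempt in range(max_attempts):
            try:
                return call_model(model)
            except ServiceUnavailable as err:
                last_error = err
                if attempt == max_attempts - 1:
                    break  # repeated 503s: fail over to the next model
                # Exponential backoff capped at max_delay, with random
                # jitter so many clients do not retry at the same instant.
                delay = min(max_delay, base_delay * 2 ** attempt)
                sleep(random.uniform(0, delay))
    # Every model kept returning 503; surface the last error to the caller.
    raise last_error
```

A call might look like `call_with_backoff(send_request, ["primary-model", "fallback-model"])`, where `send_request` and both model names are placeholders. The jitter is a design choice: spreading retries out randomly avoids adding synchronized load while the service is already saturated.
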
Example error message
{
  "error": {
    "code": 503,
    "message": "The model is currently experiencing high demand. Spikes in demand are usually temporary.",
    "status": "UNAVAILABLE"
  }
}
Frequently asked
Is a 503 my fault?
No. It is a temporary capacity issue on the provider's side; a retry or a fallback to another model usually succeeds within seconds.