What is fallback?
When a model fails (provider outage, rate limit, bad request), Manifest retries with a backup model from the same tier. Your agent gets a response instead of an error.
How it works
Manifest selects a backup
Manifest picks the next fallback model from the tier’s fallback list. Fallback models are tried in the order you configure them.
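
The ordered walk through the fallback list can be pictured with a minimal sketch. This is not Manifest's implementation: `first_working_model` and the `attempt` callable (which returns `True` when a model answers successfully) are hypothetical names used only for illustration, as are the model names in the usage note.

```python
from typing import Callable, Optional, Sequence, Tuple


def first_working_model(
    primary: str,
    fallbacks: Sequence[str],
    attempt: Callable[[str], bool],
) -> Optional[Tuple[str, Optional[int]]]:
    """Return (model, fallback_index) for the first model that succeeds.

    fallback_index is None when the primary model answered, and 0 for the
    first fallback, 1 for the second, and so on. Returns None when every
    model in the chain failed.
    """
    # `attempt` is a hypothetical stand-in for sending the request.
    if attempt(primary):
        return primary, None
    # Fallbacks are tried strictly in the configured order.
    for index, model in enumerate(fallbacks):
        if attempt(model):
            return model, index
    return None
```

With `model-a` as the primary and `model-a` and `model-b` both failing, `first_working_model("model-a", ["model-b", "model-c"], attempt)` would return `("model-c", 1)`, since `model-c` is the second entry in the fallback list.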
What triggers a fallback
Any HTTP status code of 400 or higher triggers a fallback, with one exception: 424 (Failed Dependency). Manifest itself returns 424 when the entire chain is exhausted, so excluding it prevents infinite loops. Statuses that trigger a fallback include:

| Status | Example |
|---|---|
| 400 | Bad request |
| 401 | Authentication error |
| 403 | Forbidden |
| 429 | Rate limited |
| 500 | Internal server error |
| 502 | Bad gateway |
| 503 | Service unavailable |
| 529 | Provider overloaded |
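
The trigger rule above reduces to a one-line predicate. The sketch below only restates the documented rule; the function name `triggers_fallback` is hypothetical and is not part of any Manifest API.

```python
def triggers_fallback(status: int) -> bool:
    """Return True if an HTTP status would trigger a fallback.

    Any status of 400 or higher qualifies, except 424, which is the
    status returned once the whole fallback chain is exhausted.
    """
    return status >= 400 and status != 424
```

Under this rule `triggers_fallback(429)` and `triggers_fallback(529)` are true, while `triggers_fallback(424)` and `triggers_fallback(200)` are false.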
Configuration
Fallback models are configured per tier in the Manifest dashboard. Each tier can have up to 5 fallback models, tried in order.
- Cloud
- Local
Open the dashboard
Go to app.manifest.build and navigate to Routing.
Response headers
When a fallback succeeds, the response includes the standard routing headers plus two extra ones, X-Manifest-Fallback-From and X-Manifest-Fallback-Index:

| Header | Description |
|---|---|
| X-Manifest-Tier | The routing tier |
| X-Manifest-Model | The model that served the response (the fallback model, not the original) |
| X-Manifest-Provider | The provider that handled the request |
| X-Manifest-Confidence | Routing confidence score |
| X-Manifest-Reason | Why this tier was selected |
| X-Manifest-Fallback-From | The primary model that was attempted first |
| X-Manifest-Fallback-Index | Position in the fallback chain (0 = first fallback, 1 = second, etc.) |

When every model in the chain fails, the response includes:

| Header | Description |
|---|---|
| X-Manifest-Fallback-Exhausted | Set to true when all models failed |
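
On the client side, these headers can be folded into a small summary for logging. The sketch below makes several assumptions: header values arrive as strings, X-Manifest-Fallback-Index is a decimal integer, and X-Manifest-Fallback-Exhausted carries the literal text `true`. The function name `summarize_fallback` and the model names in the usage note are hypothetical.

```python
from typing import Any, Dict, Mapping


def summarize_fallback(headers: Mapping[str, str]) -> Dict[str, Any]:
    """Extract fallback details from a response's headers."""
    # HTTP header names are case-insensitive, so normalize before lookup.
    lowered = {name.lower(): value for name, value in headers.items()}
    index = lowered.get("x-manifest-fallback-index")
    exhausted = lowered.get("x-manifest-fallback-exhausted", "")
    return {
        "served_by": lowered.get("x-manifest-model"),
        "fallback_from": lowered.get("x-manifest-fallback-from"),
        # Assumption: the index is sent as a decimal integer string.
        "fallback_index": int(index) if index is not None else None,
        # Assumption: the header value is the literal text "true".
        "exhausted": exhausted.strip().lower() == "true",
    }
```

For a response served by the first fallback, for example one with X-Manifest-Model set to `model-b`, X-Manifest-Fallback-From set to `model-a`, and X-Manifest-Fallback-Index set to `0`, the summary would report `fallback_index` as `0` and `exhausted` as `False`.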
Fallback vs routing
| | Routing | Fallback |
|---|---|---|
| When | Before the request is sent | After the request fails |
| Goal | Pick the cheapest capable model | Recover from a failure |
| Speed | < 2 ms scoring | Adds one extra round-trip per retry |
| Tier | Assigns a tier | Stays within the same tier |