Blog: LLM Routing, AI Agents & Cost Control

Diagram of the Manifest gateway: a request fails with a 400 temperature out of range error, Auto-fix applies a patch and the retried request returns 200 OK

Jul 27, 2026

Your failing requests now fix themselves

Auto-fix is live on Manifest Cloud. When a provider rejects a request with a fixable error, Manifest patches it and sends it again. Here's what shipped.

Manifest Cloud pricing plans: Free, Pro, and Enterprise

Jul 8, 2026

Introducing paid plans for Manifest Cloud

Manifest Cloud now has a Pro plan at $19/month. The Free plan stays free with all features included. Here is what changes and what stays the same.

The reliability stack for LLM agents: where each tool and method fits

Jul 2, 2026

The reliability stack for LLM agents: tools and methods

A request can fail before you send it, while it runs, or after it returns. This is a directory of tools and methods grouped by what each one does.

Jul 1, 2026

This will get you banned from your ChatGPT subscription

A ChatGPT subscription is cheap inference, but strictly personal. Sharing, automating, serving other users, or reselling access can get you banned.

Tibo Sottiaux (@thsottiaux) on X: about 5% of production traffic is on the Pi harness, another 5% on OpenCode, and you can use your ChatGPT account in a flourishing set of other tools

Jul 1, 2026

ChatGPT Plus: Enjoy $200 of Tokens for $20 While It Lasts

OpenAI lets you spend your $20 ChatGPT Plus subscription inside third-party harnesses like OpenCode and Pi — worth ~$200 in tokens. Here's why it may not last.

Diagram of an agentic optimization loop: contract, target, eval, and state files feeding a plan-generate-score cycle that runs until the budget is spent

Jun 30, 2026

I Stopped Prompting My Agent. Now I Design the Loop That Prompts It.

How a Python script, four files, and a strict eval turned a personal AI agent from something I operated into something I supervise.

Jun 30, 2026

The errors that actually break LLM agents in production

Your agent ran clean in the demo. Friday it returns 400s. Here are the six plumbing failures we see most in Manifest's logs.

Manifest routing tiers in the dashboard: Standard, Complex and Reasoning, each set to a model with configurable fallbacks

Jun 22, 2026

We are deprecating our rule-based routing

Manifest is deprecating rule-based complexity and specificity routing on September 1, 2026. Why, and how to switch to the default or custom routing tiers.

The modelparams.dev homepage — an open, community-maintained catalog of every LLM parameter, for every model

Jun 16, 2026

Announcing modelparams.dev, the open-source model parameter database

modelparams.dev is a free, open-source database of parameters for popular AI models. Browse the UI, fetch the API, or get type-safe configs in TypeScript.

Jun 12, 2026

Fable 5 Doubled in Price, Then Vanished in Three Days. Don't Hardwire Your Agents to One Model.

Claude Fable 5 launched as Anthropic's most expensive model, then had its access suspended within three days. The real lesson: don't depend on a single model. Route.

Jun 12, 2026

Run Claude Code on your ChatGPT Plus subscription

Use Manifest to route Claude Code requests through your existing ChatGPT Plus subscription instead of paying for API keys on top.

Jun 3, 2026

The Ultimate Guide to AI Subscription Plans

Connect Claude Max, ChatGPT, Copilot, Gemini, Grok and more to your AI agent through Manifest — use the plans you already pay for instead of per-request fees.

AI inference is expensive — 10 ways to cut your LLM inference bill without hurting output quality

May 19, 2026

10 Ways To Reduce Your LLM API Costs

Your AI app is live and the inference bill is eating your margins. Here are 10 practical ways to cut LLM costs without hurting output quality.

May 11, 2026

Manifest Weekly #09

Claude Code support via Anthropic endpoint, multiple API keys per provider, and a stack of routing fixes.

Abstract illustration of AI request routing: luminous threads branching from a single origin into four distinct paths on a deep navy background

May 5, 2026

MyTrainer Cuts AI Inference Costs With Manifest

A practical case study on using Manifest to route coaching chat, deterministic generation, and asynchronous planning workflows in MyTrainer.

May 4, 2026

Manifest Weekly #08

Manifest Weekly #08: see exactly how much routing saves you in the dashboard, one-command Hermes setup, and a stack of routing and streaming reliability fixes.

Apr 27, 2026

Manifest Weekly #07

LM Studio and llama.cpp as built-in providers, custom header routing tiers, simplified setup, and OpenAI Responses API.

Apr 20, 2026

Manifest Weekly #06

Manifest Weekly #06: message-level feedback for routing decisions, OpenCode Go subscription provider support, and a pile of routing reliability fixes.

Apr 13, 2026

Manifest Weekly #05

Manifest now works with any AI agent, routing by task type, two new subscription providers, and Docker-only self-hosting.

Apr 6, 2026

Manifest Weekly #04

Start routing through free models in one click, and Anthropic OAuth finally working with Sonnet and Opus.

Apr 4, 2026

Anthropic Dropped OpenClaw From Max — What To Do

Claude Max subscriptions no longer cover OpenClaw usage — every agent message is billed separately. Here are your options and how to route around the change.

Mar 31, 2026

Manifest Weekly #03

New cloud onboarding for OpenClaw 3.22-beta, Anthropic OAuth fix for Sonnet/Opus, 100% model pricing coverage, and encrypted API key storage.

Manifest Cloud dashboard with OpenClaw terminal showing routing in action

Mar 30, 2026

Set Up Manifest Cloud for your OpenClaw agent

Step-by-step guide to set up Manifest Cloud and start routing OpenClaw requests to the right model automatically.

Mar 25, 2026

Manifest Weekly #02

Copilot & MiniMax subscription providers, smarter model registry, routing scorer improvements, and a pile of bugfixes.

Mar 21, 2026

Awesome Free LLM APIs

All the LLM APIs with permanent free tiers we could find, updated March 2026. Star the repo or open a PR to help keep it current.

Diagram showing an LLM Router directing requests from OpenClaw to different AI models including Anthropic, DeepSeek, OpenAI, and Mistral

Mar 21, 2026

What is an LLM Router?

An LLM router sends each prompt to a different model based on the task. Here's how rule-based and AI routing compare for OpenClaw users.

A lobster with glasses reading books at a desk, representing an expert OpenClaw agent

Mar 18, 2026

Fix OpenClaw Agents That Won't Stay Autonomous

OpenClaw automations that ran for hours then quietly stopped? Here are the skill files, cron jobs, and memory rules that fix it.

Mar 16, 2026

Manifest Weekly #01

Anthropic subscription support, automatic model fallback, dynamic model discovery, and a ton of bugfixes.

Adam Smith illustration representing hyper specialization in the AI era

Mar 11, 2026

Hyper Specialization: Stockfish, Smith & the AI Era

It's a mistake to think we'll stay relevant by orchestrating AIs. Our salvation may be the opposite: hyper specialization.

Mar 2, 2026

How to Stop Burning Money on OpenClaw

After speaking with over a hundred OpenClaw users, cost is the topic that comes up in almost every conversation. Here are the strategies that actually work.

Feb 27, 2026

How To Shut Up OpenClaw CLI Banner 🦞

Tired of OpenClaw CLI's banner jokes on every command? Here's the one-line config change that disables them, plus the env var for CI environments.

Feb 25, 2026

The Claw Market Map, Q1 2026

The emerging ecosystem built around OpenClaw — key players across managed hosting, LLM routing, security, developer tools, marketplaces and more.

Manifest dashboard showing cost savings across LLM providers

Feb 22, 2026

Introducing Manifest

Introducing Manifest: an open-source LLM router that sends every request to the cheapest capable model and saves AI-native teams up to 70% on token costs.
