Pay-per-call vs subscriptions: pricing for AI-agent customers

When you decide how to charge for your API, you're really deciding which buyer you're designing for. Subscriptions and monthly quotas were built for a human who can forecast usage, justify a budget line, and sign off on a plan. Autonomous agents don't do any of that. They show up with a task and a budget, decide call-by-call whether a result is worth its price, and move on.

This post is about the pricing model — not the protocol mechanics (see x402 vs API keys for that), and not how to set the actual number (see how to price an API for AI agents). It's about why pay-per-call tends to fit agent customers better, and the cases where a subscription still wins.

The two models, side by side

	Subscription / quota	Pay-per-call
Designed for	Humans who plan ahead	Agents deciding in the moment
Commitment	Monthly plan, often annual	None — pay per request
First call requires	Signup, plan selection, often a card	Nothing but the payment in the request
Unused capacity	Customer overpays (or you cap them)	No waste — pay for what's called
Price change	A renegotiation / migration	A config change
Best when	Usage is steady and known	Usage is spiky, sporadic, or unknown

Why per-call fits agents: the friction is the product

Every step you put between "an agent wants your result" and "the agent has it" is a place the agent gives up and calls someone else. A subscription front-loads that friction: create an account, choose a tier, enter a card, wait for approval. For a human evaluating a tool over a week, that's fine. For an agent resolving a task in seconds, it's a wall.

Pay-per-call removes the wall. The payment is the request. There's no plan to choose and nothing to provision — which is exactly why it suits callers you'll never meet. In behavioral terms, you've collapsed the activation energy to near zero: the agent that wants the result can get it now.

The mental-accounting problem with quotas

Subscriptions also create a budgeting headache for the operator running the agent. A monthly quota forces them to predict demand they can't predict — buy too much and they overpay for idle capacity; buy too little and the agent hits a wall mid-task. "You have 73 calls left this month" is a constraint a person manages; it's a failure mode for an autonomous system.

Per-call pricing maps cost directly to use. The operator reasons about a single, legible number — "this call costs a cent" — instead of amortizing a plan across unknown volume. That legibility is worth more to an agent buyer than any volume discount, because it makes the agent's own cost decisions simple.

What you give up — and the cases where subscriptions win

Being honest about the trade-offs is the whole point; a pricing page that pretends one model dominates everywhere isn't trustworthy. Per-call gives up two real things:

Predictable recurring revenue. Subscriptions smooth your revenue and improve retention by inertia. Per-call revenue tracks usage, which is spikier.
Deep, committed relationships. A signed annual contract with a known customer is a stronger commercial relationship than a stream of anonymous calls.

So subscriptions still win when:

Your customers are known businesses who want a predictable monthly line item and a contract.
Usage is steady and forecastable, so a quota isn't a guessing game.
You're serving humans, not agents — people are comfortable with plans and value the commitment.

The good news is it's not exclusive. Pay-per-call sits alongside your existing plans: keep subscriptions for your human, contracted customers, and add per-call for the autonomous callers who would never have signed up anyway. You're not migrating a model — you're covering a buyer you couldn't serve before.

Start where the risk is lowest

Whichever way you lean, the lowest-risk way to find out is to try per-call on one route. Test mode runs the whole challenge → pay → verify → allow loop for free, so you can see how agents actually behave against your endpoint before committing — and because there's no plan to unwind, turning a route on or off is reversible. That removes most of the regret from the decision: you're not signing up for a billing platform, you're metering one endpoint and watching what happens.

FAQ

Should I charge AI agents a subscription or per call? Per call, in almost every case. An autonomous agent has no account, can't reason about "remaining quota," and won't sign up or wait for a key — so a subscription is friction it can't clear. Per-request pricing, quoted in a signed 402 at the moment of the call, lets an agent you've never met pay and get a result immediately. Keep subscriptions for known human integrations that want predictable monthly billing.

Is there such a thing as a "user-agent API subscription plan"? Rarely, and it usually misses the point. Some APIs try to bolt agent traffic onto a human subscription tier, but an agent can't manage a plan or a quota the way a person does. The model that actually fits agent (user-agent) traffic is per-call payment; if you want recurring revenue from a known agent operator, a subscription for their account plus per-call x402 for overflow is cleaner than forcing agents into a plan.

Can I run both a subscription and pay-per-call on the same API? Yes — and many APIs do. Subscriptions sit in front of your existing keyed routes for known developers; per-request x402 sits in front of the same API for agentic traffic. See how to price an API for AI agents for setting the per-call number.

Next steps

Set the number: how to price an API for AI agents
The model end to end: monetize your API for AI agents
Protocol vs API keys: x402 vs API keys
Compare every option: the comparison hub
See the cost: pricing — free to start, no card, full test mode