Fair pricing that scales
Start every customer on free + PAYG — they pay only when they get value. When usage stabilizes, offer committed plans. No forced migrations, no sticker shock.
Your customers' usage is unpredictable. Your inference costs are real. Flat-rate pricing either bleeds your margins or drives your best prospects away. UsageTap gives you the billing infrastructure to charge for value delivered — with guardrails that protect both sides.
No credit card required. Then $0.001 per call.
Start every customer on free + PAYG — they pay only when they get value. When usage stabilizes, offer committed plans. No forced migrations, no sticker shock.
Built-in pricing plans, customer-facing usage widgets, Stripe sync, and entitlement gating. You get what takes 3–6 months to build internally.
Quotas, rate limits, anomaly detection, and forecast alerts. Your customers see their spend before the invoice. You see problems before they become churn.
4-step integration
Add begin, check allowed, your LLM call, and update metrics. That's it. Use the SDK hook or plain HTTP, whichever fits your stack.
Supports LLM calls, tokens, rate-limiting, API calls, searches, audio/video minutes, and custom meter types.
AI prompt kit / reference →Use the hook to enforce premium entitlements and report usage in one flow.
const usageTap = useUsageTap();
const begin = await usageTap.begin({
customerId,
requested: { premium: true },
});
if (!begin.allowed.premium) {
throw new Error("Premium calls exhausted for this customer");
}
const model = "openai/<model>";
const response = await openai.responses.create({ model, input });
// Model name is illustrative; UsageTap supports multiple providers via your configured AI stack.
await usageTap.end({
callId: begin.callId,
model,
inputTokens: response.usage?.input_tokens ?? 0,
responseTokens: response.usage?.output_tokens ?? 0,
});
Every AI feature costs real money per call. UsageTap lets you offer generous free tiers with hard limits, graduate customers to pay-as-you-go, and introduce committed plans when usage stabilizes — all driven by policy, not code changes.
01
20 AI actions/month with hard limits. Zero risk to you, zero cost to them.
02
Customer pays per action after free allowance. No signup friction, no sticker shock.
03
UsageTap detects predictable patterns and can recommend a committed plan.
04
Lower effective rate with included allowance + PAYG overage. Predictable for both sides.
Building usage-based billing from scratch takes months. UsageTap ships it out of the box.
| Capability | DIY | UsageTap |
|---|---|---|
| Usage metering | Custom tables & aggregation | 4 SDK calls |
| Plan & tier logic | Custom upgrade/downgrade flows | Dashboard configuration |
| Stripe integration | Webhooks & reconciliation | Automatic sync |
| Customer usage UI | Build from scratch | Drop-in React widgets |
| Rate limiting & overages | Custom middleware | Policy-driven controls |
| Anomaly detection | Statistical pipeline | Built-in alerts |
| Timeline | 3–6 months | Days |

Risk management
Daily and hourly anomaly alerts flag usage that diverges from expected trends. Weekly forecast emails warn customers before they hit limits, with one-click CTAs to upgrade or top up credit.
Notify your ops team when usage spikes or dips beyond forecasted thresholds.
Help customers adjust behavior and plans before limits are reached.


Plan- or customer-level caps per call, token, and capability—no custom code per tier.
Block, throttle, or bill overage. Switch policies without engineering sprints.
Drop‑in React components for usage vs allowance, forecast charts, and upgrade prompts.
Forecast future spend in dollars and API calls—act before overages become churn.
Detect actual usage diverging from forecasted expectations using statistical methods.
Stream calls, tokens, and cost to any OTLP endpoint—Datadog, Grafana, New Relic.
Free tool
Upload your OpenAI or Anthropic billing CSV and get an instant forecast with anomaly detection. No signup required.
As your customers grow
AI-powered products don't behave like traditional software. One customer may send a few prompts while another runs thousands of calls overnight. Adaptive Pricing lets you start with PAYG and introduce commitment when usage becomes predictable.
Let customers begin with free + PAYG so they only pay when they get value. No lock-in to one monetization path.
Move stable accounts into committed plans with included allowance and PAYG overage. No forced blanket upgrades.
Committed tiers work as a savings option. Customers who prefer flexible PAYG stay there.
Automatic, manual, or no migration at all. You choose the policy that fits your GTM.
Seat-based pricing worked when marginal costs were near zero. AI changed that. We're working with early adopters to build the infrastructure that replaces it — fair usage pricing with real-time controls. Shape the product with us.