Docs — Burn Limit

Welcome

Hi. If you're reading this, you probably noticed your AI bill is going up and you can't quite explain why. Maybe a teammate ran an agent in a loop overnight. Maybe Claude Opus is being called when Haiku would have done fine. Maybe a single client project ate more than the others and you can't prove it.

Burn Limit is built for exactly that. We're a small product, not a platform. Read this whole page once and you'll know everything we do — and everything we don't.

Why we built this

We were spending hundreds of dollars a month on Claude and OpenAI as solo developers, and the only way to know how was to log into three different dashboards and math it ourselves. None of them told us "you spent $4.20 on this side project today," or "Sonnet would have done that for 60% less," or "this client owes you $312."

The big observability tools (Datadog, Langfuse, Helicone) are built for ops teams at Series B+ companies. Brilliant tools, but overkill if you're one person trying to figure out whether the AI side of your work is profitable.

Burn Limit is the small, opinionated, slightly retro tool we wanted for ourselves.

Quick start

You can be up and running in about 90 seconds.

Sign up

Create a free account. No credit card. You get 7 days of history, manual log upload, burn receipts, the break-even calculator, and 2 flex cards per month.

Pick how you want to track

Three options — manual upload, VS Code extension, or local proxy. We cover all three in the next section. If you're not sure, start with manual upload to see what we look like, then add the extension once you're convinced.

Look at your dashboard

The dashboard shows your spend across providers, your top models, and your month-over-month trend. The Burn Receipt shows a single clean summary you can save or share.

Set a personal budget (optional)

Under Settings, set a monthly dollar cap. We'll warn you when you're trending past it. Free users get this too.

Three ways to track your spend

We give you three on-ramps because every developer's setup is different.

1. Manual .jsonl upload (Free)

Go to OpenAI or Anthropic, export your usage logs as JSONL, drag the file into Burn Limit. We parse it, calculate the cost ourselves using current model pricing, and you see results immediately.

We re-compute cost server-side from the token counts — we don't trust the "cost" field in your upload, because pricing changes weekly and provider exports lag.

2. VS Code extension (Dev plan and up)

Install the BURN Limit VS Code extension. Log in through the browser handshake (we use a one-tap deep link, no copy-pasting tokens). The extension shows your current burn in the status bar in real time and quietly syncs every API call.

This is the path most users settle on. It's frictionless once it's installed.

3. Local cost proxy on localhost:8787 (Dev plan and up)

For tooling that doesn't live in VS Code (a custom Node script, a Python agent, a CLI), we run a tiny local proxy on localhost:8787. You point your client at it instead of the real API endpoint; it forwards the call, measures the cost, and logs it to Burn Limit before returning the response.

This is also what powers "Should I?" mode — see the Concepts section.

4. Admin import (Agency and up)

If you have an OpenAI admin key or an Anthropic admin key, you can pull historical usage for your entire org in one go. We rate-limit this to 5 imports per hour per user, and we cap the date range at 92 days per call.

Providers we support

Pricing tables are kept up to date in-app under the Models page. Today we cover:

Anthropic — Claude Opus 4.7, Sonnet 4.6, Haiku 4.5, and older 3.x families
OpenAI — GPT-4o, GPT-4o-mini, o1, o3, and the rest of the GPT family
Google — Gemini 2.5 Pro, Gemini 2.0 Flash
Kimi (Moonshot)
Cursor — via extension, for tracking Cursor's built-in usage

Free plan is capped to one provider at a time. Dev plan and above is unlimited.

Concepts you'll hear us mention

Burn Receipt

A single-page summary of your AI spend for any period — model breakdown, top sessions, trend vs the previous period. Designed to be saved or sent to a CFO who doesn't want a dashboard, they want a receipt.

Flex Card

A shareable image of your spend stats. Think Spotify Wrapped, but for AI burn. Free plan gets 2 per month, Dev plan gets unlimited. Yes, it's vanity. We like vanity that travels.

Break-even calculator

You charge a client $X per project. You're burning $Y on AI to deliver it. At what point do you stop making money? This tool answers that, and it's on every plan including Free.

"Should I?" mode

When you have the local proxy installed, you can ask it "should I run this prompt?" before you actually run it. We estimate the cost from your prompt size + model choice and either green-light it or suggest a cheaper model. Dev plan and up.

Efficiency Score

A grade (A+ to F) showing how well-matched your model choice is to your task. If you're using Opus for one-line code completions, your score will tell you so, and so will the Efficiency Advisor on the Team plan.

Spike Alert

A runaway-agent guard. If a team member's last-hour spend is 3× their 30-day hourly average and over $0.50, we email the team owner within minutes. Team plan only. The cron runs every hour.

Pass-through Invoice

If you're an agency, you can mark up the AI cost (say, 25%) and generate an invoice for your client that itemizes the work and shows the markup transparently. Each invoice gets a public share link at /i/<token>. Clients don't need a login.

Client Portal

A read-only dashboard at /client/<slug>that shows your client the live compute spend for their project, with your agency branding (logo, colors) on top. Agency plan.

The plans, plainly

Here's what each tier actually gets you. No tier exists to push you to the next one.

Free

1 AI provider at a time
7 days of history
Dashboard, Burn Receipts, Break-even calculator
Manual log upload (.jsonl)
2 Flex Cards per month
Personal monthly budget cap

Devmost picked

$19/mo

Everything in Free
All five providers, simultaneously
Unlimited history
VS Code extension with live status bar
Local cost proxy on localhost:8787
"Should I?" pre-spend check
Efficiency Score + historical trends + forecasting
Unlimited Flex Cards

Agency

$79/mo

Everything in Dev
Client & project tagging
Per-client spend breakdown
Pass-through invoices with markup
Public invoice share links
White-label receipts + PDF export
CSV export for FreshBooks, Bonsai, Wave
Agency branding (logo, primary/accent colors)
OpenAI & Anthropic admin import

Team

$199/mo

Everything in Agency
Shared team dashboard, per-member breakdown
Team budget with email alerts at 80% and 100%
Spend forecasting (days until breach)
Seat utilization audit
Spike alerts (runaway agent detection, hourly)
Efficiency Advisor (per-model downgrade savings)
Monthly digest email
Priority support

Billed monthly through Lemon Squeezy. Cancel anytime — no annual lock-in.

For agencies billing clients

If you're shipping AI features to clients and trying to make a margin on it, this is the section to read carefully.

Create a Project per client engagement. Tag it with the client name and a budget.
Sessions tagged to that project roll up automatically. You'll see exactly how much AI cost each project consumed.
When the engagement is done, generate a Pass-through Invoice. Set a markup percentage (we default to 0% — you decide). We produce an invoice with line items, totals, and the markup made transparent.
Share the invoice with a public link, or export it as PDF / CSV for your accounting software.

For ongoing clients, give them a Client Portal link with your branding. They see live spend, not stale monthly PDFs. It builds trust and saves you a status meeting.

For teams managing developers

Team plan is for the lead who got blindsided by a $4,000 OpenAI bill in March and doesn't want a repeat.

Team dashboard — every member's spend in one view, sortable by cost or model.
Team budget alerts — set a monthly cap. We email you at 80% and again at 100%. The check runs daily at 9am UTC.
Spike alerts — if any member burns 3× their normal hourly rate (and at least $0.50), we email within the hour. This is the runaway agent guard.
Efficiency Advisor — tells you, per developer, what they'd save by downgrading specific calls to a cheaper model. Concrete numbers, not general advice.
Monthly digest email — on the 1st of each month, the team owner gets a one-page summary: total, vs last month, top spenders, top models, budget utilization.

Add team members by email under /team. The invitee gets an email telling them the team owner can now see their AI usage data. They can leave the team from their settings any time — we're not in the business of surveillance.

Privacy & what we keep

The truthful version, not the legal version (that one's on the Privacy page).

We store your token counts, model names, timestamps, and costs. That's what powers your dashboard.
We do not store your prompts or completions. If you're using the local proxy, content stays on your machine — only the metadata leaves.
We do not sell or share your data. Ever. There's no "industry benchmarks" product on our roadmap that requires aggregating across customers.
Your API tokens (OpenAI / Anthropic admin keys) are used at request time and not persisted after the import job completes.
Account deletion is real. Click "Delete Account" in Settings and we cascade-delete everything: usage records, projects, invoices, branding, audit logs, and your auth identity. There's no soft-delete or 30-day grace period.

What we don't do (yet)

Some honesty about gaps. We'd rather you know now than discover them on day three.

No hard spend caps that block API calls. We alert you, but we don't sit in front of your providers and refuse requests. Adding a hard shutoff to the local proxy is on the list.
No Slack or webhook integrations. Alerts are email-only today.
No SAML / SSO. Login is email + password or Google. If you need SAML, write to us — we'll prioritize it if there's real demand.
No mobile app. The web app is mobile-responsive, but there's no iOS / Android wrapper.
No on-prem. We're a hosted SaaS only.

Rate limits

So you don't hit them by accident:

API (everything): 100 requests per minute per IP
File upload: 10 per hour per user, 10 MB max, .jsonl only
Admin import: 5 per hour per user, 92-day window per call
Extension sync: 30 per hour per user
Proxy log: 1,000 per minute per user

If your real workflow needs more, email us — we can lift limits per-account.

A note from the founder

If you got this far, thank you. Genuinely.

Burn Limit is a small bet that a lot of developers and small teams want a tool that's honest about cost without acting like an enterprise observability platform. We're not trying to be everything. We're trying to be the one tab you open every Monday to see what last week cost you.

If we get something wrong, tell us. If a metric on your dashboard looks suspicious, tell us — we'd rather fix the math than pretend it's fine. If a feature you need isn't here, tell us that too. The roadmap is short on purpose; the next thing we build is usually the next thing a user asked for.

Thanks for being early. We won't waste it.

— The Burn Limit team

Ready to stop guessing?

Free forever for what most people need. No credit card to start.

THE DOCS