Coming Soon

See what AI costs.
Then cut it.

Every token counts.

ScroogeLLM is a VS Code extension that sits between you and your LLM provider. It shows you exactly what every call costs — then optimizes it automatically.

LLM costs add up fast

Most developers have no idea what each AI call actually costs. ScroogeLLM makes every dollar visible — then recovers what you can.

61% saved
Average daily savings for a solo developer

Everything to make LLMs affordable

From real-time cost tracking to automatic optimization — all running locally on your machine.

Real-time cost visibility

See exactly what every LLM call costs as it happens. Per-request, per-session, and cumulative totals right in your status bar.

Free

Prompt compression

Automatic context trimming removes redundant tokens before they hit the API. Same quality responses, fewer tokens billed.

Response caching

Identical or near-identical prompts return cached results instantly. Zero cost, zero latency on repeat queries.

Smart model routing

Not every task needs the most expensive model. ScroogeLLM suggests downgrades when a cheaper model can handle the request equally well.

PII anonymization

Deterministic fake names replace real PII before it reaches the provider. Same fake name for the same input, stable across your session.

Free

Savings audit trail

Every request logs raw cost vs. actual cost. Full audit trail you can inspect, export, and use to justify tooling budgets.

Free

OS keychain integration

API keys stored in your operating system's native keychain. Never plaintext, never transmitted. Your keys, your machine, period.

Free

Free tier always on

Core optimizations work without paying a cent. Visibility, PII protection, and audit logging are free forever. Paid features add deeper savings.

Free

A local proxy that earns its keep

ScroogeLLM runs a lightweight proxy on localhost. Your AI tools talk to the proxy. The proxy talks to the provider. In between, it does its work.

1

Install & configure

Install from the VS Code Marketplace. Point your LLM tools at localhost. Done.

2

Intercept & optimize

Every request flows through the proxy. Prompts are compressed, PII is scrubbed, and responses are cached automatically.

3

Track & save

See real-time costs in VS Code. Review your audit trail. Watch the savings accumulate, request by request.

Your code stays on your machine

ScroogeLLM is designed for developers who take data seriously. No cloud, no accounts, no telemetry.

Localhost only

The proxy binds to 127.0.0.1 by default. No remote exposure without your explicit opt-in.

Native keychain

API keys live in macOS Keychain, Windows Credential Locker, or Linux Secret Service. Never in plaintext files.

Zero telemetry

No usage data, no analytics, no phone-home. We never see your prompts, your code, or your API keys.

Deterministic PII scrubbing

Real names, emails, and identifiers are replaced with stable fakes before data leaves your machine. Same input, same fake, every time.

Stop paying full price for AI

ScroogeLLM is coming to the VS Code Marketplace. Leave your email and be the first to know.