HOW IT WORKS
Wrap once.
Protected forever.
Pass your agent into halt() with a budget constraint. Halt instruments every LLM call transparently and throws a BudgetExceededError before a single cent over limit is charged.
01
Install
npm install halt — no peer dependencies, no config files.
02
Wrap
halt(agent, { budget }) — one function call wraps any agent.
03
Ship
Halt intercepts every LLM call and stops execution at the limit.
FEATURES
Built for production.
⬡
Hard stop
Throws BudgetExceededError before the over-limit call is made. Not after. Zero overage.
◎
Zero config
No dashboard, no API key, no webhook. npm install and one line of code.
⟲
Transparent
Wraps any OpenAI-compatible client. Your agent code doesn't change.
⌥
Framework-agnostic
Works with LangChain, AutoGen, custom agents, or raw SDK calls.
⧖
< 1ms overhead
Budget checks are in-process. No network round-trip, no latency cost.
⊛
Per-call tracking
Token counts, model prices, and cumulative spend logged on every call.
USE CASES