Why You Keep Hitting Token Limits (And How to Fix It)
Understanding context windows, token waste patterns, and how to scan your codebase for costly LLM calls that silently drain your AI budget.
8 min read · token-limits · cost-optimization · context-windows
Long context windows from Claude and GPT-4 seem like a superpower — until you see the bill. Learn why context length is the biggest hidden cost driver in AI applications.
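To see why context length dominates the bill, consider a rough per-call cost model: input (context) tokens are billed on every call, so a large context swamps the cost of even a generous reply. The prices below are illustrative assumptions, not current vendor rates.

```python
# Rough cost model for a single chat completion call.
# Prices are illustrative assumptions, not actual vendor pricing.
PRICE_PER_1K_INPUT = 0.01   # USD per 1K input (context) tokens, assumed
PRICE_PER_1K_OUTPUT = 0.03  # USD per 1K output tokens, assumed

def call_cost(context_tokens: int, output_tokens: int) -> float:
    """Estimate USD cost of one call; the context term grows with every
    token you stuff into the prompt, on every single request."""
    return (context_tokens / 1000) * PRICE_PER_1K_INPUT \
         + (output_tokens / 1000) * PRICE_PER_1K_OUTPUT

# A 100K-token context with a short 500-token reply:
print(round(call_cost(100_000, 500), 3))  # → 1.015
```

At these assumed rates, the 500-token answer costs $0.015 while the context costs $1.00 — and that $1.00 is paid again on every follow-up call that resends the same context.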