LLM economics
OpenAI-compatible is not cost-compatible
A matching API surface tells you nothing about what a token actually costs to serve. Two endpoints can speak the same protocol and live in different economic universes.
Field notes
Short, sharp pieces on inference economics, the tools I build, and what music keeps teaching me about systems.
Calculators predict a best case you will rarely hit. A meter reads the truth off live telemetry. Why I built the meter instead of another spreadsheet.
What years of Hindustani classical practice taught me about systems: improvisation inside constraints, repetition without boredom, and staying with complexity.