Cloves Almeida’s Notes
Technical notes from building production AI decision systems.
Recent writing
-
Engineers Cannot Rely on Falling Token Prices
Frontier models are much more capable than they were a year ago, but the old assumption that token prices will keep falling is breaking. At scale, teams can no longer count on price cuts to do the inference- optimization work for them.