33 points | by syumei 13 hours ago ago
1 comments
50% cache savings is nice buuuuut.. token cost is the product now, not the model. Every coding agent is converging on the same trick, cache the context across calls. Anthropic's prompt caching, then Cursor's snapshots, now this Reddit thing.
50% cache savings is nice buuuuut.. token cost is the product now, not the model. Every coding agent is converging on the same trick, cache the context across calls. Anthropic's prompt caching, then Cursor's snapshots, now this Reddit thing.