We’re quickly coming to an inflection point for using AI to build stuff.

Until the near future, most companies encouraging AI-usage amongst their workforce have been worried mainly “that” they’re using it. Use more tokens. More token usage means our team uses it. We think that’s a good thing.

However, that will soon change because a developer all-in on AI now costs the company between $500 and $1500 extra per day in tokens.

Like all tool and resource costs, organizations will quickly have to start managing token budgets. Here’s a current list of token costs for the mainstream models:

ModelInput CostOutput CostCache Read Cost
glm0.62.20.11
kimi-k2.50.52.8
gemini-3-flash0.53.0
gpt-5.3-codex1.7514.00.175
gpt-5.42.515.00.25
claude-sonnet-4-53.015.00.3
claude-sonnet-4-63.015.00.3
gemini-3-1-pro2.012.00.2
claude-opus-4-65.025.00.5

Is Opus 4.6 2X better than GPT-5.4 for software?
What is safe for GLM use?
I’ve heard gemini is good at docs, can we use it for that?

Much like figuring out which person should do which task, we now have to do this with models based on token cost.

Figure out how to get AI working for you, or you’ll be working for AI.


Discover more from johnmaconline

Subscribe to get the latest posts sent to your email.

Pin It on Pinterest

Share This

Discover more from johnmaconline

Subscribe now to keep reading and get access to the full archive.

Continue reading