LLM Cost Per Query 2026:
What Does One AI Request Actually Cost?
Break down the true cost of a single AI query across all major models. See how costs scale from 1 request to 1 million, and calculate the break-even point where AI becomes expensive.
Cost Per Query: Full Table
Assuming a typical conversational query: 500 input tokens (your message + system prompt) and 500 output tokens (the AI's response). This equals ~375 words in + 375 words out.
| Model | Cost / Query | 1K queries/day | 10K queries/day | 100K queries/day |
|---|---|---|---|---|
| Gemini 2.0 Flash | $0.000075 | $2.25/mo | $22.50/mo | $225/mo |
| GPT-4o mini | $0.000375 | $11.25/mo | $112.50/mo | $1,125/mo |
| Claude Haiku 4.5 | $0.0006 | $18/mo | $180/mo | $1,800/mo |
| GPT-4o | $0.00625 | $187.50/mo | $1,875/mo | $18,750/mo |
| Claude Sonnet 4.6 | $0.009 | $270/mo | $2,700/mo | $27,000/mo |
| Claude Opus 4.6 | $0.052 | $1,560/mo | $15,600/mo | $156,000/mo |
| o3 | $0.025 | $750/mo | $7,500/mo | $75,000/mo |
How Query Size Affects Cost
The 500+500 token example above is for a simple chat message. Real-world query costs vary dramatically based on use case:
| Use Case | Typical Tokens | GPT-4o mini cost | GPT-4o cost |
|---|---|---|---|
| Simple FAQ answer | 200 in / 150 out | $0.000118 | $0.002 |
| Customer support chat turn | 800 in / 300 out | $0.0003 | $0.005 |
| Code explanation (file) | 3,000 in / 800 out | $0.00093 | $0.0155 |
| Document summarization (10 pages) | 8,000 in / 500 out | $0.0015 | $0.025 |
| Long document analysis (50 pages) | 40,000 in / 2,000 out | $0.0072 | $0.12 |
| Full codebase review (1K files) | 100,000 in / 5,000 out | $0.018 | $0.30 |
The AI Cost Ceiling: When Does AI Become Unsustainable?
AI becomes expensive when your cost-per-query exceeds the revenue or value generated per query. Key benchmarks:
- SaaS product ($10/month subscription): Can afford ~$0.05–0.10 per query (5–10% of revenue on inference)
- Free-tier product (ad-supported): Must keep cost below ~$0.001 per query to be viable
- Enterprise tool ($500/month): Can afford $2–5 per complex query
If your cost per query exceeds 10% of your revenue per user, you need to optimize or reprice.
How to Get Your Real Per-Query Cost
Log token counts for a representative sample of real requests. Calculate:
Most observability tools (LangSmith, Helicone, Braintrust) calculate this automatically. Set up monitoring before you scale.
Calculate Your Exact Per-Query Cost
Enter your model, token counts, and query volume to get your monthly estimate.
Open API Cost Calculator