Skip to content
Cost Analysis

LLM Cost Per Query 2026:
What Does One AI Request Actually Cost?

Break down the true cost of a single AI query across all major models. See how costs scale from 1 request to 1 million, and calculate the break-even point where AI becomes expensive.

10 min read·Updated March 2026
Cost Per Single Query (500 input + 500 output tokens)
$0.000075
Gemini 2.0 Flash
$0.000375
GPT-4o mini
$0.00625
GPT-4o
$0.052
Claude Opus 4.6

Cost Per Query: Full Table

Assuming a typical conversational query: 500 input tokens (your message + system prompt) and 500 output tokens (the AI's response). This equals ~375 words in + 375 words out.

ModelCost / Query1K queries/day10K queries/day100K queries/day
Gemini 2.0 Flash$0.000075$2.25/mo$22.50/mo$225/mo
GPT-4o mini$0.000375$11.25/mo$112.50/mo$1,125/mo
Claude Haiku 4.5$0.0006$18/mo$180/mo$1,800/mo
GPT-4o$0.00625$187.50/mo$1,875/mo$18,750/mo
Claude Sonnet 4.6$0.009$270/mo$2,700/mo$27,000/mo
Claude Opus 4.6$0.052$1,560/mo$15,600/mo$156,000/mo
o3$0.025$750/mo$7,500/mo$75,000/mo

How Query Size Affects Cost

The 500+500 token example above is for a simple chat message. Real-world query costs vary dramatically based on use case:

Use CaseTypical TokensGPT-4o mini costGPT-4o cost
Simple FAQ answer200 in / 150 out$0.000118$0.002
Customer support chat turn800 in / 300 out$0.0003$0.005
Code explanation (file)3,000 in / 800 out$0.00093$0.0155
Document summarization (10 pages)8,000 in / 500 out$0.0015$0.025
Long document analysis (50 pages)40,000 in / 2,000 out$0.0072$0.12
Full codebase review (1K files)100,000 in / 5,000 out$0.018$0.30

The AI Cost Ceiling: When Does AI Become Unsustainable?

AI becomes expensive when your cost-per-query exceeds the revenue or value generated per query. Key benchmarks:

  • SaaS product ($10/month subscription): Can afford ~$0.05–0.10 per query (5–10% of revenue on inference)
  • Free-tier product (ad-supported): Must keep cost below ~$0.001 per query to be viable
  • Enterprise tool ($500/month): Can afford $2–5 per complex query

If your cost per query exceeds 10% of your revenue per user, you need to optimize or reprice.

How to Get Your Real Per-Query Cost

Log token counts for a representative sample of real requests. Calculate:

avg_input_tokens = sum(input_tokens) / total_requests
avg_output_tokens = sum(output_tokens) / total_requests
cost_per_query =
(avg_input_tokens / 1_000_000) × input_price
+ (avg_output_tokens / 1_000_000) × output_price

Most observability tools (LangSmith, Helicone, Braintrust) calculate this automatically. Set up monitoring before you scale.

Calculate Your Exact Per-Query Cost

Enter your model, token counts, and query volume to get your monthly estimate.

Open API Cost Calculator