Skip to content
Cost Analysis

AI Chatbot Cost Per Message 2026:
What Each Conversation Actually Costs

Exact cost breakdown per chatbot message for GPT-4o, Claude, Gemini, and Llama. Includes conversation history costs, customer support scaling math, and when to switch models.

10 min read·Updated March 2026
Cost Per Chatbot Message
$0.006
GPT-4o per message
$0.00035
GPT-4o mini per message
$0.00008
Gemini Flash per message
$0.00005
Llama (Groq) per message

The Hidden Cost of Chat History

Every chatbot message costs more than just the current exchange — you're also paying for the entire conversation history resent with each turn:

  • Turn 1: 200 input + 150 output = 350 tokens
  • Turn 2: 550 input (history) + 150 output = 700 tokens
  • Turn 5: 1,700 input (history) + 150 output = 1,850 tokens
  • Turn 10: 3,200 input (history) + 150 output = 3,350 tokens

A 10-turn conversation = ~15,000 total tokens, not 7,000 as you might expect.

Cost Per Message by Model (Typical Chat Exchange)

Assuming: 200-token system prompt, 150-token user message, 200-token AI response, 5-turn average conversation.

ModelTurn 1Turn 5Full 10-turn convo1,000 convos/mo
GPT-4o$0.0013$0.010$0.094$94
GPT-4o mini$0.000075$0.00058$0.0056$5.60
Claude Haiku 3.5$0.00028$0.0022$0.021$21
Claude Sonnet 4.5$0.0015$0.012$0.112$112
Gemini 2.0 Flash$0.000035$0.00028$0.0026$2.60
Llama 3.1 8B (Groq)$0.000025$0.00019$0.0018$1.80

Customer Support Cost at Scale

For a company handling 50,000 support conversations per month (5 turns average):

  • GPT-4o: 50,000 × $0.047 = $2,350/month
  • GPT-4o mini: 50,000 × $0.0028 = $140/month
  • Gemini Flash: 50,000 × $0.0013 = $65/month
  • Human agent cost for comparison: 50,000 conversations × $2–5/conversation = $100,000–$250,000/month

Even the most expensive AI option (GPT-4o at $2,350/month) is 97% cheaper than human agents.

3 Ways to Reduce Chatbot Costs

1. Truncate Conversation History

Instead of sending the full conversation history, send only the last N turns. For most chatbots, 3–5 turns is sufficient context. This caps costs regardless of conversation length.

2. Summarize Long Conversations

After 5+ turns, replace the full history with a 100-token summary. Saves 80% on input tokens for long conversations while preserving context.

3. Route by Complexity

Use GPT-4o mini or Gemini Flash for simple questions (80% of traffic). Escalate to GPT-4o only for complex cases. This reduces costs by 80–90% while maintaining quality for hard cases.

Chatbot Cost Calculator Formula

Monthly chatbot cost = Number of conversations × Average turns × (Input tokens per turn × Input price + Output tokens per turn × Output price) / 1,000,000

Example: 10,000 conversations, 5 turns, 600 input / 200 output per turn, GPT-4o mini:

= 10,000 × 5 × (600 × $0.15 + 200 × $0.60) / 1,000,000 = $6.00/month

Calculate Your Chatbot Costs

Enter your conversation volume and get exact cost estimates across all models.

Chatbot Cost Calculator