PRICING DATA: MARCH 2026
API Price Check
Enter your token volume once, then compare exact provider pricing including cached-input discounts and long-context tradeoffs.
Select Models to Compare
Monthly Volume
0% cached80% cached
Monthly Cost Comparison
Gemini 3.1 FlashCheapest
$4.17Input: $1.05Cached: $0.12Output: $3.001M context
DeepSeek V4
$15.22Input: $3.85Cached: $0.42Output: $10.95128K context
Gemini 3.1 Pro
$69.39Input: $17.50Cached: $1.89Output: $50.002M context
Claude Sonnet 4.6
$100.50Output
Input: $21.00Cached: $4.50Output: $75.00200K context
GPT-5.4
$117.50Output
Input: $35.00Cached: $7.50Output: $75.00256K context
Claude Opus 4.6
$502.50Input
Output
Input: $105.00Cached: $22.50Output: $375.001M context
Potential savings: $498.33/month by using Gemini 3.1 Flash instead of Claude Opus 4.6. That is $5,980/year.
Per-Token Rate Card (per 1M tokens)
| Model | Input | Cached Input | Output | Context | Speed |
|---|---|---|---|---|---|
GPT-5.4 OpenAI | $5.00 | $2.50 | $15.00 | 256K | Medium |
Claude Opus 4.6 Anthropic | $15.00 | $7.50 | $75.00 | 1M | Slower |
Claude Sonnet 4.6 Anthropic | $3.00 | $1.50 | $15.00 | 200K | Fast |
Gemini 3.1 Pro Google | $2.50 | $0.63 | $10.00 | 2M | Medium |
Gemini 3.1 Flash Google | $0.15 | $0.04 | $0.60 | 1M | Fastest |
DeepSeek V4 DeepSeek | $0.55 | $0.14 | $2.19 | 128K | Fast |
Integrate Any AI API with ZBuild
ZBuild auto-configures API integrations for OpenAI, Anthropic, Google, and more. Ship AI-powered features without wrestling with SDKs.
Try ZBuild Free →