
Simple, Transparent Pricing

Pay per use. Auto-recharge. Full visibility.

GreatRouter charges per request based on the model used. Every price is listed with its billing unit (per token, image, second, character, or audio minute), with no hidden fees and no confusing line items.

How It Works

Pay-Per-Use

You are charged only for the requests you make. No minimums, no monthly commitments, no surprise fees.

Auto-Recharge

When your credit balance drops below a configurable threshold, your payment method is automatically charged a predefined amount. You control both the threshold and the recharge amount.
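
The threshold rule above can be sketched in a few lines. This is a minimal illustration, not GreatRouter's implementation: the function name and the choice to apply a single recharge per check are assumptions.

```python
from decimal import Decimal


def apply_auto_recharge(
    balance: Decimal, threshold: Decimal, recharge_amount: Decimal
) -> tuple[Decimal, Decimal]:
    """Return (new_balance, amount_charged) after one auto-recharge check.

    Hypothetical helper: assumes one recharge of the predefined amount
    whenever the balance is strictly below the threshold.
    """
    if recharge_amount <= 0:
        raise ValueError("recharge_amount must be positive")
    if balance < threshold:
        return balance + recharge_amount, recharge_amount
    return balance, Decimal("0")


# A $4.20 balance with a $5.00 threshold triggers a $20.00 recharge.
new_balance, charged = apply_auto_recharge(
    Decimal("4.20"), Decimal("5.00"), Decimal("20.00")
)
print(new_balance, charged)  # 24.20 20.00
```

Decimal is used instead of float so that currency amounts stay exact.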

Built-In Logging & Tracing

Every request is logged with full visibility: model used, token count, latency, cost, and request/response data. Access detailed traces from your dashboard.

Real-Time Cost Tracking

See your spending by model, provider, and time period — updated in real time.

Text Generation Pricing

Text models are billed per million tokens (input and output separately).

Model | Input | Output | Cached Input
GPT-5 (OpenAI) | $1.25 / 1M tokens | $10.00 / 1M tokens | $0.125 / 1M tokens
GPT-5.4 (OpenAI) | $2.50 / 1M tokens | $15.00 / 1M tokens | $0.25 / 1M tokens
GPT-5.4 Mini (OpenAI) | $0.40 / 1M tokens | $1.60 / 1M tokens
GPT-5.4 Nano (OpenAI) | $0.20 / 1M tokens | $0.80 / 1M tokens
GPT-5.4 Pro (OpenAI) | $30.00 / 1M tokens | $60.00 / 1M tokens
GPT-5.5 (OpenAI) | $5.00 / 1M tokens | $25.00 / 1M tokens
GPT-5.5 Pro (OpenAI) | $30.00 / 1M tokens | $150.00 / 1M tokens
o4 Mini (OpenAI) | $1.10 / 1M tokens | $4.40 / 1M tokens
Claude Sonnet 4 (Anthropic) | $3.00 / 1M tokens | $15.00 / 1M tokens | $0.30 / 1M tokens
Claude Sonnet 4.5 (Anthropic) | $3.00 / 1M tokens | $15.00 / 1M tokens | $0.30 / 1M tokens
Claude Sonnet 4.6 (Anthropic) | $3.00 / 1M tokens | $15.00 / 1M tokens | $0.30 / 1M tokens
Claude Haiku 4.5 (Anthropic) | $1.00 / 1M tokens | $5.00 / 1M tokens | $0.10 / 1M tokens
Claude Opus 4.6 (Anthropic) | $5.00 / 1M tokens | $25.00 / 1M tokens | $0.50 / 1M tokens
Claude Opus 4.7 (Anthropic) | $5.00 / 1M tokens | $25.00 / 1M tokens | $0.50 / 1M tokens
Claude Opus 4.8 (Anthropic) | $5.00 / 1M tokens | $25.00 / 1M tokens | $0.50 / 1M tokens
Gemini 2.5 Flash (Google) | $0.30 / 1M tokens | $2.50 / 1M tokens | $0.075 / 1M tokens
Gemini 2.5 Flash Lite (Google) | $0.10 / 1M tokens | $0.40 / 1M tokens
Gemini 2.5 Pro (Google) | $1.25 / 1M tokens | $10.00 / 1M tokens | $0.3125 / 1M tokens
Gemini 3 Flash (Google) | $0.50 / 1M tokens | $3.00 / 1M tokens | $0.05 / 1M tokens
Gemini 3.1 Flash Lite (Google) | $0.25 / 1M tokens | $1.00 / 1M tokens
Gemini 3.1 Pro (Google) | $1.25 / 1M tokens | $5.00 / 1M tokens | $0.3125 / 1M tokens
Qwen3 Max (Alibaba) | $1.25 / 1M tokens | $5.00 / 1M tokens
Qwen3.5 397B A17B (Alibaba) | $1.50 / 1M tokens | $6.00 / 1M tokens
LLaMA 3.1 70B Instruct (Meta) | $0.293 / 1M tokens | $2.253 / 1M tokens
LLaMA 3.3 70B (Meta) | $0.293 / 1M tokens | $2.253 / 1M tokens
LLaMA 4 Scout (Meta) | $0.27 / 1M tokens | $0.85 / 1M tokens
Mistral Small 3.1 24B (Mistral AI) | $0.351 / 1M tokens | $0.555 / 1M tokens
Gemma-SEA-LION v4 27B (AI Singapore) | $0.351 / 1M tokens | $0.555 / 1M tokens
Kimi K2.5 (Moonshot AI) | $0.60 / 1M tokens | $3.00 / 1M tokens
Kimi K2.6 (Moonshot AI) | $0.95 / 1M tokens | $4.75 / 1M tokens
GLM 4.7 Flash (Z.ai) | $0.061 / 1M tokens | $0.40 / 1M tokens
DeepSeek R1 Distill 32B (DeepSeek) | $0.497 / 1M tokens | $4.881 / 1M tokens
Grok 4.3 (xAI) | $1.25 / 1M tokens | $2.50 / 1M tokens | $0.20 / 1M tokens
Grok 4.20 Reasoning (xAI) | $2.00 / 1M tokens | $10.00 / 1M tokens
Grok 4.20 Non-Reasoning (xAI) | $2.00 / 1M tokens | $10.00 / 1M tokens
Grok 4.20 Multi-Agent (xAI) | $2.00 / 1M tokens | $10.00 / 1M tokens
Nemotron 3 120B (NVIDIA) | $0.50 / 1M tokens | $2.50 / 1M tokens
GPT-OSS 120B (OpenAI) | $0.35 / 1M tokens | $1.40 / 1M tokens
GPT-OSS 20B (OpenAI) | $0.20 / 1M tokens | $0.80 / 1M tokens
M2.7 (MiniMax) | $0.30 / 1M tokens | $1.50 / 1M tokens
Qwen2.5 Coder 32B (Qwen) | $0.66 / 1M tokens | $2.64 / 1M tokens
QwQ 32B (Qwen) | $0.66 / 1M tokens | $2.64 / 1M tokens
Qwen3 30B A3B FP8 (Qwen) | $0.0509 / 1M tokens | $0.0509 / 1M tokens
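
As a rough sketch of how per-million-token billing adds up, the snippet below prices one request using the GPT-5 rates from the table. The function name is illustrative. It also assumes that cached tokens are a subset of the input tokens and are billed at the cached rate instead of the input rate, which this page does not spell out.

```python
from decimal import Decimal

MILLION = Decimal(1_000_000)

# USD per 1M tokens, copied from the GPT-5 row of the table.
GPT5_RATES = {
    "input": Decimal("1.25"),
    "output": Decimal("10.00"),
    "cached_input": Decimal("0.125"),
}


def text_cost(rates, input_tokens, output_tokens, cached_tokens=0):
    """Estimate the USD cost of one text request (hypothetical helper).

    Assumption: cached_tokens is the part of input_tokens served from
    cache and billed at the cached rate rather than the input rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    # Models without a cached rate fall back to the normal input rate.
    cached_rate = rates.get("cached_input", rates["input"])
    total = (
        fresh_tokens * rates["input"]
        + cached_tokens * cached_rate
        + output_tokens * rates["output"]
    )
    return total / MILLION


# 100K input tokens (40K of them cached) and 2K output tokens.
print(text_cost(GPT5_RATES, 100_000, 2_000, cached_tokens=40_000))
```

Under these assumptions the request costs $0.10: $0.075 for fresh input, $0.005 for cached input, and $0.02 for output.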

LoRA / Fine-Tunable Base Models

These models support LoRA fine-tuning. Fine-tuned models are priced per adapter; the table shows base model pricing.

Model | Input | Output
Gemma 2B IT LoRA (Google) | $0.01 / 1M tokens
Gemma 7B IT (Google) | $0.05 / 1M tokens | $0.05 / 1M tokens
Gemma 7B IT LoRA (Google) | $0.05 / 1M tokens
Llama 2 7B Chat HF LoRA (Meta) | $0.06 / 1M tokens
Llama 3.1 70B Instruct (Meta) | $0.01 / 1M tokens | $0.01 / 1M tokens
Mistral 7B Instruct v0.2 | $0.05 / 1M tokens | $0.05 / 1M tokens
Mistral 7B Instruct v0.2 LoRA | $0.05 / 1M tokens

Image Generation Pricing

Image models are billed per image unless a row lists another unit (per megapixel (MP), per tile, or per token). Pricing varies by resolution and model.

Premium Image Models

Model | Price
Flux 1 Schnell (Black Forest Labs) | $0.000053 / image
Flux 2 Dev (Black Forest Labs) | tile-step pricing from $0.00021
Flux 2 Flex (Black Forest Labs) | $0.05 / MP output, $0.05 / MP input
Flux 2 Klein 4B (Black Forest Labs) | $0.000059 / input tile, $0.000287 / output tile
Flux 2 Klein 9B (Black Forest Labs) | $0.015 / image
Flux 2 Max (Black Forest Labs) | $0.07 / 1st MP, $0.03 / addtl MP, $0.03 / input MP
Flux 2 Pro Preview (Black Forest Labs) | $0.03 / 1st MP, $0.015 / addtl MP, $0.015 / input MP
Recraft V4 (Recraft) | $0.04 / image
Recraft V4 Pro (Recraft) | $0.25 / image
Recraft V4 Vector (Recraft) | $0.08 / image
Recraft V4 Pro Vector (Recraft) | $0.30 / image
Recraft V4.1 (Recraft) | $0.04 / image
Recraft V4.1 Utility (Recraft) | $0.04 / image
Recraft V4.1 Utility Pro (Recraft) | $0.25 / image
Recraft V4.1 Pro (Recraft) | $0.25 / image
Recraft V4.1 Vector (Recraft) | $0.08 / image
Recraft V4.1 Pro Vector (Recraft) | $0.30 / image
Recraft V4.1 Utility Vector (Recraft) | $0.08 / image
Recraft V4.1 Utility Pro Vector (Recraft) | $0.30 / image
Seedream 4.0 (ByteDance) | $0.03 / image
Seedream 4.5 (ByteDance) | $0.04 / image
Seedream 5 Lite (ByteDance) | $0.035 / image
Seedance 2.0 (ByteDance) | see video pricing
Imagen 4 (Google) | $0.04 / image
Nano Banana (Google) | input $0.30 / 1M tokens, output $30.00 / 1M tokens
Nano Banana 2 (Google) | input $0.50 / 1M tokens, output $60.00 / 1M tokens
Nano Banana Pro (Google) | input $2.00 / 1M tokens, output $120.00 / 1M tokens
Grok Imagine (xAI) | $0.02 / image
Grok Imagine Quality (xAI) | $0.05 / image
GPT Image 1.5 (OpenAI) | input $5.00 / 1M tokens, input images $8.00 / 1M tokens, output $10.00 / 1M tokens
GPT Image 2 (OpenAI) | input $5.00 / 1M tokens, input images $8.00 / 1M tokens, output $10.00 / 1M tokens
Wan 2.6 Image (Alibaba) | $0.03 / image
Phoenix 1.0 (Leonardo) | $0.006 / image
Lucid Origin (Leonardo) | $0.007 / image
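
Megapixel-based rows combine several rates in one price. As a minimal sketch, here is how the Flux 2 Max row could be read: the first output megapixel at one rate, each additional output megapixel at another, plus a rate per input megapixel. The function name is illustrative, and the sketch assumes whole megapixels because the page does not say how fractional megapixels are rounded.

```python
from decimal import Decimal

# USD rates copied from the Flux 2 Max row of the table.
FIRST_MP = Decimal("0.07")       # first output megapixel
ADDITIONAL_MP = Decimal("0.03")  # each further output megapixel
INPUT_MP = Decimal("0.03")       # each input (reference image) megapixel


def flux2_max_cost(output_mp: int, input_mp: int = 0) -> Decimal:
    """Estimate the USD cost of one Flux 2 Max image (hypothetical helper).

    Assumption: megapixel counts are whole numbers; rounding of
    fractional megapixels is not specified on this page.
    """
    if output_mp < 1 or input_mp < 0:
        raise ValueError("need at least 1 output MP and non-negative input MP")
    return FIRST_MP + ADDITIONAL_MP * (output_mp - 1) + INPUT_MP * input_mp


print(flux2_max_cost(output_mp=2, input_mp=1))  # 0.13
```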

Standard Image Models

Model | Price
Stable Diffusion XL Lightning (ByteDance) | $0.035 / image
Dreamshaper 8 LCM (Lykon) | $0.035 / image
Stable Diffusion v1.5 Img2Img (RunwayML) | $0.035 / image
Stable Diffusion v1.5 Inpainting (RunwayML) | $0.035 / image
Stable Diffusion XL Base 1.0 (Stability AI) | $0.035 / image

Video Generation Pricing

Video models are billed per second of output.

Model | Price
Veo 3 Fast (Google) | $0.08 / sec (720p), $0.10 / sec (1080p), $0.10 / sec (720p w/ audio), $0.12 / sec (1080p w/ audio), $0.30 / sec (4K w/ audio)
Veo 3 (Google) | $0.20 / sec (720p), $0.20 / sec (1080p), $0.40 / sec (720p w/ audio), $0.40 / sec (1080p w/ audio)
Veo 3.1 Fast (Google) | $0.08 / sec (720p), $0.10 / sec (1080p), $0.25 / sec (4K), $0.10 / sec (720p w/ audio), $0.12 / sec (1080p w/ audio), $0.30 / sec (4K w/ audio)
Veo 3.1 (Google) | $0.20 / sec (720p), $0.20 / sec (1080p), $0.40 / sec (4K), $0.40 / sec (720p w/ audio), $0.40 / sec (1080p w/ audio), $0.60 / sec (4K w/ audio)
PixVerse V6 | $0.025 / sec (360p), $0.035 / sec (540p), $0.045 / sec (720p), $0.090 / sec (1080p)
PixVerse V5.6 | tiered by resolution + duration
Vidu Q3 Turbo | $0.04 / sec (540p), $0.06 / sec (720p), $0.07 / sec (1080p)
Vidu Q3 Pro | $0.05 / sec (540p), $0.125 / sec (720p), $0.15 / sec (1080p)
Hailuo 2.3 (MiniMax) | $0.047 / sec
Hailuo 2.3 Fast (MiniMax) | $0.032 / sec
Seedance 2.0 (ByteDance) | $0.22 / sec (720p), $0.55 / sec (1080p)
Seedance 2.0 Fast (ByteDance) | $0.08 / sec (720p), $0.17 / sec (1080p)
HH1-T2V (Alibaba) | $0.14 / sec (720p), $0.28 / sec (1080p)
HH1-I2V (Alibaba) | $0.14 / sec (720p), $0.28 / sec (1080p)
Gen 4.5 (Runway) | $0.12 / sec
Grok Imagine Video (xAI) | $0.05 / sec
Grok Imagine Video 1.5 Preview (xAI) | $0.08 / sec, $0.14 / sec (720p)
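
Because the per-second rate depends on resolution and audio, a lookup table is a natural way to estimate video cost. The sketch below uses the Veo 3.1 Fast rates from the table. The function and dictionary names are illustrative, and it assumes billing on the exact output duration, since any rounding of partial seconds is not described here.

```python
from decimal import Decimal

# USD per second for Veo 3.1 Fast, copied from the table.
# Keys are (resolution, with_audio).
VEO_31_FAST = {
    ("720p", False): Decimal("0.08"),
    ("1080p", False): Decimal("0.10"),
    ("4K", False): Decimal("0.25"),
    ("720p", True): Decimal("0.10"),
    ("1080p", True): Decimal("0.12"),
    ("4K", True): Decimal("0.30"),
}


def video_cost(rates, seconds, resolution, with_audio=False):
    """Estimate the USD cost of a clip (hypothetical helper).

    Assumption: cost is rate x seconds with no rounding of duration.
    """
    try:
        rate = rates[(resolution, with_audio)]
    except KeyError:
        raise ValueError(
            f"no listed rate for {resolution}, audio={with_audio}"
        ) from None
    return rate * seconds


print(video_cost(VEO_31_FAST, 8, "1080p", with_audio=True))  # 0.96
```

Raising an error for an unlisted combination avoids silently guessing a price for a resolution the model does not offer.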

Image-to-Video Models

Model | Price
Wan 2.7 I2V (Alibaba) | $0.10 / sec (720p), $0.15 / sec (1080p)

Audio Pricing

Speech-to-Text

Model | Price
Deepgram Nova-3 | $0.0052 / 1M input tokens
Deepgram Flux | $0.0077 / 1M input tokens
Whisper (OpenAI) | $0.000453 / 1M input tokens
Whisper Tiny EN (OpenAI) | $0.000453 / 1M input tokens
Whisper Large V3 Turbo (OpenAI) | $0.000513 / 1M input tokens
Universal 3 Pro (AssemblyAI) | $0.0035 / audio minute
GPT-4o Transcribe (OpenAI) | $0.006 / audio minute
Grok STT (xAI) | $0.001667 / audio minute

Text-to-Speech

Model | Price
TTS-1 (OpenAI) | $0.000015 / character
TTS-1 HD (OpenAI) | $0.00003 / character
Aura 1 (Deepgram) | $0.015 / 1M input tokens
Aura 2 English (Deepgram) | $0.03 / 1M input tokens
Aura 2 Spanish (Deepgram) | $0.03 / 1M input tokens
Speech 2.8 Turbo (MiniMax) | $0.00006 / character
Speech 2.8 HD (MiniMax) | $0.0001 / character
TTS 2 (Inworld) | $0.000035 / character
TTS 1.5 Max (Inworld) | $0.000035 / character
TTS 1.5 Mini (Inworld) | $0.000025 / character
MeloTTS (MyShell) | $0.000205 / 1M input tokens
Grok TTS (xAI) | $0.000015 / character

Gemini 3.1 Flash TTS Pricing

Token Type | Price
Input text tokens | $0.75 / 1M tokens
Input audio tokens | $3.00 / 1M tokens
Output text tokens | $4.50 / 1M tokens
Output audio tokens | $12.00 / 1M tokens

Classification Pricing

Model | Price
ResNet-50 (Microsoft) | $0.00000251 / 1M input tokens
DistilBERT SST-2 (Hugging Face) | $0.0263 / 1M input tokens
BGE Reranker Base (BAAI) | $0.00311 / 1M input tokens

Music Generation

Music and audio models bill per generation, track, request, or compute second depending on the provider. See the Models catalog for the full routable list with units.

Model | Price | Unit
music generator (CassetteAI) | $0.0013 | per compute second
ElevenLabs Music | $0.80 | per generation
Music 2.6 (MiniMax) | $0.01 / $0.15 | per track (lyrics / full)
MiniMax Music 2.5 / 2.6 | $0.15 | per generation
Suno V4.5 / V5 | varies | per generation (async)

Translation Pricing

Model | Price
M2M100 1.2B (Meta) | $0.342 / 1M tokens (input + output)
IndicTrans2 (AI4Bharat) | $0.342 / 1M tokens (input + output)

Embeddings Pricing

Model | Price
BGE Small EN v1.5 (BAAI) | $0.0202 / 1M tokens
BGE Base EN v1.5 (BAAI) | $0.0666 / 1M tokens
BGE Large EN v1.5 (BAAI) | $0.204 / 1M tokens
BGE M3 (BAAI) | $0.0118 / 1M tokens
EmbeddingGemma 300M (Google) | $0.01 / 1M tokens
PLaMo Embedding 1B (Preferred Networks) | $0.0186 / 1M tokens
Qwen3 Embedding 0.6B (Qwen) | $0.0118 / 1M tokens

Image-to-Text Pricing

Model | Price
LLaVA 1.5 7B (LLaVA-HF) | $0.05 / 1M input tokens, $0.05 / 1M output tokens

Voice Activity Detection

Model | Price
Smart Turn V2 (Pipecat AI) | $0.000338 / 1M input tokens

Cost Estimation Examples

Summarize a 10-page PDF

  • Input: ~3,000 tokens
  • Output: ~1,000 tokens
  • Model: LLaMA 3.3 70B (via auto-route)
  • Cost: ~$0.0031 (3K input × $0.293 / 1M ≈ $0.0009, plus 1K output × $2.253 / 1M ≈ $0.0023)

Generate an image

  • Model: Flux 2 Flex (via auto-route)
  • Cost: ~$0.05 (1 MP output × $0.05 / MP, with no input image)

Transcribe a 30-minute meeting

  • Model: Deepgram Nova-3 (via auto-route)
  • Cost: ~$0.156 (30 min × $0.0052, applying the Nova-3 rate per minute of audio)

Generate 10 seconds of 720p video

  • Model: Veo 3 Fast (via auto-route)
  • Cost: ~$0.80 (10 × $0.08)
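
The arithmetic behind the summarization example can be checked with a few lines. This sketch uses the LLaMA 3.3 70B rates from the text pricing table and the example's token counts (~3K input, ~1K output). The function name is illustrative.

```python
from decimal import Decimal

MILLION = Decimal(1_000_000)
INPUT_RATE = Decimal("0.293")   # USD per 1M input tokens (LLaMA 3.3 70B)
OUTPUT_RATE = Decimal("2.253")  # USD per 1M output tokens (LLaMA 3.3 70B)


def summarize_cost(input_tokens, output_tokens):
    """Return (input_cost, output_cost, total) in USD (hypothetical helper)."""
    input_cost = input_tokens * INPUT_RATE / MILLION
    output_cost = output_tokens * OUTPUT_RATE / MILLION
    return input_cost, output_cost, input_cost + output_cost


input_cost, output_cost, total = summarize_cost(3_000, 1_000)
print(f"input ${input_cost:.6f} + output ${output_cost:.6f} = ${total:.6f}")
# input $0.000879 + output $0.002253 = $0.003132
```

The input side alone is about $0.0009. The output tokens cost more than twice that, because the output rate is roughly eight times the input rate.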

Price Tiers

Every model is assigned a price tier to help you quickly compare cost across models. Tiers use percentile ranking within each category — a model’s tier depends on how it compares to other models that do the same job.

Tier | Meaning | Percentile
Economy | Lowest-cost options in the category | 0–20th
Standard | Below-average cost, good value | 20–40th
Balanced | Near the category average | 40–60th
Premium | Above-average cost, higher capability | 60–80th
Flagship | Top-tier, most capable and expensive | 80–100th

How it works

  • Models are grouped by category (LLM, image generation, video generation, etc.) and sorted by average price.
  • Each tier holds roughly 20% of models in that category.
  • Balanced means near the median price for that category.
  • Tiers are relative within categories — an Economy LLM may cost more in absolute dollars than a Flagship TTS model.
  • Categories with fewer than 5 models use a 3-tier split (Economy / Balanced / Flagship); single-model categories are always Balanced.
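
The rules above can be approximated with a rank-based assignment. This is a sketch under stated assumptions, not GreatRouter's actual algorithm: the function name, the exact rank-to-tier mapping, and the handling of ties are all illustrative, since this page describes only the general percentile approach.

```python
FIVE_TIERS = ["Economy", "Standard", "Balanced", "Premium", "Flagship"]
THREE_TIERS = ["Economy", "Balanced", "Flagship"]


def assign_tiers(avg_prices: dict[str, float]) -> dict[str, str]:
    """Map each model in ONE category to a tier by price rank.

    Hypothetical helper. Assumptions: rank 0..n-1 is scaled linearly
    onto the tier list, and ties keep their input order.
    """
    n = len(avg_prices)
    if n == 0:
        return {}
    if n == 1:
        # Single-model categories are always Balanced.
        return {name: "Balanced" for name in avg_prices}
    # Fewer than 5 models: use the 3-tier split.
    tiers = FIVE_TIERS if n >= 5 else THREE_TIERS
    ranked = sorted(avg_prices, key=avg_prices.get)  # cheapest first
    result = {}
    for rank, name in enumerate(ranked):
        # Integer math avoids float rounding at the tier boundaries.
        index = min(len(tiers) - 1, rank * len(tiers) // (n - 1))
        result[name] = tiers[index]
    return result


# Made-up prices for five models in one category.
print(assign_tiers({"a": 0.10, "b": 0.25, "c": 0.50, "d": 1.00, "e": 3.00}))
```

With ten models this mapping places two models in each tier, matching the "roughly 20%" rule. Because tiers are computed per category, the same dollar price can land in different tiers in different categories.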

You can use the maxCost parameter on the /v1/models/route endpoint to filter out models above a given tier, or filter by price_tier in model listings.
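
The request format for maxCost is not shown on this page, so the sketch below illustrates only the second option: filtering a model listing by price_tier on the client side. The listing shape (a list of dicts with "id" and "price_tier" keys), the tier spellings, and the model ids are assumptions made for the example.

```python
# Tier names from the Price Tiers table, cheapest first.
TIER_ORDER = ["Economy", "Standard", "Balanced", "Premium", "Flagship"]


def at_or_below(models: list[dict], max_tier: str) -> list[dict]:
    """Keep models whose price_tier is at or below max_tier.

    Hypothetical helper: assumes each entry has a "price_tier" key
    whose value is one of the names in TIER_ORDER.
    """
    limit = TIER_ORDER.index(max_tier)
    return [m for m in models if TIER_ORDER.index(m["price_tier"]) <= limit]


# Made-up listing entries, not real API output.
listing = [
    {"id": "model-a", "price_tier": "Economy"},
    {"id": "model-b", "price_tier": "Premium"},
    {"id": "model-c", "price_tier": "Balanced"},
]
print([m["id"] for m in at_or_below(listing, "Balanced")])
# ['model-a', 'model-c']
```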

Fair Usage

GreatRouter has no minimum commitment. You pay only for the requests you make. Free credits are included with every new account for testing and evaluation.