Direct Providers
Native platforms that build and host their own models.
| Brand | Usage Limit | Tier 1 | Tier 2 | Tier 3 | Tier 4 |
|---|---|---|---|---|---|
|
Antigravity
Gemini 3.1 Pro
· $2.00 in / $12.00 out per 1M
· AA 57
Gemini 2.5 Pro
· $1.25 in / $10.00 out per 1M
· AA 35
Gemini 2.5 Flash
· $0.30 in / $2.50 out per 1M
· AA 21
|
Daily prompt allowance that scales by subscription tier, with separate quotas for Pro and Thinking models |
Free $0/mo
|
AI Plus $7.99/mo
|
AI Pro $19.99/mo
|
AI Ultra $249.99/mo
|
|
Xiaomi MiMo
MiMo-V2.5-Pro
· $1.00 in / $3.00 out per 1M
MiMo-V2.5
· $0.40 in / $2.00 out per 1M
MiMo-V2-Pro
· $1.00 in / $3.00 out per 1M
|
Monthly or annual token-based credit allowance |
Starter Pack $5.28/mo
B tier · 379/$
|
Standard $14.08/mo
B tier · 473/$
|
Pro $44/mo
A tier · 530/$
|
Max $88/mo
A tier · 606/$
|
|
StepFun Step Plan
Step 3.5 Flash
· $0.10 in / $0.30 out per 1M
· AA 38
Step 3.5 Flash 2603
· $0.10 in / $0.30 out per 1M
· AA 38.5
Step-2-16k
· $5 in / $16 out per 1M
|
Rolling 5-hour prompt quota plus weekly limit, where prompts convert to multiple model requests |
Flash Mini $6.99/mo
S tier · 25,751/$
|
Flash Plus $9.99/mo
S tier · 72,072/$
|
Flash Pro $29/mo
S tier · 93,103/$
|
Flash Max $99/mo
S tier · 90,909/$
|
|
ChatGPT Codex
GPT-5.5
· $5.00 in / $30.00 out per 1M
· AA 60
GPT-5.5 Pro
· $30.00 in / $180.00 out per 1M
GPT-5.1-Codex-Max
· $1.25 in / $10.00 out per 1M
|
Rolling 3-hour message quota with multiplier-based higher tiers |
Go $8/mo
|
Plus $20/mo
|
Pro $100/mo
|
Max $200/mo
|
|
MiniMax
MiniMax-M2.7
· $0.30 in / $1.20 out per 1M
· AA 50
MiniMax-M2.5
· $0.15 in / $0.95 out per 1M
· AA 42
MiniMax-M2.1
· $0.29 in / $0.95 out per 1M
· AA 39
|
Rolling 5-hour request quota for text models plus daily allowances for other modalities |
Starter $10/mo
S tier · 18,000/$
|
Plus $20/mo
S tier · 27,000/$
|
Max $50/mo
S tier · 36,000/$
|
— |
|
Z.AI GLM
GLM-5.1
· $1.40 in / $4.40 out per 1M
· AA 51.4
GLM-5
· $1.00 in / $3.20 out per 1M
· AA 50
GLM-5 Turbo
· $1.20 in / $4.00 out per 1M
· AA 46.8
|
Rolling 5-hour prompt quota with weekly caps, where advanced models consume multiple quota units |
Lite $18/mo
B tier · 133/$
|
Pro $72/mo
B tier · 167/$
|
Max $160/mo
B tier · 300/$
|
— |
|
Kimi
Kimi K2.6
· $0.60 in / $2.50 out per 1M
· AA 54
Kimi K2.5
· $0.60 in / $2.50 out per 1M
Kimi K2
· $0.60 in / $2.50 out per 1M
|
Monthly API credits equal to the plan price, plus chat and researcher quotas |
Moderato $19/mo
|
Allegretto $39/mo
|
Allegro $99/mo
|
Vivace $199/mo
|
|
Claude Code
Claude 4.6 Opus
· $5.00 in / $25.00 out per 1M
· AA 53
Claude 4.6 Sonnet
· $3.00 in / $15.00 out per 1M
· AA 44.4
Claude 4.5 Haiku
· $1.00 in / $5.00 out per 1M
· AA 31
|
Rolling 5-hour message and token quota with plan-specific multipliers |
Pro $20/mo
|
Max 5x $100/mo
|
Max 20x $200/mo
|
— |
Daily prompt allowance that scales by subscription tier, with separate quotas for Pro and Thinking models
Tier 1
Free
$0/mo
- ·Pro: Dynamic daily limit
Tier 2
AI Plus
$7.99/mo
- ·Pro: 30 prompts/day
Tier 3
AI Pro
$19.99/mo
- ·Pro: 100 prompts/day
Tier 4
AI Ultra
$249.99/mo
- ·Pro: 500 prompts/day
Monthly or annual token-based credit allowance
Tier 1
Starter Pack
$5.28/mo
- ·60M credits/month
Tier 2
Standard
$14.08/mo
- ·200M credits/month
Tier 3
Pro
$44/mo
- ·700M credits/month
Tier 4
Max
$88/mo
- ·1,600M credits/month
Rolling 5-hour prompt quota plus weekly limit, where prompts convert to multiple model requests
Tier 1
Flash Mini
$6.99/mo
- ·100 prompts / 5h
Tier 2
Flash Plus
$9.99/mo
- ·400 prompts / 5h
Tier 3
Flash Pro
$29/mo
- ·1,500 prompts / 5h
Tier 4
Flash Max
$99/mo
- ·5,000 prompts / 5h
Rolling 3-hour message quota with multiplier-based higher tiers
Tier 1
Go
$8/mo
- ·GPT-5.2 Instant: 160 messages / 3h
Tier 2
Plus
$20/mo
- ·GPT-5.5: 160 messages / 3h
Tier 3
Pro
$100/mo
- ·GPT-5.5 Pro: 5x Plus usage
Tier 4
Max
$200/mo
- ·GPT-5.5 Pro: 20x Plus usage
Rolling 5-hour request quota for text models plus daily allowances for other modalities
Tier 1
Starter
$10/mo
- ·1,500 requests / 5h
Tier 2
Plus
$20/mo
- ·4,500 requests / 5h
Tier 3
Max
$50/mo
- ·15,000 requests / 5h
Tier 4
—Rolling 5-hour prompt quota with weekly caps, where advanced models consume multiple quota units
Tier 1
Lite
$18/mo
- ·80 base prompts / 5h, 400 / week
Tier 2
Pro
$72/mo
- ·400 base prompts / 5h, 2000 / week
Tier 3
Max
$160/mo
- ·1600 base prompts / 5h, 8000 / week
Tier 4
—Monthly API credits equal to the plan price, plus chat and researcher quotas
Tier 1
Moderato
$19/mo
- ·$19 in API credits/month
Tier 2
Allegretto
$39/mo
- ·$39 in API credits/month
Tier 3
Allegro
$99/mo
- ·$99 in API credits/month
Tier 4
Vivace
$199/mo
- ·$199 in API credits/month
Rolling 5-hour message and token quota with plan-specific multipliers
Tier 1
Pro
$20/mo
- ·~45 messages / 5h
Tier 2
Max 5x
$100/mo
- ·~200 messages / 5h
Tier 3
Max 20x
$200/mo
- ·~900 messages / 5h
Tier 4
—Aggregators
Platforms that route to or bundle multiple underlying models.
| Brand | Usage Limit | Tier 1 | Tier 2 | Tier 3 | Tier 4 |
|---|---|---|---|---|---|
|
OpenCode Go
DeepSeek V4 Pro
· $1.74 in / $3.48 out per 1M
· AA 52
Kimi K2.5
· $0.60 in / $3.00 out per 1M
· AA 47
GLM-5.1
· $1.40 in / $4.40 out per 1M
· AA 43.8
|
Rolling 5-hour, weekly, and monthly dollar-equivalent usage limits |
Go $5 first mo, then $10/mo
|
— | — | — |
|
Nous Portal
Hermes 4 405B
· $0.09 in / $0.37 out per 1M
Hermes 4 70B
· $0.05 in / $0.20 out per 1M
Hermes 3 70B
· $0.30 in / $0.30 out per 1M
· AA 12.6
|
Monthly credits by plan |
Basic $10/mo
C tier · 10/$
|
Plus $20/mo
C tier · 10/$
|
Scale $50/mo
C tier · 10/$
|
Max $100/mo
C tier · 10/$
|
|
Cursor
Claude 3.7 Sonnet
· $3.00 in / $15.00 out per 1M
· AA 35
GPT-4o
· $2.50 in / $10.00 out per 1M
· AA 19
Claude 3.5 Sonnet
· $3.00 in / $15.00 out per 1M
· AA 16
|
Monthly credit pool for premium models based on plan tier, with unlimited Auto mode completions |
Hobby Free
|
Pro $20/mo
|
Pro+ $60/mo
|
Ultra $200/mo
|
|
GitHub Copilot
GPT-4o
· $2.50 in / $10.00 out per 1M
· AA 17
Claude 3.5 Sonnet
· $3.00 in / $15.00 out per 1M
· AA 16
Gemini 1.5 Pro
· $1.25 in / $5.00 out per 1M
· AA 16
|
Monthly premium requests by plan |
Free $0/mo
|
Pro $10/mo
C tier · 30/$
|
Pro+ $39/mo
B tier · 38/$
|
— |
|
Fireworks Fire Pass
Kimi K2.5 Turbo
· $0.60 in / $2.50 out per 1M
· AA 47
Kimi K2.6
· $0.74 in / $4.66 out per 1M
· AA 54
Llama 3.1 405B Instruct
· $3.00 in / $3.00 out per 1M
· AA 17
|
Unlimited weekly access to a specific model router for personal agentic coding |
Fire Pass $7 / 7 days
|
— | — | — |
Crof
DeepSeek V4 Pro
· $1.00 in / $2.15 out per 1M
· AA 52
Kimi K2.6
· $0.50 in / $1.99 out per 1M
· AA 54
GLM-5
· $0.48 in / $1.90 out per 1M
· AA 50
|
Daily request quota across all models |
Hobby $5/mo
A tier · 3,000/$
|
Pro $10/mo
A tier · 3,000/$
|
Intermediate $20/mo
S tier · 3,750/$
|
— |
|
Kilo Pass
Claude 3.5 Sonnet
· $3.00 in / $15.00 out per 1M
· AA 16
GPT-4o
· $2.50 in / $10.00 out per 1M
· AA 17
DeepSeek V3.2
· $0.28 in / $0.42 out per 1M
· AA 32
|
Monthly paid credits plus up to 50% free bonus credits, billed at exact provider API rates |
Starter $19/mo
C tier · 14/$
|
Pro $49/mo
C tier · 14/$
|
Expert $199/mo
C tier · 14/$
|
— |
|
BytePlus
GLM-5.1
· $1.05 in / $3.50 out per 1M
· AA 51
Kimi-K2.5
· $0.60 in / $3.00 out per 1M
· AA 47
DeepSeek-V3.2
· $0.14 in / $0.28 out per 1M
· AA 32
|
Rolling 5-hour request quota plus weekly and monthly caps |
Lite $10/mo
B tier · 240/$
|
Pro $50/mo
B tier · 240/$
|
— | — |
|
Hugging Face Pro
Llama 3.1 70B Instruct
· $0.56 in / $0.56 out per 1M
· AA 12
Mixtral 8x7B Instruct
· $0.54 in / $0.60 out per 1M
· AA 8
Command R+
· $2.50 in / $10.00 out per 1M
|
Monthly inference credits and daily ZeroGPU compute time |
Pro $9/mo
|
— | — | — |
|
Baidu Cloud
GLM-5
· $0.60 in / $2.08 out per 1M
· AA 49.8
MiniMax-M2.5
· $0.30 in / $1.20 out per 1M
· AA 41.9
Kimi-K2.5
· $0.60 in / $3.00 out per 1M
|
Monthly request allowance by plan |
Lite ¥40/mo (~$5.50)
S tier · 3,273/$
|
Pro ¥200/mo (~$27.50)
S tier · 3,273/$
|
— | — |
|
Alibaba Cloud
Qwen3.5-Plus
· $0.40 in / $2.40 out per 1M
GLM-5
· $1.00 in / $3.20 out per 1M
· AA 50
Kimi K2.5
· $0.60 in / $2.50 out per 1M
· AA 47
|
Monthly request quota with 5-hour and weekly caps |
Pro $49/mo
A tier · 1,837/$
|
— | — | — |
|
Ollama Cloud
Kimi K2.6
· $0.95 in / $4.00 out per 1M
· AA 54
DeepSeek V4 Pro
· $0.30 in / $0.50 out per 1M
· AA 52
GLM-5.1
· $1.05 in / $3.50 out per 1M
· AA 51
|
Metered by GPU time consumption based on model size and request duration rather than fixed token or request caps |
Pro $20/mo
|
Max $100/mo
|
— | — |
|
Cerebras
GLM-4.7
· $2.25 in / $2.75 out per 1M
· AA 42
gpt-oss-120B
· $0.35 in / $0.75 out per 1M
· AA 33
Llama 3.3 70B
· $0.85 in / $1.20 out per 1M
· AA 14
|
Daily token allowance by plan |
Pro $50/mo
A tier · 480/$
|
Max $200/mo
A tier · 600/$
|
— | — |
Rolling 5-hour, weekly, and monthly dollar-equivalent usage limits
Tier 1
Go
$5 first mo, then $10/mo
- ·$60 of usage / month
Tier 2
—Tier 3
—Tier 4
—Monthly credits by plan
Tier 1
Basic
$10/mo
- ·$11 credits/month
Tier 2
Plus
$20/mo
- ·$22 credits/month
Tier 3
Scale
$50/mo
- ·$55 credits/month
Tier 4
Max
$100/mo
- ·$110 credits/month
Monthly credit pool for premium models based on plan tier, with unlimited Auto mode completions
Tier 1
Hobby
Free
- ·50 slow requests/month
Tier 2
Pro
$20/mo
- ·$20 credits/month
Tier 3
Pro+
$60/mo
- ·$60 credits/month
Tier 4
Ultra
$200/mo
- ·$400 credits/month
Monthly premium requests by plan
Tier 1
Free
$0/mo
- ·50 premium requests/month
Tier 2
Pro
$10/mo
- ·300 premium requests/month
Tier 3
Pro+
$39/mo
- ·1,500 premium requests/month
Tier 4
—Unlimited weekly access to a specific model router for personal agentic coding
Tier 1
Fire Pass
$7 / 7 days
- ·Kimi K2.5 Turbo: Unlimited usage
Tier 2
—Tier 3
—Tier 4
—
Daily request quota across all models
Tier 1
Hobby
$5/mo
- ·500 requests/day
Tier 2
Pro
$10/mo
- ·1,000 requests/day
Tier 3
Intermediate
$20/mo
- ·2,500 requests/day
Tier 4
—Monthly paid credits plus up to 50% free bonus credits, billed at exact provider API rates
Tier 1
Starter
$19/mo
- ·Up to $26.60 credits/month
Tier 2
Pro
$49/mo
- ·Up to $68.60 credits/month
Tier 3
Expert
$199/mo
- ·Up to $278.60 credits/month
Tier 4
—Rolling 5-hour request quota plus weekly and monthly caps
Tier 1
Lite
$10/mo
- ·1,200 requests / 5h
Tier 2
Pro
$50/mo
- ·6,000 requests / 5h
Tier 3
—Tier 4
—Monthly inference credits and daily ZeroGPU compute time
Tier 1
Pro
$9/mo
- ·2M inference credits/month and 25 mins H200 compute/day
Tier 2
—Tier 3
—Tier 4
—Monthly request allowance by plan
Tier 1
Lite
¥40/mo (~$5.50)
- ·18,000 requests/month
Tier 2
Pro
¥200/mo (~$27.50)
- ·90,000 requests/month
Tier 3
—Tier 4
—Monthly request quota with 5-hour and weekly caps
Tier 1
Pro
$49/mo
- ·90,000 requests/month
Tier 2
—Tier 3
—Tier 4
—Metered by GPU time consumption based on model size and request duration rather than fixed token or request caps
Tier 1
Pro
$20/mo
- ·50x Free usage
Tier 2
Max
$100/mo
- ·5x Pro usage
Tier 3
—Tier 4
—Daily token allowance by plan
Tier 1
Pro
$50/mo
- ·24M tokens/day
Tier 2
Max
$200/mo
- ·120M tokens/day
Tier 3
—Tier 4
—Need More Than a Comparison Table
If you want help selecting models, reducing spend, and turning AI tools into a usable workflow, Hermes Guide offers AI automation services and Hermes Agent setup for businesses.
Explore Services