Skip to content

chore(pricing): Update fireworks-ai pricing#549

Open
siddharthsambharia-portkey wants to merge 30 commits intomainfrom
pricing-update/fireworks-ai
Open

chore(pricing): Update fireworks-ai pricing#549
siddharthsambharia-portkey wants to merge 30 commits intomainfrom
pricing-update/fireworks-ai

Conversation

@siddharthsambharia-portkey
Copy link
Collaborator

@siddharthsambharia-portkey siddharthsambharia-portkey commented Mar 17, 2026

🔄 Pricing Update: fireworks-ai

📊 Summary (complete_diff mode)

Change Type Count
➕ Models added 17
🔄 Models updated (merged) 4

➕ New Models

  • deepseek-v3p1
  • deepseek-v3p2
  • glm-4p7
  • glm-5
  • gpt-oss-120b
  • gpt-oss-20b
  • llama-v3p3-70b-instruct
  • minimax-m2p1
  • minimax-m2p5
  • qwen3-8b
  • qwen3-vl-30b-a3b-instruct
  • qwen3-vl-30b-a3b-thinking
  • flux-1-dev-fp8
  • flux-1-schnell-fp8
  • flux-kontext-pro
  • flux-kontext-max
  • qwen3-embedding-8b

🔄 Updated Models

  • kimi-k2-instruct-0905
  • kimi-k2-thinking
  • kimi-k2p5
  • mixtral-8x22b-instruct

Model → Pricing Category Mapping

Named Families (exact page values)

Model ID Input Cached Input Output Notes
deepseek-v3p1 $0.56 $0.28 (50%) $1.68 DeepSeek V3 family
deepseek-v3p2 $0.56 $0.28 (50%) $1.68 DeepSeek V3 family
glm-4p7 $0.60 $0.30 (50%) $2.20 GLM-4.7
glm-5 $1.00 $0.20 (page) $3.20 GLM-5
gpt-oss-120b $0.15 $0.075 (50%) $0.60 OpenAI gpt-oss-120b
gpt-oss-20b $0.07 $0.035 (50%) $0.30 OpenAI gpt-oss-20b
kimi-k2-instruct-0905 $0.60 $0.30 (50%) $2.50 Kimi K2 Instruct
kimi-k2-thinking $0.60 $0.30 (50%) $2.50 Kimi K2 Thinking
kimi-k2p5 $0.60 $0.10 (page) $3.00 Kimi K2.5
minimax-m2p1 $0.30 $0.03 (page) $1.20 MiniMax M2 family
minimax-m2p5 $0.30 $0.03 (page) $1.20 MiniMax M2 family
qwen3-vl-30b-a3b-instruct $0.15 $0.075 (50%) $0.60 Qwen3 VL 30B A3B
qwen3-vl-30b-a3b-thinking $0.15 $0.075 (50%) $0.60 Qwen3 VL 30B A3B

Tier-Based Text/Vision

Model ID Tier Input Cached Input Output
llama-v3p3-70b-instruct >16B $0.90 $0.45 (50%) $0.90
mixtral-8x22b-instruct MoE 56.1–176B $1.20 $0.60 (50%) $1.20
qwen3-8b 4B–16B $0.20 $0.10 (50%) $0.20

Image Models

Model ID Pricing Type Price
flux-1-dev-fp8 Per step $0.0005/step
flux-1-schnell-fp8 Per step $0.00035/step
flux-kontext-pro Per image $0.04/image
flux-kontext-max Per image $0.08/image

Embedding Models

Model ID Input
qwen3-embedding-8b $0.10/1M tokens

Skipped

  • qwen3-reranker-8b — Reranker model (excluded per skill rules)

Generated by Pricing Agent on 2026-03-24

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant