Model Pricing

All prices are per 1 million tokens in USD.
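Because every price is quoted per 1 million tokens, the cost of a single request is each token count divided by 1,000,000 and multiplied by the matching price. The sketch below illustrates that arithmetic; `request_cost` is a hypothetical helper, not part of any API described on this page, and the token counts are invented for the example. The prices are the listed glm-5.2 rates.

```python
from decimal import Decimal

# Prices on this page are quoted per 1 million tokens, in USD.
TOKENS_PER_UNIT = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: str, output_price: str) -> Decimal:
    """Return the USD cost of one request (hypothetical helper).

    Prices are passed as strings so Decimal keeps them exact.
    """
    cost_in = Decimal(input_tokens) / TOKENS_PER_UNIT * Decimal(input_price)
    cost_out = Decimal(output_tokens) / TOKENS_PER_UNIT * Decimal(output_price)
    return cost_in + cost_out


# glm-5.2 is listed at $1.25 input and $4.40 output per 1M tokens.
# The token counts here are made up for illustration.
cost = request_cost(200_000, 50_000, "1.25", "4.40")
print(f"${cost:.4f}")  # prints $0.4700
```

`Decimal` is used instead of floats so that small per-request amounts add up without rounding drift when summed over many requests.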

Text Models

GLM 5.2 (glm-5.2), Large
Input Price: $1.25
Output Price: $4.40
Context: 200K
Next-Gen Agentic Engineering and Complex Reasoning

GLM 5.1 (glm-5.1), Large, Arena Rank #16
Input Price: $1.50
Output Price: $5.00
Context: 200K
Agentic Engineering and Complex Reasoning
Also available as glm-5.1-non-thinking (same pricing, without extended thinking).

GLM 5 (glm-5), Large, Arena Rank #25
Input Price: $1.00
Output Price: $3.20
Context: 200K
Agentic Engineering and Complex Reasoning

Kimi K2.7 Code (kimi-k2.7-code), Large
Input Price: $0.75
Output Price: $4.00
Context: 256K
Code Generation, Agentic Coding, Parallel Agent Workflows

Kimi K2.5 (kimi-k2.5), Large, Arena Rank #32
Input Price: $0.60
Output Price: $3.00
Context: 256K
Complex Agent Use and Code Development

Kimi K2.6 (kimi-k2.6), Large
Input Price: $0.50
Output Price: $3.25
Context: 256K
Visual Reasoning, Math, Parallel Agent Workflows

Gemma 4 31B (gemma-4-31b), Large, Arena Rank #29
Input Price: $0.15
Output Price: $0.40
Context: 256K
Math, Science, Coding, Document Parsing, Visual Reasoning

Gemma 4 26B A4B (gemma-4-26b-a4b), Large, Arena Rank #45
Input Price: $0.15
Output Price: $0.40
Context: 256K
Efficient Reasoning, Low-Latency Inference, Image Analysis

Qwen 3.5 35B A3B (qwen35-35b-a3b), Medium, Arena Rank #102
Input Price: $0.30
Output Price: $1.25
Context: 256K
General Purpose, Long Context, Image Analysis

Qwen 3.5 9B (qwen35-9b), Medium
Input Price: $0.05
Output Price: $0.15
Context: 256K
Fast Responses, Image Analysis, Simple Tasks

Arcee Trinity Large Thinking (arcee-trinity-large-thinking), Large, Arena Rank #123
Input Price: $0.30
Output Price: $1.00
Context: 256K
Agentic Workflows, Multi-Step Planning, Tool Orchestration

GLM 4.7 (glm-4.7), Large, Arena Rank #42
Input Price: $0.50
Output Price: $2.25
Context: 198K
Complex Agent Use

GLM 4.7 Thinking (glm-4.7-thinking), Large, Arena Rank #42
Input Price: $0.45
Output Price: $2.00
Context: 198K
Complex Agent Use and Reasoning

GLM 4.7 Flash (glm-4.7-flash), Large, Arena Rank #131
Input Price: $0.10
Output Price: $0.50
Context: 128K
Fast Advanced Agent Use

Qwen3 235B (qwen3-235b), Large, Arena Rank #62
Input Price: $0.40
Output Price: $3.00
Context: 128K
Advanced Agent Use and Code Development

MiniMax M2.5 (minimax-m2.5), Large, Arena Rank #95
Input Price: $0.30
Output Price: $1.20
Context: 198K
AI Agents and Autonomous Workflows

MiniMax M2.7 (MiniMax-M2.7), Large
Input Price: $0.35
Output Price: $1.50
Context: 198K
Cost-Efficient AI Agents and Autonomous Workflows

DeepSeek V4 Pro (deepseek-v4-pro), Large
Input Price: $1.60
Output Price: $3.50
Context: 1M
Frontier Reasoning, Complex Coding, Ultra-Long Context

DeepSeek V4 Flash (deepseek-v4-flash), Medium
Input Price: $0.15
Output Price: $0.30
Context: 1M
Fast Frontier-Tier Reasoning, Long-Context Tasks at Speed

Qwen3 Coder 480B (qwen3-coder-480b-a35b-instruct), Large, Arena Rank #113
Input Price: $0.70
Output Price: $2.80
Context: 256K
Code Development

Qwen3 Next 80B (qwen3-next-80b), Medium, Arena Rank #93
Input Price: $0.15
Output Price: $1.50
Context: 256K
Long-Context Chat and Content

GPT OSS 120B (gpt-oss-120b), Large, Arena Rank #144
Input Price: $0.07
Output Price: $0.28
Context: 128K
Advanced Chat and Content

Hermes 3 Llama 3.1 405B (hermes-3-llama-3.1-405b), Large, Arena Rank #172
Input Price: $1.00
Output Price: $3.00
Context: 128K
Advanced Chat and Content

Llama 3.3 70B (llama-3.3-70b), Medium, Arena Rank #193
Input Price: $0.70
Output Price: $2.50
Context: 128K
General Chat and Content

Llama 3.2 3B (llama-3.2-3b), Small, Arena Rank #295
Input Price: $0.10
Output Price: $0.50
Context: 128K
Classification, Simple QA

Mistral 31 24B (mistral-31-24b), Medium, Arena Rank #212
Input Price: $0.50
Output Price: $2.00
Context: 128K
Basic Agent Functionality

Venice Uncensored (venice-uncensored), Medium
Input Price: $0.20
Output Price: $0.90
Context: 32K
Uncensored Creative Use

Proprietary, Anonymized Models

Frontier proprietary models served through anonymized providers on the Morpheus marketplace. All are also available with web search by appending :web to the model name.
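The `:web` convention above amounts to a string suffix on the model name. A minimal sketch, assuming the suffixed name is simply passed wherever the plain model name would be; `with_web_search` is a hypothetical helper, and this page does not describe the request format itself.

```python
WEB_SUFFIX = ":web"  # suffix named on this page for web-search variants


def with_web_search(model_name: str) -> str:
    """Return the web-search variant of a model name (hypothetical helper).

    The suffix is appended once; an already-suffixed name is returned as-is.
    """
    if model_name.endswith(WEB_SUFFIX):
        return model_name
    return model_name + WEB_SUFFIX


print(with_web_search("grok-4.20"))  # prints grok-4.20:web
```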

Claude Fable 5 (claude-fable-5), Large
Input Price: $12.00
Output Price: $60.00
Frontier Agentic Coding and Reasoning

Claude Opus 4.8 (claude-opus-4.8), Large
Input Price: $6.00
Output Price: $30.00
Advanced Reasoning and Code Development

Gemini 3.1 Pro Preview (gemini-3.1-pro-preview), Large
Input Price: $2.50
Output Price: $15.00
Multimodal Frontier Reasoning

GPT-5.5 Pro (GPT-5.5-Pro), Large
Input Price: $38.00
Output Price: $250.00
Maximum-Capability Frontier Reasoning

GPT-5.5 (GPT-5.5), Large
Input Price: $6.50
Output Price: $38.00
Frontier Chat, Reasoning, and Coding

Grok 4.20 (grok-4.20), Large
Input Price: $1.40
Output Price: $2.80
Fast, Cost-Efficient Frontier Intelligence

Embedding Models

BGE M3 (text-embedding-bge-m3), Embedding
Input Price: $0.10
Output Price: $0.50
Vector Embeddings for RAG

Legend

🧠 Reasoning — Extended thinking and step-by-step problem solving
⚡ Function Calling — Can invoke tools and external APIs
👀 Vision — Can analyze and understand images
Arena Rank — Position on the Chatbot Arena Leaderboard (lower is better)
