Skip to main content

Available Models

General Compute offers a wide range of open-source and open-weight models. All models are served via our OpenAI-compatible API. These are our top picks for the best balance of quality, speed, and cost.

MiniMax M2.5

Our best general-purpose model. Exceptional quality with a massive 160k context window at an unbeatable price. Perfect for production workloads.Model ID: minimax-m2.5

DeepSeek V3.2

State-of-the-art reasoning model. Best-in-class performance on complex tasks with built-in chain-of-thought reasoning.Model ID: deepseek-v3.2

All Models

ModelModel IDContextInput / 1M tokensOutput / 1M tokensCapabilities
MiniMax M2.5minimax-m2.5160k$0.20$1.17
DeepSeek V3.2deepseek-v3.28k$3.00$4.50Reasoning
DeepSeek R1 0528deepseek-r1-0528128k$5.00$7.00Reasoning
DeepSeek V3 0324deepseek-v3-0324128k$3.00$4.50
DeepSeek V3.1deepseek-v3.1128k$3.00$4.50Reasoning
DeepSeek V3.1 Terminusdeepseek-v3.1-terminus128k$3.00$4.50Reasoning
DeepSeek V3.1 CBdeepseek-v3.1-cb128k$0.15$0.75Reasoning
DeepSeek R1 Distill Llama 70Bdeepseek-r1-distill-llama-70b128k$0.70$0.80Reasoning
Llama 3.3 70Bllama-3.3-70b128k$0.60$1.20
Llama 3.1 8Bllama-3.1-8b16k$0.10$0.20
Llama 4 Maverick 17Bllama-4-maverick-17b128k$0.63$1.80Vision
Llama 3.3 Swallow 70Bllama-3.3-swallow-70b16k$0.60$1.20
GPT-OSS 120Bgpt-oss-120b128k$0.21$0.79
Qwen3 235Bqwen3-235b64k$0.46$1.82
Qwen3 32Bqwen3-32b32k$0.08$0.24
Gemma 3 12Bgemma-3-12b-it131k$0.04$0.13Vision

Model Capabilities

  • Reasoning — Models with built-in chain-of-thought reasoning. These models think step-by-step before producing a final answer, leading to significantly better results on complex tasks like math, code, and analysis.
  • Vision — Models that accept image inputs alongside text. Pass images via the standard OpenAI-compatible image_url content type.

Choosing a Model

Use CaseRecommended ModelWhy
General-purpose chat & generationminimax-m2.5Best quality-to-cost ratio, 160k context
Complex reasoning & analysisdeepseek-v3.2State-of-the-art reasoning capabilities
Budget-friendly reasoningdeepseek-v3.1-cbStrong reasoning at $0.15/M input
Fast & cheapllama-3.1-8b$0.10/M input, great for simple tasks
Vision tasksllama-4-maverick-17bMultimodal with strong image understanding
Large context windowsminimax-m2.5160k context at low cost
Cost-optimized bulk processinggemma-3-12b-it$0.04/M input, supports vision too
All prices are in USD. Pricing is based on token usage with no minimum commitment on the Pay As You Go plan.