Skip to main content

Available Models

General Compute offers a wide range of open-source and open-weight models. All models are served via our OpenAI-compatible API. These are our top picks for the best balance of quality, speed, and cost.

MiniMax M2.7

Our best general-purpose model. Exceptional quality with a 192k context window at an unbeatable price. Perfect for production workloads.Model ID: minimax-m2.7

DeepSeek V3.2

State-of-the-art reasoning model. Best-in-class performance on complex tasks with built-in chain-of-thought reasoning.Model ID: deepseek-v3.2

All Models

ModelModel IDContextInput / 1M tokensOutput / 1M tokensCapabilities
MiniMax M2.7minimax-m2.7192k$0.28$1.20
DeepSeek V3.2deepseek-v3.232k$0.25$0.38Reasoning
DeepSeek V3.1deepseek-v3.1128k$0.21$0.79Reasoning
GPT-OSS 120Bgpt-oss-120b128k$0.21$0.79

Model Capabilities

  • Reasoning — Models with built-in chain-of-thought reasoning. These models think step-by-step before producing a final answer, leading to significantly better results on complex tasks like math, code, and analysis.

Choosing a Model

Use CaseRecommended ModelWhy
General-purpose chat & generationminimax-m2.7Best quality-to-cost ratio, 192k context
Complex reasoning & analysisdeepseek-v3.2State-of-the-art reasoning capabilities
Longer-context reasoningdeepseek-v3.1128k context window for longer reasoning tasks
Large context windowsminimax-m2.7192k context at low cost

Custom checkpoints

Bring your own LoRA, GGUF, or full-finetuned checkpoints and run them on the same ultra-low latency infrastructure:
  • Share the model artifact (S3, Hugging Face, or direct upload) with the GeneralCompute team.
  • We containerize the checkpoint, attach accelerators in us-west-2, and expose it behind a private model ID (for example acme/my-custom-model).
  • Your private IDs behave exactly like any other model parameter in the OpenAI-compatible API, including streaming, tool calling, and function execution.
  • Enterprise plans layer on SLAs, dedicated pools, and per-org allow lists.
Contact support@generalcompute.com to schedule onboarding or to request deployments in additional regions.
All prices are in USD. Pricing is based on token usage with no minimum commitment on the Pay As You Go plan.