OpenAI-compatible surface
Every GeneralCompute endpoint mirrors the OpenAI API shape (chat.completions, models, etc.). Use any existing OpenAI SDK or the official @generalcompute/sdk / generalcompute packages — no request body changes required.
Tool calling & JSON mode
Function and tool calling are fully supported. Define tools exactly as you would with OpenAI and inspect the returnedtool_calls array to decide which function to execute.
Tool calls stream through the same delta events you receive from OpenAI, so orchestrators such as LangChain, Vercel AI SDK, and LlamaIndex work out of the box.
Vision
Pass images alongside text in themessages array using the OpenAI "image_url" parts. Vision is available on models like llama-4-maverick-17b and gemma-3-12b-it.
Reasoning models
Reasoning-first models (deepseek-v3.2, deepseek-r1-0528, etc.) are deployed with long context windows and higher thinking timeouts. Use them when you need chain-of-thought quality:

