Models - OpenServ Docs

These are the models available through SERV. Use any API ID below as the model name when calling the SERV endpoint with your OpenAI or Anthropic SDK, or try the models in the Playground. Input and output prices are per million tokens and include SERV Reasoning. Context is the maximum context window, in tokens.
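Because prices are quoted per million tokens, the cost of a single request is each token count multiplied by its price and divided by 1,000,000. The following is a minimal sketch of that arithmetic, not an official SERV utility: the `PRICES` mapping and the `estimate_cost` helper are hypothetical names, and the three entries are copied by hand from the tables on this page, so they need updating whenever the listed prices change.

```python
from decimal import Decimal

# Per-million-token prices copied from the tables on this page: (input, output).
# This mapping is illustrative only; it is not fetched from SERV.
PRICES = {
    "gpt-5.4-mini": (Decimal("1.00"), Decimal("6.00")),
    "claude-haiku-4.5": (Decimal("1.25"), Decimal("6.50")),
    "deepseek-v4-flash": (Decimal("0.180"), Decimal("0.350")),
}

MILLION = Decimal(1_000_000)


def estimate_cost(model_id: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the price of one request from its input and output token counts."""
    input_price, output_price = PRICES[model_id]
    return (input_price * input_tokens + output_price * output_tokens) / MILLION


if __name__ == "__main__":
    # 200,000 input tokens and 50,000 output tokens on GPT-5.4 Mini.
    print(estimate_cost("gpt-5.4-mini", 200_000, 50_000))
```

`Decimal` is used instead of `float` so that small per-token amounts add up without binary rounding error. The token counts passed in would normally come from the usage figures returned with a response.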

OpenAI

Model	API ID	Input	Output	Context
GPT-5.5	`gpt-5.5`	$6.50	$39.00	1M
GPT-5.4	`gpt-5.4`	$3.25	$20.00	1M
GPT-5.4 Mini	`gpt-5.4-mini`	$1.00	$6.00	400K
GPT-5.4 Nano	`gpt-5.4-nano`	$0.250	$1.60	400K
o3	`o3`	$2.50	$10.00	200K
o3 Mini	`o3-mini`	$1.40	$5.50	200K
o3 Pro	`o3-pro`	$26.00	$104.00	200K
o4 Mini	`o4-mini`	$1.40	$5.50	200K

Anthropic

Model	API ID	Input	Output	Context
Claude Opus 4.6	`claude-opus-4.6`	$6.50	$33.00	1M
Claude Sonnet 4.6	`claude-sonnet-4.6`	$4.00	$20.00	1M
Claude Haiku 4.5	`claude-haiku-4.5`	$1.25	$6.50	200K

Google

Model	API ID	Input	Output	Context
Gemini Flash Latest	`gemini-flash-latest`	$0.650	$4.00	1M
Gemini Pro Latest	`gemini-pro-latest`	$2.50	$16.00	1M
Gemma 4 26B A4B	`gemma-4-26b-a4b-it`	$0.080	$0.430	262K
Gemma 4 31B	`gemma-4-31b-it`	$0.170	$0.500	262K

xAI

Model	API ID	Input	Output	Context
Grok 4.3	`grok-4.3`	$1.60	$3.25	1M
Grok 4.20	`grok-4.20`	$1.60	$3.25	2M

Qwen

Model	API ID	Input	Output	Context
Qwen3.6 Flash	`qwen3.6-flash`	$0.330	$2.00	1M
Qwen3.6 Max Preview	`qwen3.6-max-preview`	$1.40	$8.00	262K

DeepSeek

Model	API ID	Input	Output	Context
DeepSeek V4 Pro	`deepseek-v4-pro`	$0.550	$1.10	1M
DeepSeek V4 Flash	`deepseek-v4-flash`	$0.180	$0.350	1M

See also

Quickstart — make your first request.
SDK Integration — endpoints and parameter mapping.
Endpoint compatibility — which models work on which endpoint.
