stagewise connects to large language models through providers. A provider hosts models — Anthropic, OpenAI, Google, DeepSeek, Z.AI, MiniMax, or any custom endpoint. Within each provider, you choose which model to use.

Models vs providers vs endpoint modes

Concept | What it means
Provider | The service that hosts models (Anthropic, OpenAI, Google, DeepSeek, etc.)
Model | The specific LLM you use (Claude Sonnet, GPT-5, Gemini Flash, etc.)
Endpoint mode | How requests reach the provider: through your stagewise Account, your own API key, or a custom endpoint

Three ways to power your agents

Each provider is configured independently. Mix and match as needed.

1. stagewise Account

Start immediately with a curated library of pre-configured models. No API keys, no setup — just sign in and choose a model. Usage is billed through your stagewise account. Best for: getting started fast, trying different models, not managing multiple API subscriptions.

2. Import your subscriptions (BYOK)

Already paying for Claude, GPT, Gemini, Kimi, Qwen, DeepSeek, GLM, or MiniMax? Bring your existing API keys and use them directly in stagewise. You control billing, rate limits, and data routing. Best for: teams with existing provider contracts, maximizing subscriptions you already pay for. Set up BYOK →

3. Custom & local providers

Connect any service that speaks a standard API — including models running on your machine:
  • Local inference: Ollama, LM Studio, vLLM — zero external API calls
  • Cloud platforms: Azure OpenAI, AWS Bedrock, Google Vertex AI
  • Enterprise proxies: LiteLLM, any OpenAI-compatible gateway
Best for: air-gapped environments, data privacy, fixed-cost local inference. Connect a custom provider →
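
To illustrate what "speaks a standard API" can mean in practice, the sketch below builds a chat request in the OpenAI wire format for a custom endpoint. It is not stagewise code: `CustomEndpoint`, `ChatRequest`, and `buildChatRequest` are hypothetical names for this example, the base URL assumes Ollama's default local port with its OpenAI-compatible `/v1` path, and `llama3.2` is a placeholder model name.

```typescript
// Hypothetical helper types for this example; not a stagewise API.
interface CustomEndpoint {
  baseUrl: string; // an OpenAI-compatible gateway or local server
  apiKey?: string; // local servers often need no key
}

interface ChatRequest {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: string;
}

// Build (but do not send) a chat completion request in the OpenAI wire format.
function buildChatRequest(
  endpoint: CustomEndpoint,
  model: string,
  prompt: string,
): ChatRequest {
  const headers: Record<string, string> = {
    "Content-Type": "application/json",
  };
  // Only attach credentials when the endpoint is configured with a key.
  if (endpoint.apiKey) {
    headers["Authorization"] = `Bearer ${endpoint.apiKey}`;
  }
  return {
    // Strip trailing slashes so the path is joined exactly once.
    url: `${endpoint.baseUrl.replace(/\/+$/, "")}/chat/completions`,
    method: "POST",
    headers,
    body: JSON.stringify({
      model,
      messages: [{ role: "user", content: prompt }],
    }),
  };
}

// Assumed: Ollama's default local address and OpenAI-compatible path.
const localOllama: CustomEndpoint = { baseUrl: "http://localhost:11434/v1/" };
const request = buildChatRequest(localOllama, "llama3.2", "Hello");
```

The resulting object can be passed to `fetch(request.url, request)`. Because the request shape is the same for a local server and a hosted gateway, only `baseUrl` and `apiKey` would change between the two.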

Built-in providers

stagewise supports these providers out of the box:
Provider | Available models
Anthropic | Claude Opus, Sonnet, Haiku
OpenAI | GPT-5 series, GPT-5 Codex
Google | Gemini 3 series
Moonshot AI | Kimi K2 series
Alibaba Cloud | Qwen 3 series
DeepSeek | DeepSeek V-series
Z.AI | GLM series
MiniMax | MiniMax M-series

Mixing modes

Each provider is configured independently. You can use your stagewise Account for OpenAI, your own Anthropic key, and a local Ollama endpoint for Google — all at the same time. The agent uses whichever mode is active for the provider of the currently selected model.
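
The lookup described above can be pictured as a per-provider map. This is a minimal sketch under stated assumptions, not stagewise's actual configuration format: the type names, mode identifiers, provider keys, and model ids are all hypothetical.

```typescript
// Hypothetical types for illustration; stagewise's real settings may differ.
type EndpointMode = "stagewise-account" | "byok" | "custom";

interface ModelInfo {
  id: string;       // placeholder model id
  provider: string; // key of the provider that hosts the model
}

// One active mode per provider, each configured independently.
const activeModes: Record<string, EndpointMode> = {
  openai: "stagewise-account",
  anthropic: "byok",
  google: "custom",
};

// The selected model determines the provider; the provider determines the mode.
function resolveMode(
  model: ModelInfo,
  modes: Record<string, EndpointMode>,
): EndpointMode {
  const mode = modes[model.provider];
  if (mode === undefined) {
    throw new Error(`No endpoint mode configured for provider "${model.provider}"`);
  }
  return mode;
}

const selected: ModelInfo = { id: "example-sonnet", provider: "anthropic" };
const modeForSelected = resolveMode(selected, activeModes);
```

Switching the selected model to one hosted by a different provider changes the result without touching the other providers' settings, which is the point of configuring each provider independently.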

Switching models

Switch models from the selector in the chat sidebar. Each agent instance can use a different model — run one agent on a fast model for simple tasks, another on a powerful model for complex architecture work.

Custom models

Add models not in the default list — fine-tuned models or models served by your custom endpoint. Add custom models →

What’s next

  • Import your subscriptions: Step-by-step guide to bringing your own API keys.
  • Custom providers: Connect local models via Ollama or any custom endpoint.
  • Models & providers reference: Full configuration reference for every provider and setting.