How the catalog works
When you send a request with a model name likegpt-4o, the gateway automatically:
- Resolves the model using the catalog
- Determines the correct provider (for example, OpenAI) and applies the appropriate upstream credentials
- Routes the request to the provider’s endpoint
provider:model format in the model string. For example:
Built-in providers
The catalog includes built-in providers—public AI APIs with known models and routing metadata. You can manage them on the Providers page.Authentication
Send traffic using an access key. This key authenticates your requests to the gateway, not to the upstream provider.Billing and provider credentials
The AI Gateway splits billing into two parts: using the gateway and paying the model provider. Some built-in providers don’t need a provider key. You can spot them in the tables above—if “Provider key required” says “No,” you can start using them right away as long as you have credits. The gateway takes care of authentication and charges your credits. Other providers do need a key. For those, add your provider key in app.ngrok.ai. The provider will bill you directly for model usage, and your credits will cover the gateway fee. This is often called BYOK (bring your own key).OpenAI
| Field | Value |
|---|---|
| Provider ID | openai |
| Aliases | openAI, open-ai, open-AI |
| Base URL | https://api.openai.com/v1/ |
| Website | openai.com |
| Provider key required | No |
OpenAI models
| Model ID | Display Name | Context Window | Output Tokens | Modalities |
|---|---|---|---|---|
gpt-5.4-pro | GPT-5.4 Pro | 1,050,000 | 128,000 | text, image |
gpt-5.4 | GPT-5.4 | 1,050,000 | 128,000 | text, image |
gpt-5.3-codex | GPT-5.3-Codex | 400,000 | 128,000 | text, image |
gpt-5.2-codex | GPT-5.2-Codex | 400,000 | 128,000 | text, image |
gpt-5.2-pro | GPT-5.2 Pro | 400,000 | 100,000 | text, image |
gpt-5.2 | GPT-5.2 | 400,000 | 128,000 | text, image |
gpt-5.2-chat-latest | GPT-5.2 Chat Latest | 400,000 | 128,000 | text, image |
gpt-5.1 | GPT-5.1 | 400,000 | 128,000 | text, image |
gpt-5.1-chat-latest | GPT-5.1 Chat Latest | 256,000 | 32,768 | text, image |
gpt-5 | GPT-5 | 400,000 | 128,000 | text, image |
gpt-5-mini | GPT-5 Mini | 400,000 | 128,000 | text, image |
gpt-5-nano | GPT-5 Nano | 400,000 | 128,000 | text, image |
gpt-5-chat-latest | GPT-5 Chat | 400,000 | 128,000 | text, image |
gpt-4.1 | GPT-4.1 | 1,000,000 | - | text, image |
gpt-4.1-mini | GPT-4.1 mini | 1,000,000 | - | text, image |
gpt-4.1-nano | GPT-4.1 nano | 1,000,000 | - | text, image |
gpt-4o | GPT-4o | 128,000 | 16,384 | text, image, audio |
gpt-4o-mini | GPT-4o Mini | 128,000 | 16,384 | text, image |
o4-mini | O4-Mini | 200,000 | 100,000 | text |
o4-mini-deep-research | O4-Mini-Deep-Research | 200,000 | 100,000 | text |
o3-pro | O3-Pro | 128,000 | 100,000 | text |
o3 | O3 | 128,000 | 100,000 | text |
o3-mini | O3 Mini | 200,000 | 100,000 | text |
o3-deep-research | O3-Deep-Research | 200,000 | 100,000 | text |
o1-pro | O1-Pro | 200,000 | 100,000 | text |
o1 | O1 | 128,000 | 100,000 | text |
gpt-4-turbo | GPT-4 Turbo | 128,000 | 4,096 | text, image |
gpt-4 | GPT-4 | 8,192 | 8,192 | text |
gpt-3.5-turbo | GPT-3.5 Turbo (deprecated, retires September 28, 2026) | 16,385 | 4,096 | text |
Anthropic
| Field | Value |
|---|---|
| Provider ID | anthropic |
| Aliases | Anthropic |
| Base URL | https://api.anthropic.com/v1/ |
| Website | anthropic.com |
| Provider key required | No |
Anthropic models
| Model ID | Display Name | Context Window | Output Tokens | Modalities |
|---|---|---|---|---|
claude-opus-4-6 | Claude Opus 4.6 | 1,000,000 | 128,000 | text, image |
claude-sonnet-4-6 | Claude Sonnet 4.6 | 1,000,000 | 64,000 | text, image |
claude-haiku-4-5 | Claude Haiku 4.5 | 200,000 | 64,000 | text, image |
claude-sonnet-4-5 | Claude Sonnet 4.5 | 1,000,000 | 64,000 | text, image |
claude-opus-4-5 | Claude Opus 4.5 | 200,000 | 64,000 | text, image |
claude-opus-4-1 | Claude Opus 4.1 | 200,000 | 32,000 | text, image |
claude-sonnet-4-0 | Claude Sonnet 4 | 1,000,000 | 64,000 | text, image |
claude-opus-4-0 | Claude Opus 4 | 200,000 | 32,000 | text, image |
claude-3-haiku-20240307 | Claude Haiku 3 (deprecated, retires April 20, 2026) | 200,000 | 4,096 | text, image |
| Field | Value |
|---|---|
| Provider ID | google |
| Aliases | Google, gemini |
| Base URL | https://generativelanguage.googleapis.com/v1beta/openai/ |
| Website | aistudio.google.com |
| Provider key required | Yes |
Google models
| Model ID | Display Name | Context Window | Output Tokens | Modalities |
|---|---|---|---|---|
gemini-2.5-pro | Gemini 2.5 Pro | 1,048,576 | 65,535 | text, image, audio, video, file |
gemini-2.5-flash | Gemini 2.5 Flash | 1,048,576 | 65,535 | text, image, audio, video, file |
gemini-2.5-flash-lite | Gemini 2.5 Flash-Lite | 1,048,576 | 65,535 | text, image, audio, video, file |
gemini-2.0-flash | Gemini 2.0 Flash | 1,048,576 | 8,192 | text, image, audio, video, file |
gemini-2.0-flash-lite | Gemini 2.0 Flash-Lite | 1,048,576 | 8,192 | text, image, audio, video, file |
gemini-3-pro-preview | Gemini 3 Pro Preview | 1,000,000 | 65,536 | text, image, audio, video, file |
DeepSeek
| Field | Value |
|---|---|
| Provider ID | deepseek |
| Aliases | DeepSeek, deep-seek |
| Base URL | https://api.deepseek.com |
| Website | deepseek.com |
| Provider key required | Yes |
DeepSeek models
| Model ID | Display Name | Context Window | Output Tokens | Modalities |
|---|---|---|---|---|
deepseek-reasoner | deepseek-reasoner | 128,000 | 64,000 | text |
deepseek-chat | deepseek-chat | 128,000 | 8,192 | text |
Groq
| Field | Value |
|---|---|
| Provider ID | groq |
| Base URL | https://api.groq.com/openai/v1 |
| Website | groq.com |
| Provider key required | Yes |
Groq models
| Model ID | Display Name | Context Window | Output Tokens | Modalities |
|---|---|---|---|---|
meta-llama/llama-3.1-8b-instant | Llama 3.1 8B Instant | 131,072 | 131,072 | text |
meta-llama/llama-3.3-70b-versatile | Llama 3.3 70B Versatile | 131,072 | 32,768 | text |
meta-llama/llama-prompt-guard-2-22m | Llama Prompt Guard 2 22M | 512 | 512 | text |
meta-llama/llama-prompt-guard-2-86m | Llama Prompt Guard 2 86M | 512 | 512 | text |
meta-llama/llama-guard-4-12b | Llama Guard 4 12B | 131,072 | 1,024 | text, image |
meta-llama/llama-4-maverick-17b-128e-instruct | Llama 4 Maverick 17B 128E Instruct | 131,072 | 8,192 | text, image |
meta-llama/llama-4-scout-17b-16e-instruct | Llama 4 Scout 17B 16E Instruct | 131,072 | 8,192 | text, image |
moonshotai/kimi-k2-instruct-0905 | Kimi K2 | 262,144 | 16,384 | text |
openai/gpt-oss-120b | GPT OSS 120B | 131,072 | 131,072 | text |
openai/gpt-oss-20b | GPT OSS 20B | 131,072 | 131,072 | text |
openai/gpt-oss-safeguard-20b | Safety GPT OSS 20B | 131,072 | 65,536 | text |
qwen/qwen3-32b | Qwen3-32B | 131,072 | 40,960 | text |
OpenRouter
| Field | Value |
|---|---|
| Provider ID | openrouter |
| Base URL | https://openrouter.ai/api/v1/ |
| Website | openrouter.ai |
| Provider key required | Yes |
Hyperbolic
| Field | Value |
|---|---|
| Provider ID | hyperbolic |
| Base URL | https://api.hyperbolic.xyz/v1/ |
| Website | hyperbolic.xyz |
| Provider key required | Yes |
InceptionLabs
| Field | Value |
|---|---|
| Provider ID | inceptionlabs |
| Website | inceptionlabs.ai |
| Provider key required | Yes |
Inference.net
| Field | Value |
|---|---|
| Provider ID | inference-net |
| Base URL | https://api.inference.net/v1/ |
| Website | inference.net |
| Provider key required | Yes |
Using models from the catalog
Simple model reference
Reference models directly by their ID:Explicit provider
Use theprovider:model format for explicit routing:
Custom providers
Built-in providers cover major public APIs. For self-hosted models (Ollama, vLLM, LM Studio) or other endpoints, create a custom provider in the Custom tab in app.ngrok.ai. Custom providers always require a provider key when the upstream needs authentication. See Custom providers to add one.Catalog updates
The model catalog is updated periodically to include new models and providers. For immediate access to models not yet in the catalog, add them explicitly to your provider configuration.Aliases reference
Model and provider names are not case-sensitive. For example,gpt-4o, GPT-4o, and Gpt-4O all resolve to the same model.
The following aliases are available in addition to the primary IDs listed above.
Provider aliases
| Provider ID | Aliases |
|---|---|
openai | openAI, open-ai, open-AI |
anthropic | Anthropic |
google | gemini |
deepseek | deep-seek |
openrouter | open-router |
inceptionlabs | inception-labs, inception |
inference.net | inference-net, inference_net |
groq | groqcloud |
OpenAI model aliases
| Alias | Resolves to |
|---|---|
gpt-4-omni | gpt-4o |
gpt-4o-2024-05-13 | gpt-4o |
gpt-4o-2024-08-06 | gpt-4o |
gpt-4o-2024-11-20 | gpt-4o |
chatgpt-4o-latest | gpt-4o |
gpt-4o-mini-2024-07-18 | gpt-4o-mini |
gpt-4-turbo-2024-04-09 | gpt-4-turbo |
gpt-4-turbo-preview | gpt-4-turbo |
gpt-4-1106-preview | gpt-4-turbo |
gpt-4-0125-preview | gpt-4-turbo |
gpt-4-0613 | gpt-4 |
gpt-4-0314 | gpt-4 |
gpt-4-32k | gpt-4 |
gpt-4.1-2025-04-14 | gpt-4.1 |
gpt-4.1-mini-2025-04-14 | gpt-4.1-mini |
gpt-4.1-nano-2025-04-14 | gpt-4.1-nano |
gpt-5.1-2025-11-13 | gpt-5.1 |
gpt-5.4-2026-03-05 | gpt-5.4 |
gpt-5.4-pro-2026-03-05 | gpt-5.4-pro |
gpt-5.2-pro-2025-12-11 | gpt-5.2-pro |
GPT-5.3-Codex | gpt-5.3-codex |
gpt-5.3-Codex | gpt-5.3-codex |
gpt5.3-codex | gpt-5.3-codex |
GPT-5 mini | gpt-5-mini |
GPT-5 nano | gpt-5-nano |
GPT-5 Chat | gpt-5-chat-latest |
gpt-3.5-turbo-0125 | gpt-3.5-turbo |
gpt-3.5-turbo-16k | gpt-3.5-turbo |
o4-mini-2025-04-16 | o4-mini |
o4-mini-deep-research-2025-06-26 | o4-mini-deep-research |
o3-pro-2025-06-10 | o3-pro |
o3-2025-04-16 | o3 |
o3-mini-2025-01-31 | o3-mini |
Anthropic model aliases
| Alias | Resolves to |
|---|---|
claude-opus-4-6-20260205 | claude-opus-4-6 |
claude-sonnet-4-6-20260217 | claude-sonnet-4-6 |
claude-haiku-4-5-20251001 | claude-haiku-4-5 |
claude-sonnet-4-5-20250929 | claude-sonnet-4-5 |
claude-opus-4-5-20251101 | claude-opus-4-5 |
claude-opus-4-1-20250805 | claude-opus-4-1 |
claude-opus-4.1 | claude-opus-4-1 |
claude-sonnet-4-20250514 | claude-sonnet-4-0 |
claude-sonnet-4 | claude-sonnet-4-0 |
claude-opus-4-20250514 | claude-opus-4-0 |
claude-opus-4 | claude-opus-4-0 |
claude-haiku-3 | claude-3-haiku-20240307 |
Google model aliases
| Alias | Resolves to |
|---|---|
gemini-3 | gemini-3-pro-preview |
gemini-3-pro | gemini-3-pro-preview |
Meta model aliases
| Alias | Resolves to |
|---|---|
llama-3.1-8b | llama-3.1-8b-instant |
llama-3.3-70b | llama-3.3-70b-versatile |
llama-4-maverick-17b-128e | llama-4-maverick-17b-128e-instruct |
llama-4-scout-17b-16e | llama-4-scout-17b-16e-instruct |
Moonshot AI model aliases
| Alias | Resolves to |
|---|---|
kimi-k2-instruct | kimi-k2 |
kimi-k2-instruct-0905 | kimi-k2 |
moonshotai/kimi-k2-instruct-0905 | kimi-k2 |