  1. Sign up — Create an account at platform.respan.ai
  2. Create an API key — Generate one on the API keys page
  3. Add credits or a provider key — Add credits on the Credits page or connect your own provider key on the Integrations page
Add the Docs MCP to your AI coding tool to get help building with Respan. No API key needed.
{
  "mcpServers": {
    "respan-docs": {
      "url": "https://docs.respan.ai/mcp"
    }
  }
}

Overview

The Respan gateway provides an OpenAI-compatible API endpoint that gives you access to 250+ models from all major providers through a single API key and base URL.
| Endpoint | Base URL |
| --- | --- |
| OpenAI-compatible | https://api.respan.ai/api/ |
| Anthropic proxy | https://api.respan.ai/api/anthropic/ |
| Google Gemini proxy | https://api.respan.ai/api/google/gemini |
Environment Switching: Respan doesn’t support an env parameter in API calls. To switch between environments (test/production), use different API keys — one for your test environment and another for production. Manage keys in API Keys settings.
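The key-per-environment approach can be sketched in a few lines. The RESPAN_TEST_API_KEY and RESPAN_PROD_API_KEY variable names below are illustrative, not something Respan itself requires:

```python
import os

def respan_api_key(env=None):
    """Return the Respan API key for the given environment.

    Assumes two keys are stored in RESPAN_TEST_API_KEY and
    RESPAN_PROD_API_KEY (names are this example's convention).
    """
    env = env or os.environ.get("APP_ENV", "test")
    name = "RESPAN_PROD_API_KEY" if env == "production" else "RESPAN_TEST_API_KEY"
    return os.environ[name]
```

Because the two keys point at separate environments, requests are kept apart in logs and billing without any extra request parameter.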

Quickstart

Step 1: Set environment variables

export RESPAN_API_KEY="YOUR_RESPAN_API_KEY"

Step 2: Make a request

import os
import requests

def demo_call(prompt,
              model="gpt-4o-mini",
              token=None):
    # Falls back to the RESPAN_API_KEY environment variable set in Step 1.
    token = token or os.environ["RESPAN_API_KEY"]
    headers = {
        'Content-Type': 'application/json',
        'Authorization': f'Bearer {token}',
    }

    data = {
        'model': model,
        'messages': [{'role': 'user', 'content': prompt}],
    }

    response = requests.post(
        'https://api.respan.ai/api/chat/completions',
        headers=headers,
        json=data,
    )
    return response

print(demo_call("Say 'Hello World'").json())

Step 3: Verify

Open the Logs page to see your gateway requests.

Switch models

Change the model parameter to use any supported provider through the same endpoint:
# OpenAI
model = "gpt-4o"
# Anthropic
# model = "claude-sonnet-4-5-20250929"
# Google
# model = "gemini-1.5-pro"
# DeepSeek
# model = "deepseek-chat"

response = requests.post(
    'https://api.respan.ai/api/chat/completions',
    headers=headers,
    json={'model': model, 'messages': [{'role': 'user', 'content': 'Hello'}]},
)
Browse the full model list to see all available models.
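The provider-agnostic shape of the request can be made explicit with a small helper; only the model string changes, never the endpoint or message format:

```python
def chat_payload(model, prompt):
    """Build an OpenAI-style request body; only the model string
    differs between providers on the Respan gateway."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }

# The same payload shape works for every provider:
for m in ("gpt-4o", "claude-sonnet-4-5-20250929", "gemini-1.5-pro", "deepseek-chat"):
    body = chat_payload(m, "Hello")
```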

OpenAI-compatible parameters

All standard OpenAI chat completion parameters are supported:
| Parameter | Type | Description |
| --- | --- | --- |
| messages | array | List of messages in OpenAI format (role + content). |
| model | string | Model to use (e.g. gpt-4o-mini, claude-sonnet-4-5-20250929). |
| stream | boolean | Stream back partial progress token by token. |
| temperature | number | Controls randomness (0-2). |
| max_tokens | number | Maximum tokens to generate. |
| top_p | number | Nucleus sampling threshold. |
| frequency_penalty | number | Penalize tokens by existing frequency. |
| presence_penalty | number | Penalize tokens by whether they appear in the text so far. |
| stop | array | Stop sequences. |
| tools | array | List of tools/functions the model may call. |
| tool_choice | string or object | Controls tool selection (none, auto, or a specific tool). |
| response_format | object | Force JSON output (json_object, json_schema, or text). |
| n | number | Number of completions to generate. |
| logprobs | boolean | Return log probabilities of output tokens. |
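As an illustration, a request that pins down sampling and forces JSON output could combine several of these parameters (the values here are only examples):

```json
{
  "model": "gpt-4o-mini",
  "messages": [{"role": "user", "content": "Reply with a JSON object"}],
  "temperature": 0.2,
  "max_tokens": 256,
  "response_format": {"type": "json_object"}
}
```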

Respan parameters

Pass Respan-specific parameters in the request body alongside OpenAI parameters. When using the OpenAI SDK, pass them via extra_body.
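A minimal sketch of how the two parameter sets combine over plain HTTP: the Respan fields are merged into the same JSON object as the OpenAI parameters. With the OpenAI SDK you would pass the same dict as extra_body= instead of merging it yourself. The helper name is this example's own, not part of any SDK:

```python
def with_respan_params(body, **respan_params):
    """Merge Respan-specific fields into an OpenAI-style request body.

    Over plain HTTP the fields sit alongside the OpenAI parameters
    in one JSON object; with the OpenAI SDK the same dict goes in
    extra_body= instead.
    """
    merged = dict(body)
    merged.update(respan_params)
    return merged

body = with_respan_params(
    {"model": "gpt-4o-mini", "messages": [{"role": "user", "content": "Hello"}]},
    customer_identifier="user_123",
)
```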

Observability

| Parameter | Type | Description |
| --- | --- | --- |
| customer_identifier | string | Tag to identify the user. See customer identifier. |
| metadata | object | Custom key-value pairs for filtering and search. See custom properties. |
| custom_identifier | string | Extra indexed tag (shows as "Custom ID" in spans). |
| disable_log | boolean | When true, only metrics are recorded; input/output messages are omitted. |
| request_breakdown | boolean | Include a breakdown of the response (tokens, cost, latency). |
{
  "model": "gpt-4o-mini",
  "messages": [{"role": "user", "content": "Hello"}],
  "customer_identifier": "user_123",
  "metadata": {"session_id": "abc123", "team": "ml"},
  "custom_identifier": "feature-x"
}

Reliability

| Parameter | Type | Description |
| --- | --- | --- |
| fallback_models | array | Backup models ranked by priority. See fallback models. |
| load_balance_group | object | Balance requests across models. See load balancing. |
| retry_params | object | Configure retries (retry_enabled, num_retries, retry_after). See retries. |
{
  "model": "gpt-4o-mini",
  "messages": [{"role": "user", "content": "Hello"}],
  "fallback_models": ["gemini-1.5-pro", "claude-sonnet-4-5-20250929"],
  "retry_params": {
    "retry_enabled": true,
    "num_retries": 3,
    "retry_after": 1
  }
}

Caching

| Parameter | Type | Description |
| --- | --- | --- |
| cache_enabled | boolean | Enable response caching. See caches. |
| cache_ttl | number | Cache time-to-live in seconds (default: 30 days). |
| cache_options | object | Set cache_by_customer: true to scope the cache per customer. |
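A request enabling per-customer caching might look like the sketch below; the 600-second TTL and identifier values are illustrative only:

```json
{
  "model": "gpt-4o-mini",
  "messages": [{"role": "user", "content": "Hello"}],
  "cache_enabled": true,
  "cache_ttl": 600,
  "cache_options": {"cache_by_customer": true},
  "customer_identifier": "user_123"
}
```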

Credentials

| Parameter | Type | Description |
| --- | --- | --- |
| customer_credentials | object | Pass your customer's provider API keys. See provider keys. |
| credential_override | object | One-off credential overrides for specific models (e.g. Azure deployments). |
| model_name_map | object | Map default model names to custom Azure deployment names. |
{
  "model": "azure/gpt-4o",
  "credential_override": {
    "azure/gpt-4o": {
      "api_key": "your-azure-key",
      "api_base": "your-azure-base-url",
      "api_version": "2024-02-01"
    }
  }
}

Prompt management

| Parameter | Type | Description |
| --- | --- | --- |
| prompt | object | Use a Respan-managed prompt template. See prompt management. |
{
  "model": "gpt-4o-mini",
  "messages": [],
  "prompt": {
    "prompt_id": "your-prompt-id",
    "variables": {
      "user_name": "Sarah"
    }
  }
}

Response format

{
  "id": "chatcmpl-e1b9665b-c354-41c5-bbe5-178bd0b69773",
  "object": "chat.completion",
  "created": 1761546960,
  "model": "claude-sonnet-4-5-20250929",
  "choices": [
    {
      "index": 0,
      "finish_reason": "stop",
      "message": {
        "role": "assistant",
        "content": "I'm doing well, thank you for asking! How can I help you today?"
      }
    }
  ],
  "usage": {
    "completion_tokens": 20,
    "prompt_tokens": 2619,
    "total_tokens": 2639,
    "completion_tokens_details": {
      "accepted_prediction_tokens": 0,
      "audio_tokens": 0,
      "reasoning_tokens": 0,
      "rejected_prediction_tokens": 0
    },
    "prompt_tokens_details": {
      "audio_tokens": 0,
      "cached_tokens": 2601,
      "cache_creation_tokens": 0
    },
    "cache_creation_input_tokens": 0,
    "cache_read_input_tokens": 2601
  }
}
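Because the response follows the OpenAI chat.completion shape, the fields of interest can be pulled out with plain dictionary access. A small sketch (the helper name is this example's own):

```python
def extract_reply(completion):
    """Pull the assistant text and token usage out of a
    chat.completion response shaped like the sample above."""
    text = completion["choices"][0]["message"]["content"]
    usage = completion["usage"]
    return text, usage["total_tokens"], usage["prompt_tokens_details"]["cached_tokens"]
```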

Next Steps