  1. Sign up — Create an account at platform.respan.ai
  2. Create an API key — Generate one on the API keys page
  3. Add credits or a provider key — Add credits on the Credits page or connect your own provider key on the Integrations page
Add the Docs MCP to your AI coding tool to get help building with Respan. No API key needed.
{
  "mcpServers": {
    "respan-docs": {
      "url": "https://docs.respan.ai/mcp"
    }
  }
}
This integration is for the Respan gateway.

What is the OpenAI SDK?

The OpenAI SDK is the most robust integration path for accessing multiple model providers. Because most AI providers prioritize OpenAI SDK compatibility, you can seamlessly call all 250+ models available through the Respan gateway.

Quickstart

Step 1: Install OpenAI SDK

  • Get a Respan API key
  • Add your provider credentials
  • Install packages
pip install openai

Step 2: Initialize Client

from openai import OpenAI

client = OpenAI(
    base_url="https://api.respan.ai/api/",
    api_key="YOUR_RESPAN_API_KEY",  # Get from Respan dashboard
)

Step 3: Make Your First Request

response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Hello, world!"}],
)
print(response.choices[0].message.content)

Step 4: See your logs on the platform

Switch models

# Switch providers by changing only the model string
model = "gpt-4o"                        # OpenAI
# model = "claude-3-5-sonnet-20241022"  # Anthropic
# model = "gemini-1.5-pro"              # Google

response = client.chat.completions.create(
    model=model,
    messages=[{"role": "user", "content": "Your message"}],
)
See the full model list for all available models.

Supported parameters

OpenAI parameters

All OpenAI parameters are supported; pass them directly in the request body.
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Tell me a story"}],
    temperature=0.7,        # Control randomness
    max_tokens=1000,        # Limit response length
    top_p=0.9,              # Nucleus sampling
    frequency_penalty=0.1,  # Reduce repetition
    presence_penalty=0.1,   # Encourage topic diversity
    stream=True,            # Enable streaming
)
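With stream=True the call returns an iterator of chunks rather than a single response object. A minimal sketch of collecting the streamed deltas into full text; the helper and the stand-in chunk objects below are illustrative, not part of the SDK:

```python
from types import SimpleNamespace

def collect_stream(chunks):
    """Concatenate the content deltas from a chat.completions stream."""
    parts = []
    for chunk in chunks:
        delta = chunk.choices[0].delta.content
        if delta is not None:  # the final chunk carries no content
            parts.append(delta)
    return "".join(parts)

# Stand-in chunks shaped like the SDK's streaming objects, for illustration:
def _chunk(text):
    return SimpleNamespace(choices=[SimpleNamespace(delta=SimpleNamespace(content=text))])

fake_stream = [_chunk("Once"), _chunk(" upon"), _chunk(" a time."), _chunk(None)]
story = collect_stream(fake_stream)
```

With a real streaming response, the same loop applies: pass the object returned by the SDK call directly to `collect_stream`.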

Respan Parameters

Respan-specific parameters can be passed via extra_body for request tracking and customization.
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Tell me a story"}],
    extra_body={
        "customer_identifier": "user_123",        # Track specific users
        "fallback_models": ["gpt-3.5-turbo"],     # Automatic fallbacks
        "metadata": {"session_id": "abc123"},     # Custom metadata
        "thread_identifier": "conversation_456",  # Group related messages
        "group_identifier": "team_alpha",         # Organize by groups
    }
)

Prompt composition

A variable in one prompt can reference another prompt. The child prompt is rendered first and injected into the parent. See Prompt composition for setup details.
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[],
    extra_body={
        "prompt": {
            "prompt_id": "PARENT_PROMPT_ID",
            "override": True,
            "variables": {
                "request": "dispute a charge from last month",
                "conversation": {
                    "_type": "prompt",
                    "prompt_id": "CHILD_PROMPT_ID",
                    "version": 2,
                    "variables": {
                        "customer_name": "Sarah",
                        "department": "billing"
                    }
                }
            }
        }
    },
)

Prompt schema (v1 vs v2)

The prompt object supports a schema_version field that controls how prompt configuration and request parameters are merged. See the full guide for details.
  • Prompt schema v1 (default, legacy): override flag controls which side wins for conflicts.
  • Prompt schema v2 (recommended, schema_version=2): prompt config always wins. Supports a patch field for non-message parameter overrides.
OpenAI SDKs strip fields like schema_version, patch, and prompt_slug during validation. Prompt schema v2 requires raw HTTP requests instead of the OpenAI SDK. See the Standard API examples.
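As a sketch of the raw-HTTP route for schema v2: the payload mirrors the prompt fields shown in this guide, but the exact endpoint path and the `patch` contents below are assumptions to verify against the Standard API examples.

```python
import json
import urllib.request

# Hypothetical schema-v2 payload; prompt_id and patch values are placeholders.
payload = {
    "model": "gpt-4o-mini",
    "messages": [],
    "prompt": {
        "prompt_id": "PARENT_PROMPT_ID",
        "schema_version": 2,  # stripped by OpenAI SDKs, so send raw HTTP
        "variables": {"request": "dispute a charge from last month"},
        "patch": {"temperature": 0.2},  # non-message parameter override
    },
}

def send_request(api_key):
    # Assumed endpoint based on the base_url used elsewhere in this guide;
    # confirm the path against the Standard API examples.
    req = urllib.request.Request(
        "https://api.respan.ai/api/chat/completions",
        data=json.dumps(payload).encode(),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)
```

Call `send_request("YOUR_RESPAN_API_KEY")` to issue the request.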

Responses API

The Responses API is OpenAI’s most advanced interface for generating model responses. It supports text and image inputs, text outputs, and stateful interactions using the output of previous responses as input. Extend the model’s capabilities with built-in tools for file search, web search, computer use, and more.
This works exclusively with OpenAI models and cannot be used with models from other providers.
Pass-through Integration Limitations: This is a pass-through integration. Some Respan features are not available, including:
  • User Rate Limits: You cannot enforce rate limits on your users.
  • Fallbacks: You cannot set up fallback models.
  • Load Balancing: You cannot distribute traffic across multiple models or credentials.
  • Prompt Management: You cannot use prompts stored in Respan directly.
Pass Respan parameters via a base64-encoded header:
from base64 import b64encode
import json

respan_params = {
    "metadata": {
        "paid_user": "true",
    }
}

respan_params_header = {
    "X-Data-Respan-Params": b64encode(json.dumps(respan_params).encode()).decode(),
}
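To sanity-check the header value, decoding it should round-trip back to the original parameters:

```python
from base64 import b64encode, b64decode
import json

respan_params = {"metadata": {"paid_user": "true"}}
encoded = b64encode(json.dumps(respan_params).encode()).decode()

# Decoding the header value recovers the original parameters.
decoded = json.loads(b64decode(encoded))
```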

Text input

response = client.responses.create(
    model="gpt-4o",
    input="Tell me a three sentence bedtime story about a unicorn.",
    extra_headers=respan_params_header,
)
print(response)
File search

response = client.responses.create(
    model="gpt-4o",
    tools=[
        {
            "type": "file_search",
            "vector_store_ids": ["vs_67d3bdd0c8888191adfa890a9e829480"],
            "max_num_results": 20,
        }
    ],
    input="What are the attributes of an ancient brown dragon?",
    extra_headers=respan_params_header,
)

Reasoning

response = client.responses.create(
    model="o3-mini",
    input="How much wood would a woodchuck chuck?",
    reasoning={"effort": "high"},
    extra_headers=respan_params_header,
)
print(response)

Streaming

response = client.responses.create(
    model="gpt-4o",
    instructions="You are a helpful assistant.",
    input="Hello!",
    stream=True,
    extra_headers=respan_params_header,
)

for chunk in response:
    print(chunk)

Functions

tools = [
    {
        "type": "function",
        "name": "get_current_weather",
        "description": "Get the current weather in a given location",
        "parameters": {
            "type": "object",
            "properties": {
                "location": {
                    "type": "string",
                    "description": "The city and state, e.g. San Francisco, CA",
                },
                "unit": {"type": "string", "enum": ["celsius", "fahrenheit"]},
            },
            "required": ["location", "unit"],
        },
    }
]

response = client.responses.create(
    model="gpt-4o",
    tools=tools,
    input="What is the weather like in Boston today?",
    tool_choice="auto",
    extra_headers=respan_params_header,
)
print(response)
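When the model decides to call a function, the Responses API returns function-call entries in the response's output list, with the arguments serialized as a JSON string. A minimal sketch of pulling them out; the helper and the stand-in output items below are illustrative, not part of the SDK:

```python
import json
from types import SimpleNamespace

def extract_function_calls(output_items):
    """Collect (name, parsed_arguments) pairs from Responses API output items."""
    calls = []
    for item in output_items:
        if getattr(item, "type", None) == "function_call":
            calls.append((item.name, json.loads(item.arguments)))
    return calls

# Stand-in output item shaped like the API's function_call entries:
fake_output = [
    SimpleNamespace(
        type="function_call",
        name="get_current_weather",
        arguments='{"location": "Boston, MA", "unit": "celsius"}',
    )
]
calls = extract_function_calls(fake_output)
```

With a real response, pass `response.output` to `extract_function_calls`, run the named function with the parsed arguments, and send the result back in a follow-up request.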

Azure OpenAI

To call Azure OpenAI models, you don't need Azure's dedicated client; the easier path is to keep using the standard OpenAI client through Respan.
1. Go to [Respan Providers](https://platform.respan.ai/platform/api/providers)
2. Add your Azure OpenAI credentials
3. Configure your Azure deployment settings
4. Use Azure models through the same Respan endpoint

View your analytics

Access your Respan dashboard to see detailed analytics.

Next Steps