Send a response request through the Respan gateway using the OpenAI Responses API format. Supports streaming, tool use, and prompt management.
Respan-specific parameters can be passed three ways:
respan_params - explicit namespacing to avoid conflictsX-Data-Respan-Params - base64-encoded JSON headerMerge order: top-level body fields > respan_params > header.
Legacy compatibility:
keywordsai_params is still accepted and merged into respan_paramsX-Data-Keywordsai-Params is still accepted and auto-renamed internallyBearer token. Use Bearer YOUR_API_KEY.
Base64-encoded JSON object of Respan parameters. Legacy X-Data-Keywordsai-Params is still accepted.
Pin the request to a specific provider without changing the model slug. Example: vertex_ai routes a claude-sonnet-4-5-20250929 request to Vertex AI Claude.
Comma-separated beta feature flags. Available: token-breakdown-2026-03-26, env-scoped-integrations-2026-03-28
Stream the response as server-sent events.
Sampling temperature (0-2).
ID of a previous response for multi-turn conversations.
Per-customer LLM provider credentials.
One-off credential overrides per provider.
Cache TTL in seconds. Default: 30 days.
Cache behavior options. Properties: cache_by_customer, is_cached_by_model, omit_log.
Load balance group selection. Use {"group_id": "..."} to route through a configured group.
Prompt template config. Properties: prompt_id (required), variables, version, echo. See Prompt management.
Retry config. Properties: retry_enabled (boolean), num_retries, retry_after (seconds).
When true, omits input/output from the log. Metrics still recorded.
Custom key-value metadata attached to the span.
Extended customer info. Properties: customer_identifier (required), group_identifier, name, email, period_budget, budget_duration (daily/weekly/monthly), total_budget, markup_percentage.
User feedback. true = liked, false = disliked.