Provider: Custom / Self-hosted
Provider: Custom / Self-hosted
Call your own self-hosted or custom LLM endpoint through Respan Gateway.
Bring your own OpenAI-compatible endpoint and call it through Respan Gateway to get unified observability (logs, cost, latency, reliability) alongside the rest of your models.
Quick setup
Add your custom provider
On the Providers page, click Add Custom Provider and fill in a name, base URL, and API key (if your endpoint requires one).
Create a custom model
On the Models page, add a model entry that points to your custom provider. Use the model ID your endpoint expects.
Call your custom model
The base URL is https://api.respan.ai/api and the only key needed is your RESPAN_API_KEY. Replace MY_CUSTOM_MODEL_ID with the model ID you created.
OpenAI SDK
Respan API
Point the OpenAI SDK at the Respan gateway and call your custom model.
More integrations
Custom models work with every Respan gateway integration over the OpenAI-compatible API:
Override credentials per model (Optional)
Use credential_override when a single request should route to a different endpoint or API key than the one configured on your custom provider.
Log without proxying (Optional)
Already calling your endpoint directly? Send logs to Respan asynchronously to track cost, latency, and performance for those external calls.
See the logging guide for the full setup.