Provider: Baseten
Provider: Baseten
Use Respan Gateway to call Baseten-hosted models (DeepSeek, Llama, Qwen, and other open-source deployments) while keeping unified observability (logs, cost, latency, reliability) in Respan.
Quick setup
Add credits (recommended)
Top up credits to pay through Respan. No Baseten key required, Respan handles provider auth and billing.
Prefer to route through your own Baseten account? See Use your own Baseten key.
Send your first request
Pick the integration that matches your stack. The base URL is https://api.respan.ai/api and the only key needed is your RESPAN_API_KEY.
OpenAI SDK
Vercel AI SDK
Respan API
Baseten is OpenAI-compatible. Point the OpenAI SDK at the Respan gateway and call any Baseten model.
More integrations
Baseten models work with every Respan gateway integration:
Switch models
Change the model parameter to call any supported model through the same client. Use the baseten/ prefix to disambiguate when routing across providers. Browse the full list on the Models page.
Use your own Baseten key (BYOK)
Credits are the default path. If you’d rather bill Baseten directly, attach your own provider key.
Global (UI)
Per-request (Code)
Override credentials per model (Optional)
Use credential_override when one model on a request should use a different Baseten key than the default.
Log without proxying (Optional)
Already calling Baseten directly? Send logs to Respan asynchronously to track cost, latency, and performance for those external calls.
See the logging guide for the full setup.