Connect any LLM provider through Respan. Route requests, compare models, and monitor performance across all your providers in one place.
OpenAI
Monitor OpenAI API usage, cost, and performance.
Anthropic
Track Anthropic Claude API calls, cost, and latency.
OpenRouter
Monitor OpenRouter requests across multiple models.
Groq
Track Groq inference calls and latency metrics.
Fireworks
Monitor Fireworks AI inference usage and performance.
Together AI
Track Together AI model requests and token usage.
Perplexity
Monitor Perplexity API calls and search-augmented responses.
Azure OpenAI
Track Azure OpenAI Service deployments and usage.
AWS Bedrock
Monitor AWS Bedrock model invocations and costs.
Google Vertex AI
Track Google Vertex AI model calls and performance.
Google Gemini
Monitor Google Gemini API requests and token usage.
Nebius AI
Track Nebius AI inference calls and usage metrics.
Novita AI
Monitor Novita AI model requests and performance.
AI21 Labs
Monitor AI21 Labs model requests and token usage.
AssemblyAI
Monitor AssemblyAI speech-to-text and audio intelligence calls.
Baseten
Monitor Baseten model deployments and inference calls.
Cohere
Track Cohere model requests, embeddings, and reranking calls.
DeepSeek
Monitor DeepSeek model requests and reasoning token usage.
Inference
Monitor Inference API calls and model performance.
Mistral
Track Mistral model requests and performance metrics.
Moonshot
Monitor Moonshot AI model requests and token usage.
Nextbit
Monitor Nextbit model requests and inference metrics.
Parasail
Monitor Parasail inference calls and model performance.
Reducto
Monitor Reducto document processing and extraction calls.
Replicate
Monitor Replicate model predictions and GPU usage.
xAI
Monitor xAI Grok model requests and performance.