Groq

Route Groq model calls through Respan Gateway using your own Groq credits.
  1. Sign up — Create an account at platform.respan.ai
  2. Create an API key — Generate one on the API keys page
  3. Add credits or a provider key — Add credits on the Credits page or connect your own provider key on the Integrations page

Add the Docs MCP to your AI coding tool to get help building with Respan. No API key needed.

{
  "mcpServers": {
    "respan-docs": {
      "url": "https://docs.respan.ai/mcp"
    }
  }
}

This section is for Respan LLM gateway users.

Use Respan Gateway to call Groq models while keeping unified observability (logs, cost, latency, and reliability metrics) in Respan — and optionally charge usage to your own Groq credits.

Prerequisites

  • A Respan API key
  • A Groq API key (BYOK credits)

Configuration

There are 2 ways to add your Groq credentials to your requests:

Via UI (Global)

  1. Add your Groq API key: on the Integrations page, select Groq and paste your API key.
  2. Configure load balancing (optional): you can add multiple Groq API keys for redundancy. Use the Load balancing weight field to determine how traffic is distributed between keys.
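
To build intuition for the weight field, the sketch below assumes traffic is split in proportion to each key's weight. That proportional split is an assumption, since this page does not specify the exact algorithm. The key labels and the `expected_share` helper are hypothetical.

```python
def expected_share(weights):
    """Return each key's expected fraction of traffic, assuming a
    weight-proportional split (an assumption, not confirmed Respan behavior)."""
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("at least one weight must be positive")
    return {name: weight / total for name, weight in weights.items()}


# Hypothetical labels for two Groq keys added in the UI.
shares = expected_share({"groq-key-primary": 3, "groq-key-backup": 1})
print(shares)
```

Under this assumption, a key with weight 3 next to a key with weight 1 would receive about three quarters of the requests.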

Via code (Per-Request)

You can pass credentials dynamically in the request body. This is useful if you need to use your users’ own API keys (BYOK credits).

Add the customer_credentials parameter to your Gateway request:

{
  // Rest of the request body
  "customer_credentials": {
    "groq": {
      "api_key": "YOUR_GROQ_API_KEY"
    }
  }
}
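
As a minimal sketch of the per-request approach, the helper below merges `customer_credentials` into an existing request body. The helper name `with_groq_credentials` is hypothetical, and the `model` and `messages` fields are placeholders that assume a chat-style request body; send the resulting body to your Gateway endpoint as you normally would.

```python
import copy
import json


def with_groq_credentials(body, groq_api_key):
    """Return a copy of a Gateway request body with the Groq key attached
    under customer_credentials, leaving the original dict untouched."""
    merged = copy.deepcopy(body)
    merged.setdefault("customer_credentials", {})["groq"] = {"api_key": groq_api_key}
    return merged


# Hypothetical request body; the model name and message are placeholders.
base_body = {
    "model": "groq/llama-3.1-8b-versatile",
    "messages": [{"role": "user", "content": "Hello"}],
}
request_body = with_groq_credentials(base_body, "YOUR_GROQ_API_KEY")
print(json.dumps(request_body, indent=2))
```

Copying the body first keeps a user's key out of any shared template dict, which matters when each request carries a different user's credentials.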

Override credentials for a particular model (Optional)

If you uploaded provider credentials in the UI, you can still override credentials for specific models on a per-request basis.

{
  // Rest of the request body
  "customer_credentials": {
    "groq": {
      "api_key": "YOUR_GROQ_API_KEY"
    }
  },
  "credential_override": {
    "groq/llama-3.1-8b-versatile": {
      "api_key": "ANOTHER_GROQ_API_KEY"
    }
  }
}
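
The precedence described above can be sketched as a small lookup: a model-specific entry in `credential_override` wins, otherwise the provider-level entry in `customer_credentials` applies. This illustrates the intended behavior under that assumption and is not the Gateway's actual implementation; `resolve_api_key` and the model name `groq/some-other-model` are hypothetical.

```python
def resolve_api_key(body, model):
    """Pick the key this sketch assumes would apply to `model`:
    a model-specific override first, then the provider-level credentials."""
    override = body.get("credential_override", {}).get(model)
    if override is not None:
        return override["api_key"]
    provider = model.split("/", 1)[0]  # "groq/..." -> "groq"
    creds = body.get("customer_credentials", {}).get(provider)
    # None means nothing was passed in the request for this provider.
    return creds["api_key"] if creds else None


body = {
    "customer_credentials": {"groq": {"api_key": "YOUR_GROQ_API_KEY"}},
    "credential_override": {
        "groq/llama-3.1-8b-versatile": {"api_key": "ANOTHER_GROQ_API_KEY"}
    },
}

print(resolve_api_key(body, "groq/llama-3.1-8b-versatile"))  # overridden model
print(resolve_api_key(body, "groq/some-other-model"))        # provider default
```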

Log Groq requests

If you are not using the Gateway to proxy requests, you can still log your Groq requests to Respan asynchronously. This allows you to track cost, latency, and performance metrics for external calls.

Groq Python SDK
import requests

# Respan endpoint for logging requests made outside the Gateway.
url = "https://api.respan.ai/api/request-logs/create/"
payload = {
    "model": "llama3-8b-8192",
    "prompt_messages": [
        {
            "role": "user",
            "content": "Write a short poem about AI"
        }
    ],
    "completion_message": {
        "role": "assistant",
        "content": "In circuits bright and data streams, AI awakens from digital dreams..."
    },
    "cost": 0.0001,
    "generation_time": 0.8,
    "customer_params": {
        "customer_identifier": "user_101"
    }
}
headers = {
    "Authorization": "Bearer YOUR_RESPAN_API_KEY",
    "Content-Type": "application/json"
}

response = requests.post(url, headers=headers, json=payload, timeout=10)
response.raise_for_status()  # Raise an error if the log request was rejected.
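
One way to fill in `generation_time` is to time the Groq call yourself and build the log payload from the result. The sketch below uses a stand-in function instead of a real Groq call so it runs offline. The names `timed_call`, `build_log_payload`, and `fake_groq_call` are hypothetical, only payload fields from the example above are used, and treating `generation_time` as seconds is an assumption.

```python
import time


def timed_call(fn, *args, **kwargs):
    """Run fn and return (result, elapsed_seconds)."""
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    return result, time.perf_counter() - start


def build_log_payload(model, messages, completion_text, generation_time,
                      customer_identifier):
    """Assemble a request-log payload using fields from the example above."""
    return {
        "model": model,
        "prompt_messages": messages,
        "completion_message": {"role": "assistant", "content": completion_text},
        "generation_time": generation_time,  # assumed to be in seconds
        "customer_params": {"customer_identifier": customer_identifier},
    }


def fake_groq_call(messages):
    """Stand-in for a real Groq call; returns canned text."""
    return "In circuits bright and data streams..."


messages = [{"role": "user", "content": "Write a short poem about AI"}]
text, elapsed = timed_call(fake_groq_call, messages)
payload = build_log_payload("llama3-8b-8192", messages, text, elapsed, "user_101")
# POST `payload` to the request-logs endpoint with your Respan API key,
# for example with requests.post(url, headers=headers, json=payload).
print(payload["model"], round(payload["generation_time"], 3))
```

Sending the log after the Groq response has been returned to the caller, for example from a background worker, keeps the logging request off the user-facing latency path.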