For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
DiscordPlatform
DocumentationIntegrationsAPI referenceSDKsChangelog
DocumentationIntegrationsAPI referenceSDKsChangelog
    • Overview
  • Tracing
  • Gateway
      • OpenAI SDK
      • Instructor
      • Anthropic SDK
      • Google GenAI
      • LiteLLM
      • RubyLLM
      • Vertex AI
      • AWS Bedrock
      • Cohere
      • Groq
      • Mistral AI
      • Ollama
      • Portkey
      • Watsonx
      • Together AI
      • Aleph Alpha
      • HuggingFace
      • Replicate
      • SageMaker
      • Respan API
  • Others
  • Migrating
    • Braintrust
    • Langfuse
    • Portkey
LogoLogo
DiscordPlatform
On this page
  • Setup
  • Switch models
  • Respan parameters
GatewayLLM SDKs

Groq (gateway)

Was this page helpful?
Previous

Mistral AI (gateway)

Next
Built with

Route Groq model calls through the Respan gateway to use Respan credentials, request logs, routing, fallbacks, and metadata. For direct Groq SDK tracing, see Groq tracing setup.

Setup

1

Install packages

$pip install openai python-dotenv
2

Set environment variables

$export RESPAN_API_KEY="YOUR_RESPAN_API_KEY"

No GROQ_API_KEY is required for gateway calls when your Groq provider credentials are configured in Respan.

3

Point an OpenAI-compatible client to the Respan gateway

1import os
2
3from dotenv import load_dotenv
4from openai import OpenAI
5
6load_dotenv()
7
8client = OpenAI(
9 api_key=os.environ["RESPAN_API_KEY"],
10 base_url=os.getenv("RESPAN_BASE_URL", "https://api.respan.ai/api"),
11)
12
13response = client.chat.completions.create(
14 model="groq/llama-3.1-8b-instant",
15 messages=[{"role": "user", "content": "Say hello in three languages."}],
16)
17print(response.choices[0].message.content)

Switch models

Change the model parameter to use 250+ models from different providers through the same gateway.

1response = client.chat.completions.create(model="groq/llama-3.1-8b-instant", messages=messages)
2response = client.chat.completions.create(model="gpt-4.1-nano", messages=messages)
3response = client.chat.completions.create(model="claude-sonnet-4-5-20250929", messages=messages)

See the full model list.

Respan parameters

Pass additional Respan parameters via extra_body for gateway features.

1response = client.chat.completions.create(
2 model="groq/llama-3.1-8b-instant",
3 messages=[{"role": "user", "content": "Hello"}],
4 extra_body={
5 "customer_identifier": "user_123",
6 "fallback_models": ["gpt-4.1-nano"],
7 "metadata": {"session_id": "abc123"},
8 "thread_identifier": "conversation_456",
9 },
10)

See Respan params & metadata for the full list.