Google GenAI (Gemini)

Set up Respan

Sign up — Create an account at platform.respan.ai
Create an API key — Generate one on the API keys page
Add credits or a provider key — Add credits on the Credits page or connect your own provider key on the Integrations page

Use AI

Add the Docs MCP to your AI coding tool to get help building with Respan. No API key needed.

{
  "mcpServers": {
    "respan-docs": {
      "url": "https://docs.respan.ai/mcp"
    }
  }
}

What is Google GenAI SDK?

The Google GenAI SDK is the official Python client for Google’s Gemini models, supporting content generation, streaming, and structured output. Respan can auto-instrument all GenAI calls for tracing, route them through the Respan gateway, or both.

Setup

Install packages

Tracing
Gateway
Both

OpenInference (Recommended)
Traceloop

pip install respan-ai openinference-instrumentation-google-generativeai google-genai python-dotenv

pip install respan-ai opentelemetry-instrumentation-google-generativeai google-genai python-dotenv

pip install openai python-dotenv

The gateway uses OpenAI SDK format for Google GenAI models.

OpenInference (Recommended)
Traceloop

pip install respan-ai openinference-instrumentation-google-generativeai google-genai openai python-dotenv

pip install respan-ai opentelemetry-instrumentation-google-generativeai google-genai openai python-dotenv

Set environment variables

Tracing
Gateway
Both

export GOOGLE_API_KEY="YOUR_GOOGLE_API_KEY"
export RESPAN_API_KEY="YOUR_RESPAN_API_KEY"

export RESPAN_API_KEY="YOUR_RESPAN_API_KEY"

No GOOGLE_API_KEY needed — the Respan gateway handles provider authentication.

export GOOGLE_API_KEY="YOUR_GOOGLE_API_KEY"
export RESPAN_API_KEY="YOUR_RESPAN_API_KEY"

Initialize and run

Tracing
Gateway
Both

OpenInference (Recommended)
Traceloop

import os
from dotenv import load_dotenv

load_dotenv()

from google import genai
from respan import Respan
from openinference.instrumentation.google_generativeai import GoogleGenerativeAIInstrumentor

# Initialize Respan with Google GenAI instrumentation
respan = Respan(instrumentations=[GoogleGenerativeAIInstrumentor()])

# Calls go directly to Google, auto-traced by Respan
client = genai.Client()

response = client.models.generate_content(
    model="gemini-2.5-flash",
    contents="Say hello in three languages.",
)
print(response.text)
respan.flush()

import os
from dotenv import load_dotenv

load_dotenv()

from google import genai
from respan import Respan

# Auto-discover and activate all installed instrumentors
respan = Respan(is_auto_instrument=True)

# Calls go directly to Google, auto-traced by Respan
client = genai.Client()

response = client.models.generate_content(
    model="gemini-2.5-flash",
    contents="Say hello in three languages.",
)
print(response.text)
respan.flush()

import os
from dotenv import load_dotenv

load_dotenv()

from openai import OpenAI

# Route all LLM calls through the Respan gateway (OpenAI SDK format)
client = OpenAI(
    api_key=os.getenv("RESPAN_API_KEY"),
    base_url=os.getenv("RESPAN_BASE_URL", "https://api.respan.ai/api"),
)

response = client.chat.completions.create(
    model="gemini-2.5-flash",
    messages=[{"role": "user", "content": "Say hello in three languages."}],
)
print(response.choices[0].message.content)

The gateway uses the OpenAI SDK format. You can also use the Google GenAI SDK directly with the gateway by setting base_url to https://api.respan.ai/api/google/gemini.

OpenInference (Recommended)
Traceloop

import os
from dotenv import load_dotenv

load_dotenv()

from google import genai
from openai import OpenAI
from respan import Respan
from openinference.instrumentation.google_generativeai import GoogleGenerativeAIInstrumentor

# Initialize Respan with Google GenAI instrumentation
respan = Respan(instrumentations=[GoogleGenerativeAIInstrumentor()])

# Route calls through the Respan gateway (OpenAI SDK format)
client = OpenAI(
    api_key=os.getenv("RESPAN_API_KEY"),
    base_url=os.getenv("RESPAN_BASE_URL", "https://api.respan.ai/api"),
)

response = client.chat.completions.create(
    model="gemini-2.5-flash",
    messages=[{"role": "user", "content": "Say hello in three languages."}],
)
print(response.choices[0].message.content)
respan.flush()

import os
from dotenv import load_dotenv

load_dotenv()

from google import genai
from openai import OpenAI
from respan import Respan

# Auto-discover and activate all installed instrumentors
respan = Respan(is_auto_instrument=True)

# Route calls through the Respan gateway (OpenAI SDK format)
client = OpenAI(
    api_key=os.getenv("RESPAN_API_KEY"),
    base_url=os.getenv("RESPAN_BASE_URL", "https://api.respan.ai/api"),
)

response = client.chat.completions.create(
    model="gemini-2.5-flash",
    messages=[{"role": "user", "content": "Say hello in three languages."}],
)
print(response.choices[0].message.content)
respan.flush()

View your trace

Open the Traces page to see your auto-instrumented LLM spans.

This step applies to Tracing and Both setups. The Gateway-only setup does not produce traces.

Configuration

Parameter	Type	Default	Description
`api_key`	`str \| None`	`None`	Falls back to `RESPAN_API_KEY` env var.
`base_url`	`str \| None`	`None`	Falls back to `RESPAN_BASE_URL` env var.
`instrumentations`	`list`	`[]`	Plugin instrumentations to activate (e.g. `GoogleGenerativeAIInstrumentor()`).
`is_auto_instrument`	`bool \| None`	`False`	Auto-discover and activate all installed instrumentors via OpenTelemetry entry points.
`customer_identifier`	`str \| None`	`None`	Default customer identifier for all spans.
`metadata`	`dict \| None`	`None`	Default metadata attached to all spans.
`environment`	`str \| None`	`None`	Environment tag (e.g. `"production"`).

Attributes

Attach customer identifiers, thread IDs, and metadata to spans.

In Respan()

Set defaults at initialization — these apply to all spans.

from respan import Respan
from openinference.instrumentation.google_generativeai import GoogleGenerativeAIInstrumentor

respan = Respan(
    instrumentations=[GoogleGenerativeAIInstrumentor()],
    customer_identifier="user_123",
    metadata={"service": "chat-api", "version": "1.0.0"},
)

With propagate_attributes

Override per-request using a context manager.

from respan import Respan, workflow, propagate_attributes
from openinference.instrumentation.google_generativeai import GoogleGenerativeAIInstrumentor

respan = Respan(
    instrumentations=[GoogleGenerativeAIInstrumentor()],
    metadata={"service": "chat-api", "version": "1.0.0"},
)

@workflow(name="handle_request")
def handle_request(user_id: str, question: str):
    with propagate_attributes(
        customer_identifier=user_id,
        thread_identifier="conv_001",
        metadata={"plan": "pro"},  # merged with default metadata
    ):
        response = client.models.generate_content(
            model="gemini-2.5-flash",
            contents=question,
        )
        print(response.text)

Attribute	Type	Description
`customer_identifier`	`str`	Identifies the end user in Respan analytics.
`thread_identifier`	`str`	Groups related messages into a conversation.
`metadata`	`dict`	Custom key-value pairs. Merged with default metadata.

Decorators

Use @workflow and @task to create structured trace hierarchies.

from respan import Respan, workflow, task
from openinference.instrumentation.google_generativeai import GoogleGenerativeAIInstrumentor
from google import genai

respan = Respan(instrumentations=[GoogleGenerativeAIInstrumentor()])
client = genai.Client()

@task(name="generate_outline")
def outline(topic: str) -> str:
    response = client.models.generate_content(
        model="gemini-2.5-flash",
        contents=f"Create a brief outline about: {topic}",
    )
    return response.text

@workflow(name="content_pipeline")
def pipeline(topic: str):
    plan = outline(topic)
    response = client.models.generate_content(
        model="gemini-2.5-flash",
        contents=f"Write content from this outline: {plan}",
    )
    print(response.text)

pipeline("Benefits of API gateways")
respan.flush()

Examples

Basic

response = client.models.generate_content(
    model="gemini-2.5-flash",
    contents="Say hello in three languages.",
)
print(response.text)

Streaming

for chunk in client.models.generate_content_stream(
    model="gemini-2.5-flash",
    contents="Write a haiku about Python.",
):
    print(chunk.text, end="", flush=True)

Structured output

from google.genai import types

response = client.models.generate_content(
    model="gemini-2.5-flash",
    contents="List three programming languages with their year of creation.",
    config=types.GenerateContentConfig(
        response_mime_type="application/json",
    ),
)
print(response.text)

Gateway features

The features below require the Gateway or Both setup from Step 3.

Switch models

Change the model parameter to use 250+ models from different providers through the same gateway.

# Google Gemini
response = client.chat.completions.create(model="gemini-2.5-flash", messages=messages)

# OpenAI
response = client.chat.completions.create(model="gpt-4.1-nano", messages=messages)

# Anthropic
response = client.chat.completions.create(model="claude-sonnet-4-5-20250929", messages=messages)

See the full model list.

Respan parameters

Pass additional Respan parameters via extra_body for gateway features.

response = client.chat.completions.create(
    model="gemini-2.5-flash",
    messages=[{"role": "user", "content": "Hello"}],
    extra_body={
        "customer_identifier": "user_123",
        "fallback_models": ["gpt-4.1-nano"],
        "metadata": {"session_id": "abc123"},
        "thread_identifier": "conversation_456",
    },
)

See Respan parameters for the full list.

Overview

Agent Frameworks

LLM SDKs

Coding Agents

Vector DBs

Others

Model Providers

Google GenAI (Gemini)

What is Google GenAI SDK?

Setup

Configuration

Attributes

In Respan()

With propagate_attributes

Decorators

Examples

Basic

Streaming

Structured output

Gateway features

Switch models

Respan parameters

Overview

Agent Frameworks

LLM SDKs

Coding Agents

Vector DBs

Others

Model Providers

​What is Google GenAI SDK?

​Setup

​Configuration

​Attributes

​In Respan()

​With propagate_attributes

​Decorators

​Examples

​Basic

​Streaming

​Structured output

​Gateway features

​Switch models

​Respan parameters

What is Google GenAI SDK?

Setup

Configuration

Attributes

In Respan()

With propagate_attributes

Decorators

Examples

Basic

Streaming

Structured output

Gateway features

Switch models

Respan parameters