Skip to main content
  1. Sign up — Create an account at platform.respan.ai
  2. Create an API key — Generate one on the API keys page
  3. Add credits or a provider key — Add credits on the Credits page or connect your own provider key on the Integrations page
Add the Docs MCP to your AI coding tool to get help building with Respan. No API key needed.
{
  "mcpServers": {
    "respan-docs": {
      "url": "https://docs.respan.ai/mcp"
    }
  }
}

What is Replicate?

Replicate is a platform for running machine learning models in the cloud. The Replicate Python SDK lets you run open-source models with a simple API. Respan can auto-instrument all Replicate calls for tracing and observability.
Replicate uses a different API format, so only Tracing setup is available. Gateway routing is not directly supported.

Setup

1

Install packages

pip install respan-ai opentelemetry-instrumentation-replicate replicate python-dotenv
2

Set environment variables

export REPLICATE_API_TOKEN="YOUR_REPLICATE_API_TOKEN"
export RESPAN_API_KEY="YOUR_RESPAN_API_KEY"
3

Initialize and run

import os
from dotenv import load_dotenv

load_dotenv()

import replicate
from respan import Respan

# Auto-discover and activate all installed instrumentors (Traceloop)
respan = Respan(is_auto_instrument=True)

# Calls go directly to Replicate, auto-traced by Respan
output = replicate.run(
    "meta/meta-llama-3-70b-instruct",
    input={"prompt": "Say hello in three languages."},
)
print("".join(output))
respan.flush()
4

View your trace

Open the Traces page to see your auto-instrumented LLM spans.

Configuration

ParameterTypeDefaultDescription
api_keystr | NoneNoneFalls back to RESPAN_API_KEY env var.
base_urlstr | NoneNoneFalls back to RESPAN_BASE_URL env var.
is_auto_instrumentbool | NoneFalseAuto-discover and activate all installed instrumentors via OpenTelemetry entry points.
customer_identifierstr | NoneNoneDefault customer identifier for all spans.
metadatadict | NoneNoneDefault metadata attached to all spans.
environmentstr | NoneNoneEnvironment tag (e.g. "production").

Attributes

Attach customer identifiers, thread IDs, and metadata to spans.

In Respan()

Set defaults at initialization — these apply to all spans.
from respan import Respan

respan = Respan(
    is_auto_instrument=True,
    customer_identifier="user_123",
    metadata={"service": "inference-api", "version": "1.0.0"},
)

With propagate_attributes

Override per-request using a context manager.
import replicate
from respan import Respan, workflow, propagate_attributes

respan = Respan(
    is_auto_instrument=True,
    metadata={"service": "inference-api", "version": "1.0.0"},
)

@workflow(name="handle_request")
def handle_request(user_id: str, question: str):
    with propagate_attributes(
        customer_identifier=user_id,
        thread_identifier="conv_001",
        metadata={"plan": "pro"},
    ):
        output = replicate.run(
            "meta/meta-llama-3-70b-instruct",
            input={"prompt": question},
        )
        print("".join(output))
AttributeTypeDescription
customer_identifierstrIdentifies the end user in Respan analytics.
thread_identifierstrGroups related messages into a conversation.
metadatadictCustom key-value pairs. Merged with default metadata.

Decorators

Use @workflow and @task to create structured trace hierarchies.
import replicate
from respan import Respan, workflow, task

respan = Respan(is_auto_instrument=True)

@task(name="generate_outline")
def outline(topic: str) -> str:
    output = replicate.run(
        "meta/meta-llama-3-70b-instruct",
        input={"prompt": f"Create a brief outline about: {topic}"},
    )
    return "".join(output)

@workflow(name="content_pipeline")
def pipeline(topic: str):
    plan = outline(topic)
    output = replicate.run(
        "meta/meta-llama-3-70b-instruct",
        input={"prompt": f"Write content from this outline: {plan}"},
    )
    print("".join(output))

pipeline("Benefits of API gateways")
respan.flush()

Examples

Basic run

output = replicate.run(
    "meta/meta-llama-3-70b-instruct",
    input={"prompt": "Say hello in three languages."},
)
print("".join(output))