Trace production behavior, run evals, and ship improvements — without building observability from scratch. Purpose-built for teams shipping LLM applications.
Works with OpenAI, Anthropic, Gemini, LangChain, LlamaIndex, and 20+ frameworks.
LLM observability with the debugging power you expect from traditional APM, plus LLM-specific features that general-purpose tools don't offer.
Trace every LLM call
Structured logs for every request: inputs, outputs, latency, cost, tokens, model, and status. Searchable in real time.
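For illustration, one captured record might look like the following (a sketch; the field names are hypothetical, but they cover what this page lists: inputs, outputs, latency, cost, tokens, model, and status):

```python
# Hypothetical shape of one logged request; field names are illustrative.
log_record = {
    "input": [{"role": "user", "content": "Summarize this support ticket."}],
    "output": "Customer cannot log in after a password reset.",
    "latency_ms": 412,
    "cost_usd": 0.0031,
    "tokens": {"prompt": 542, "completion": 87},
    "model": "gpt-4o-mini",
    "status": "success",
}
```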
Debug agent pipelines
Visual span waterfall for multi-step agents. See which tool was called, what data passed between steps, and where failures occurred.
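One plausible way to produce that waterfall, assuming a hypothetical `respan.trace` decorator (the SDK's real agent API isn't documented on this page):

```python
import respan

# Each decorated function would open a span; nested calls appear as
# child spans in the waterfall, with inputs and outputs recorded per step.
@respan.trace(name="classify")
def classify(ticket: str) -> str:
    return "billing"  # stand-in for an LLM classification call

@respan.trace(name="search_kb")
def search_kb(intent: str) -> list[str]:
    return ["refund-policy.md"]  # stand-in for a retrieval tool call

@respan.trace(name="support_agent")
def handle_ticket(ticket: str) -> str:
    docs = search_kb(classify(ticket))
    return f"See {docs[0]} for the refund policy."
```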
Attribute costs to features
Break down LLM spend by user, feature, model, and environment. Know which parts of your app drive 80% of the bill.
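A sketch of how those breakdowns could be fed, assuming the SDK accepts free-form tags per call (`respan.wrap` and the `metadata` argument are assumptions, not documented API):

```python
import openai
import respan

# Hypothetical wrapper: tags on each call become dimensions in the
# cost dashboard (user, feature, environment).
client = respan.wrap(openai.OpenAI())

client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Summarize this support ticket."}],
    metadata={"user_id": "u_182", "feature": "ticket_summarizer", "env": "prod"},
)
```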
Run automated evals
LLM-as-judge, custom Python evaluators, and rule-based checks. Run on production traffic or against testsets.
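A minimal custom Python evaluator might look like this (a sketch; the `respan.evaluator` registration decorator is an assumption):

```python
import respan

# Rule-based check: score 1.0 when the reply cites the refund policy.
@respan.evaluator(name="mentions_refund_policy")
def mentions_refund_policy(output: str) -> float:
    return 1.0 if "refund policy" in output.lower() else 0.0
```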
Compare model variants
Structured experiments: same testset, multiple models or prompts, scored side-by-side with statistical comparison.
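A sketch of such an experiment, with assumed function and field names (one testset, two variants, shared evaluators):

```python
import respan

results = respan.experiments.run(
    testset="support-tickets-v3",
    variants=[
        {"model": "gpt-4o-mini", "prompt": "summarize-v2"},
        {"model": "claude-sonnet-4-5", "prompt": "summarize-v2"},
    ],
    evaluators=["mentions_refund_policy", "llm-judge-quality"],
)
print(results.summary())  # per-variant scores, compared side by side
```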
Manage prompts via API
Version, deploy, and A/B test prompts without code changes. Pull the active version at runtime with sub-5ms overhead.
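Pulling the active version at runtime could look like this (a sketch; `respan.prompts.get` is an assumed call, and the sub-5ms claim implies it is served from a local cache):

```python
import respan

# Fetch whatever version is currently deployed under the "production" label.
prompt = respan.prompts.get("ticket-summary", label="production")
messages = prompt.format(ticket="Customer cannot log in after a password reset.")
```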
Detect regressions early
Quality alerts when eval scores drop. Cost alerts when spend spikes. Latency alerts when P95 degrades.
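Illustrative rules for those three alert types (the API shape and thresholds are assumptions):

```python
import respan

respan.alerts.create(metric="eval_score", condition="drops_below", threshold=0.8)
respan.alerts.create(metric="daily_cost_usd", condition="exceeds", threshold=250)
respan.alerts.create(metric="latency_p95_ms", condition="exceeds", threshold=3000)
```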
Export to data pipelines
Push logs, traces, and eval results to S3, BigQuery, or any destination via REST API or webhooks.
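A sketch of a scheduled export to one of the listed destinations (the function and field names are assumptions):

```python
import respan

respan.exports.create(
    datasets=["logs", "traces", "eval_results"],
    destination={"type": "s3", "bucket": "acme-llm-telemetry", "prefix": "respan/"},
    schedule="hourly",
)
```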
Instrument once, observe everything, evaluate continuously, improve systematically.
Instrument
Wrap your LLM client with the Respan SDK. Two lines of code. Every call logged automatically.
→ Structured logs and traces
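The wrap itself might look like this (a sketch; `respan.wrap` is an assumed entry point, since the SDK's actual call isn't shown on this page):

```python
import openai
import respan

client = respan.wrap(openai.OpenAI())  # every call through `client` is logged
```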
Observe
Search logs, trace agents, track costs and latency in real-time dashboards.
→ Full production visibility
Evaluate
Run automated evals on production outputs. Build testsets from real traffic. Compare variants.
→ Quality scores and experiment results
Improve
Update prompts from the dashboard. Deploy changes without code releases. Verify with evals.
→ Measured improvement
Govern
Set alerts, budgets, and quality thresholds. Export data for compliance. Audit every change.
→ Controlled AI operations
2 lines to instrument
100% of requests captured
<80ms P99 ingestion latency
Real-time search and tracing
Integrations: model providers, frameworks, and languages.