List LLM metrics

Returns LLM usage metrics (requests, tokens, cost, latency, cache hit rate, etc.) bucketed by time_tick (minute / hour / day).

Authentication

AuthorizationBearer

Use your Respan API key for Respan API authentication. Enter only the Respan API key value; clients send Authorization: Bearer <RESPAN_API_KEY>. For /api/responses, OpenAI or Azure OpenAI provider credentials go in Settings -> Providers or the request body credential_override field, not in this auth field.

Query parameters

summary_typeenumOptionalDefaults to all

Preset time range. Use this or explicit start_time / end_time.

datedateOptional

Base date used with summary_type presets.

start_timedatetimeOptional
Optional explicit ISO start time.
end_timedatetimeOptional
Optional explicit ISO end time.
time_tickenumOptionalDefaults to hour

Bucket granularity for time-series responses.

Allowed values:
timezone_offsetdoubleOptionalDefaults to 0
Timezone offset, in hours, used when resolving preset ranges.
fetch_filtersenumOptionalDefaults to true
Whether to include available filter options in the response.
Allowed values:

Request

This endpoint expects an object.
filtersobjectOptional

Filter criteria. See Filters API Reference for operator syntax.

Response

Successful response.
date_groupdatetime
Bucket start time, ISO 8601.
number_of_requestsinteger
total_costdouble
total_prompt_tokensinteger
total_completion_tokensinteger
total_tokensinteger
error_countinteger
error_percentagedouble
average_prompt_tokensinteger
average_completion_tokensinteger
average_tokensinteger
average_costdouble
average_tpsdouble
average_latencydouble
average_ttftdouble
prompt_cache_hit_tokensinteger
reasoning_tokensinteger
cache_hit_percentagedouble
requests_per_seconddouble