List latency / TTFT / TPS quantiles
List latency / TTFT / TPS quantiles
Returns p50, p90, p95, and p99 distributions for latency, time-to-first-token, and tokens-per-second, bucketed by time_tick.
Headers
Authorization
Bearer token. Use Bearer YOUR_API_KEY.
Request
This endpoint expects an object.
start_time
Inclusive start of the range, ISO 8601.
end_time
Inclusive end of the range, ISO 8601.
time_tick
Bucket granularity for time-series endpoints. If omitted, inferred from the range. Has no effect on /summary/ endpoints.
Allowed values:
Response
Successful response.
date_group
Bucket start time, ISO 8601.
latency_p_50
latency_p_90
latency_p_95
latency_p_99
ttft_p_50
ttft_p_90
ttft_p_95
ttft_p_99
tps_p_50
tps_p_90
tps_p_95
tps_p_99