New
- Reasoning tokens tracking — track reasoning tokens separately for o3, gpt-5, and other reasoning models. Shows as a dedicated field alongside prompt and completion tokens.
- Export pipeline — partition-based export with sampling percentage and field selection.
- MCP tools expansion — added tools for credit transactions, API key management, experiments, scores, traces, automations, and dashboard analytics.
- Rate limiting — global rate limiting across all API endpoints for stability.
Improved
- Prompt cache hit tokens bug fix
- Dead letter queue (DLQ) observability and retry tools