New

  • Reasoning tokens tracking — track reasoning tokens separately for o3, gpt-5, and other reasoning models. Shows as a dedicated field alongside prompt and completion tokens.
  • Export pipeline — partition-based export with sampling percentage and field selection.
  • MCP tools expansion — added tools for credit transactions, API key management, experiments, scores, traces, automations, and dashboard analytics.
  • Rate limiting — global rate limiting across all API endpoints for stability.

Improved

  • Prompt cache hit tokens bug fix
  • Dead letter queue (DLQ) observability and retry tools