New

  • Models summary endpoint — aggregate model usage statistics via API.
  • MCP tools for prompts — create and manage prompt versions directly from Claude Code or Cursor.
  • Experiment V2 eval results — run evaluations and view results directly in experiment tables.

Improved

  • Automation workflows now support name, description, and throttling.
  • Streaming logging improvements.