Speech to text
Transcribe audio to text through the Respan gateway with automatic logging.
Bearer token. Use Bearer YOUR_API_KEY.
Base64-encoded JSON object of Respan parameters. Legacy X-Data-Keywordsai-Params is still accepted.
Audio file. Supported: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, webm.
Input audio language (ISO-639-1).
Sampling temperature (0-1).
Timestamp granularities. Requires verbose_json response format.
Per-customer LLM provider credentials.
When true, omits input/output from the log. Metrics still recorded.
Custom key-value metadata.
Word-level timestamps (if requested).
Segment-level timestamps (if requested).