Koldan Developer Documentation
Koldan is a speech-to-text platform that provides REST APIs for transcription, summarization, translation, and more. This documentation covers everything you need to develop with Koldan or integrate Koldan into your applications.
Fundamentals
-
Authentication and API Keys
How Bearer tokens and API keys work, scoped permissions, and multi-tenant headers.
-
Tenants
Multi-tenancy, tenant isolation, and Bring Your Own Identity Provider (BYOIdP).
-
Roles and Permissions
System roles, permission scopes, and role-based access control.
-
Speech Models
How models work, model types, capabilities, and language support.
-
Files, Transcriptions, and Summaries
The full pipeline — uploading files, running transcriptions, AI summarization, tags, virtual paths, and content lifecycle.
-
Streaming and WebSocket
Real-time speech-to-text, low-latency applications, session lifecycle, and best practices for streaming audio.
-
Data Retention, Quotas, and Rate Limits
How long data is kept, usage quotas, rate limit headers, and what happens when limits are exceeded.
-
Subscriptions
Subscription plans, how they control quotas and rate limits, and checking your plan status.