Skip to content

Koldan Developer Documentation

Koldan is a speech-to-text platform that provides REST APIs for transcription, summarization, translation, and more. This documentation covers everything you need to develop with Koldan or integrate Koldan into your applications.


Fundamentals

  • Authentication and API Keys


    How Bearer tokens and API keys work, scoped permissions, and multi-tenant headers.

    Authentication

  • Tenants


    Multi-tenancy, tenant isolation, and Bring Your Own Identity Provider (BYOIdP).

    Tenants

  • Roles and Permissions


    System roles, permission scopes, and role-based access control.

    Roles and Permissions

  • Speech Models


    How models work, model types, capabilities, and language support.

    Speech Models

  • Files, Transcriptions, and Summaries


    The full pipeline — uploading files, running transcriptions, AI summarization, tags, virtual paths, and content lifecycle.

    Files and Transcriptions

  • Streaming and WebSocket


    Real-time speech-to-text, low-latency applications, session lifecycle, and best practices for streaming audio.

    Streaming and WebSocket

  • Data Retention, Quotas, and Rate Limits


    How long data is kept, usage quotas, rate limit headers, and what happens when limits are exceeded.

    Retention, Quotas, and Rate Limits

  • Subscriptions


    Subscription plans, how they control quotas and rate limits, and checking your plan status.

    Subscriptions


Where to Go Next

View API Reference