AI Speech Engines


Convert speech in audio or video files in 70 different languages and dialects into text transcripts.

Transcription runs on the aiWARE Enterprise AI platform, which orchestrates a diverse ecosystem of ready-to-deploy machine learning models to transform audio, video, text, and other data sources into actionable intelligence at scale, with no AI expertise required. With aiWARE, you can use digital workers to cut manual review time, gain valuable data insights, and enrich end-to-end workflows.

Transcription engines (often called speech-to-text engines) in the Veritone cognitive engine ecosystem convert spoken words in audio and video recordings into readable, searchable text. They are built and trained to transcribe different languages and dialects. Text analytics can then be applied to the resulting transcripts for further insight into what was spoken.
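
As an illustration of what "searchable text" makes possible, here is a minimal Python sketch of keyword lookup over a timed transcript. The `Word` structure and the `find_keyword` helper are hypothetical names chosen for this example; they do not represent the aiWARE output schema or any Veritone API.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """One transcribed word and its start time in seconds (hypothetical shape)."""
    text: str
    start: float


def find_keyword(words, keyword):
    """Return the start time of every case-insensitive match of `keyword`."""
    target = keyword.lower()
    # Strip common trailing punctuation so "meeting." matches "meeting".
    return [w.start for w in words if w.text.lower().strip(".,!?") == target]


# Made-up sample transcript, for illustration only.
transcript = [
    Word("Welcome", 0.0),
    Word("to", 0.4),
    Word("the", 0.5),
    Word("budget", 0.7),
    Word("meeting.", 1.1),
    Word("Budget", 2.0),
    Word("first.", 2.5),
]

print(find_keyword(transcript, "budget"))  # [0.7, 2.0]
```

Because each match carries a timestamp, a reviewer could jump straight to the moment a keyword was spoken instead of replaying the whole recording.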

Transcription Features:

  • Broad Language Support

    Convert speech to text in 70 different natural languages and dialects in the cloud, a subset of which can also be deployed locally, to support a diverse user base, workforce, or population.

  • Text Transcripts

    Export spoken word recordings as text transcripts in plain text, Microsoft Word, Timed Text Markup Language (TTML), WebVTT, and SubRip text formats via Veritone applications.

  • Machine & Manual

    Choose the right option for your use case: machine transcription with AI, or manual transcription of your audio data by Veritone partners through aiWARE.

  • Searchable Results

    Quickly find the keywords you are looking for within transcripts. Transcription engine output is searchable via API or Veritone applications.

  • Near Real-Time Processing

    Process audio and video files in near real-time for use cases requiring quick text extraction.

  • File and Stream Support

    Transform short-form or long-form speech into text from audio and video recordings, streamed recordings, or live data streams.

  • Flexible Deployment

    Deploy in a new or existing application in the cloud via aiWARE GraphQL APIs, or on-premises via a Docker container for the subset of engines that supports local deployment.

  • Powered by an AI Ecosystem

    Leverage advanced transcription machine learning algorithms from the Veritone managed cognitive engine ecosystem, including algorithms from Veritone, niche providers, and industry giants.

Experience the power of hundreds of AI engines across 20+ cognitive categories.