Shunya Labs Plans

Flexible pricing for every scenario

Pay as you go

Free

$200

of Credit

Then pay-as-you-go. No minimums. No expiration. No credit card required.

  • Industry leading speech to text foundation models
  • Advanced intelligence features
  • Custom voice agent orchestrations

Volume

$500

With pre-paid credits for the year. Credits are redeemed against actual usage.

  • Industry leading speech to text foundation models
  • Advanced intelligence features
  • Custom voice agent orchestrations

Enterprise

Custom pricing

For businesses with large volumes, data or deployment requirements, or support needs.

  • Access all models with our best discounts
  • Access to custom-trained speech-to-text models and intelligence features
  • Highest concurrency support
  • Self-hosted deployment options
  • Dedicated SLAs and support

Voice Agents

Calculate your per minute cost for voice agents.

Speech to text

LLM

Text to Speech

Estimated cost/minute

$0.0154
Speech to text
$ 0.0045
LLM
$ 0.0034
Text to Speech
$ 0.0075

Estimate your monthly custom plan for voice agents:

$ 18.48
10500
Hours required per month
Add $18 to wallet

Speech to Text

Industry-best speech to text foundation models for superior performance.

Model

Price(USD/min)

Zero STT

Supports 200+ languages

$0.0039

Zero STT Indic

Superior accuracy for Indic languages

$0.0045

Zero STT Codeswitch

Native codeswitch model for multilingual speech

$0.0050

Zero STT Med

Specialised model for healthcare transcriptions

$0.0050

Zero STT Numerical

Specialised model for transcripts containing numerical values

$0.0050

Audio Processing

Get better transcripts with cleaner audio.

Product

Price(USD/min)

Denoiser

$0.0039

Enhancer

$0.0039

Speech Intelligence Features

Get analytics directly from speech and formatted outputs for integration into your workflows.

Feature

Price(USD/min)

Language Identification

Automatically detect the language in your audio files

$0.0001

Translation

Translate audio during or after transcription

$0.0003

Transliteration

Convert output to your preferred script

$0.0003

Speaker Diarization

Separate transcripts by speaker automatically

$0.0012

Speaker Identification

Customize speaker labels for personalized transcripts

$0.0009

Word Timestamps

Word-level timing for precise navigation

$0.0012

Profanity and Keyword Hashing

Filter and mask profanity or custom keywords

$0.0003

Intent Detection

Understand the purpose behind every conversation

$0.0003

Sentiment Analysis

Track emotional tone across interactions

$0.0003

Emotion Diarization

Get granular emotion tracking throughout conversations

$0.0005

Summarisation

Generate concise summaries from audio or text

$0.0003

Keyword Normalisation

Standardize brand names, acronyms, and custom terminology

$0.0003

Medical Keyterm Correction

Ensure accurate transcription of medical terminology

$0.0003

Frequently Asked Questions