HyperWhisper supports three transcription tiers out of the box via HyperWhisper Cloud, plus bring-your-own-key (BYOK) for direct provider access and fully offline local models.

HyperWhisper Cloud at a glance

HyperWhisper Cloud is built-in — no API key, no separate account. Pick a tier based on whether you care more about speed, balance, or accuracy. All three are pay-as-you-go with no markup; you pay what the underlying provider charges.

Fast

Groq Whisper Large v3
~$0.11 / hour (1.85 credits/min)
Sub-second latency. Great for English and larger European languages.

Balanced

Deepgram Nova-3
~$0.33 / hour (5.5 credits/min)
Strong English accuracy, low latency, supports custom vocabulary.

Accurate

ElevenLabs Scribe v2
~$0.59 / hour (9.83 credits/min)
Best-in-class multilingual accuracy (e.g. Danish, Dutch, Nordic languages).

Credits are billed at 1 credit = $0.001 USD. A Pro license includes 5,000 credits up front, and top-ups are available in $5 / $10 / $20 bundles.
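As a rough illustration of how the credit rates relate to the hourly prices, the sketch below converts credits per minute to dollars per hour and estimates how many hours of speech a credit balance covers. The function names are made up for this example and are not part of HyperWhisper.

```python
CREDIT_USD = 0.001  # 1 credit = $0.001 USD, as stated above

# Credits charged per minute of speech on each HyperWhisper Cloud tier.
CREDITS_PER_MIN = {"Fast": 1.85, "Balanced": 5.5, "Accurate": 9.83}


def usd_per_hour(credits_per_min: float) -> float:
    """Convert a credits/min rate into US dollars per hour of speech."""
    return credits_per_min * 60 * CREDIT_USD


def hours_covered(credits: float, credits_per_min: float) -> float:
    """Hours of speech that a credit balance pays for at a given rate."""
    return credits / credits_per_min / 60


if __name__ == "__main__":
    for tier, rate in CREDITS_PER_MIN.items():
        print(
            f"{tier}: ${usd_per_hour(rate):.2f}/hour, "
            f"{hours_covered(5000, rate):.1f} h of speech from 5,000 credits"
        )
```

The 5,000 credits used in the loop correspond to the balance included with a Pro license, which is the same amount as a $5 top-up.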

You only pay for actual speech

HyperWhisper Cloud detects silence and blank audio automatically. If a recording contains no detectable speech, you are charged 0 credits — we don’t bill for dead air at the start of a clip, pauses between thoughts, or an accidentally-triggered empty recording. In practice, across a typical working day of push-to-talk dictation, you’re only billed for the minutes you actually spoke.
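One way to picture speech-only billing is to charge for the summed length of detected speech segments rather than the length of the recording. The sketch below is an assumption about the general idea only: the `SpeechSegment` type, the `billed_credits` function, and the absence of rounding are all hypothetical, and HyperWhisper's actual detection and metering are not documented here.

```python
from dataclasses import dataclass


@dataclass
class SpeechSegment:
    """A span of detected speech inside a recording (hypothetical type)."""
    start_s: float
    end_s: float


def billed_credits(segments: list[SpeechSegment], credits_per_min: float) -> float:
    """Charge only for seconds that contain speech; silence costs nothing."""
    speech_seconds = sum(max(0.0, seg.end_s - seg.start_s) for seg in segments)
    return speech_seconds / 60 * credits_per_min


if __name__ == "__main__":
    # A 60-second clip with two 15-second bursts of speech, billed on Fast.
    clip = [SpeechSegment(5, 20), SpeechSegment(30, 45)]
    print(billed_credits(clip, credits_per_min=1.85))

    # An accidentally triggered recording with no speech detected.
    print(billed_credits([], credits_per_min=1.85))
```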

Accuracy by language

Word Error Rate (WER) across the three HyperWhisper Cloud tiers. Lower is better. Figures are drawn from Soniox’s 2025 60-language benchmark; the model benchmarked there as “OpenAI” is Whisper Large v3, the same model Groq runs for the Fast tier.

Rule of thumb: for English, any tier works. For anything else, especially Nordic, Slavic, and Greek, Accurate (ElevenLabs) is usually the best pick.

Cost examples

At 1 credit = $0.001 USD, here’s what each tier costs at typical usage levels. Remember: only actual speech is billed, so “30 min/day” means 30 minutes of talking, not 30 minutes of the app being open.

Daily speech   Fast     Balanced   Accurate
15 min         ~$0.03   ~$0.08     ~$0.15
30 min         ~$0.06   ~$0.17     ~$0.30
1 hour         ~$0.11   ~$0.33     ~$0.59
2 hours        ~$0.22   ~$0.66     ~$1.18
8 hours        ~$0.89   ~$2.64     ~$4.72

Monthly at 30 min/day: ~$1.70 Fast / ~$5 Balanced / ~$9 Accurate. A one-time $5 top-up (5,000 credits) covers about 45 hours of speech on Fast, 15 hours on Balanced, or 8.5 hours on Accurate.
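The daily and monthly figures follow directly from the per-minute credit rates. The sketch below recomputes them; the function names are illustrative, and a 30-day month is assumed.

```python
CREDIT_USD = 0.001  # 1 credit = $0.001 USD
CREDITS_PER_MIN = {"Fast": 1.85, "Balanced": 5.5, "Accurate": 9.83}


def daily_cost_usd(speech_minutes: float, credits_per_min: float) -> float:
    """Cost in USD for a given number of minutes of actual speech."""
    return speech_minutes * credits_per_min * CREDIT_USD


def monthly_cost_usd(speech_minutes_per_day: float, credits_per_min: float,
                     days: int = 30) -> float:
    """Cost in USD for the same amount of speech every day of the month."""
    return daily_cost_usd(speech_minutes_per_day, credits_per_min) * days


if __name__ == "__main__":
    for minutes in (15, 30, 60, 120, 480):
        row = "  ".join(
            f"{tier} ~${daily_cost_usd(minutes, rate):.2f}"
            for tier, rate in CREDITS_PER_MIN.items()
        )
        print(f"{minutes:>3} min/day: {row}")

    for tier, rate in CREDITS_PER_MIN.items():
        print(f"{tier} at 30 min/day: ~${monthly_cost_usd(30, rate):.2f}/month")
```

Because this works from unrounded rates, individual results can differ from the rounded figures quoted above by a few cents.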

Alternatives

If you already have API credits or want to use your own free tier (Deepgram $200, AssemblyAI $50), plug in a key via API Keys. You pay the provider directly at their published rate.

Provider     Model                           $/min
Groq         Whisper Large v3 Turbo          $0.00067
Deepgram     Nova-3 (batch)                  $0.0043
AssemblyAI   Universal                       $0.0037
OpenAI       whisper-1 / gpt-4o-transcribe   $0.006
ElevenLabs   Scribe v2                       ~$0.008

HyperWhisper also supports Fireworks AI, Mistral, and Google Gemini for BYOK. See API Keys for setup.

Boost accuracy on any provider

  • Custom vocabulary — add domain terms (product names, frameworks, jargon, colleagues’ names). Biggest single improvement for technical or professional use.
  • Low-noise environment — every model degrades with background noise. See Best Practices.
  • Natural pace — overly fast or overly slow speech both hurt accuracy.
When using Deepgram Nova-3 with custom vocabulary, set the language explicitly (not auto) — the keyterm parameter is only active in monolingual mode on Nova-3.
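For readers calling Deepgram directly with their own key, the sketch below shows what pairing an explicit language with `keyterm` looks like at the request level. It only builds the request URL and sends nothing. The helper name `nova3_keyterm_url` is hypothetical, and rejecting `auto` and `multi` is this example's way of encoding the monolingual requirement described above; inside HyperWhisper these values come from the app's settings rather than from code like this.

```python
from urllib.parse import quote, urlencode

DEEPGRAM_LISTEN_URL = "https://api.deepgram.com/v1/listen"


def nova3_keyterm_url(language: str, keyterms: list[str]) -> str:
    """Build a Nova-3 request URL with an explicit language and key terms."""
    # Assumption for this sketch: treat automatic or multilingual settings
    # as invalid when key terms are supplied.
    if language.lower() in ("", "auto", "multi"):
        raise ValueError("set an explicit language code when using keyterm")

    params = [("model", "nova-3"), ("language", language)]
    # One keyterm parameter per custom vocabulary entry.
    params += [("keyterm", term) for term in keyterms]
    return f"{DEEPGRAM_LISTEN_URL}?{urlencode(params, quote_via=quote)}"


if __name__ == "__main__":
    print(nova3_keyterm_url("en", ["HyperWhisper", "code switching"]))
```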

FAQ

Does HyperWhisper Cloud mark up the underlying provider cost?
No. You pay the same per-minute rate as if you held the provider’s API key directly.

What happens if my chosen tier is temporarily unavailable?
HyperWhisper Cloud automatically falls back to another provider in the chain, so transcription still succeeds. You’re billed at the rate of the provider that actually handled the request.

Which tier handles mixed-language speech (e.g. code-switching)?
Accurate (ElevenLabs Scribe v2) handles multilingual and code-switched speech best. It’s also the top performer on FLEURS, the standard multilingual benchmark.
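The fallback behaviour mentioned in the FAQ can be pictured as trying providers in order and charging at the rate of whichever one succeeds. This is a minimal sketch under stated assumptions: the chain order, the `ProviderUnavailable` exception, and the function signature are all invented for illustration and do not describe HyperWhisper's internals.

```python
from typing import Callable

# (tier name, credits per minute, transcription function): hypothetical shape.
Provider = tuple[str, float, Callable[[bytes], str]]


class ProviderUnavailable(Exception):
    """Raised by a provider call that cannot serve the request right now."""


def transcribe_with_fallback(
    audio: bytes, chain: list[Provider], speech_minutes: float
) -> tuple[str, str, float]:
    """Return (text, provider that handled it, credits billed)."""
    for name, credits_per_min, transcribe in chain:
        try:
            text = transcribe(audio)
        except ProviderUnavailable:
            continue  # try the next provider in the chain
        # Bill at the rate of the provider that actually succeeded.
        return text, name, speech_minutes * credits_per_min
    raise ProviderUnavailable("every provider in the chain failed")


if __name__ == "__main__":
    def fast_down(audio: bytes) -> str:
        raise ProviderUnavailable("Fast tier is down")

    def balanced_ok(audio: bytes) -> str:
        return "hello world"

    chain = [("Fast", 1.85, fast_down), ("Balanced", 5.5, balanced_ok)]
    print(transcribe_with_fallback(b"", chain, speech_minutes=2.0))
```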