Enabling Streaming
Streaming is off by default. You must opt in before any streaming shortcut or setting becomes active.- macOS
- Windows
Open the Streaming section in the sidebar and turn on the Enable Streaming toggle.Once enabled, the shortcut field and all other options appear below the toggle.
Keyboard Shortcut
After enabling streaming, you set the shortcut that triggers it.- macOS
- Windows
The default shortcut is ⌥ ⇧ Space (Option + Shift + Space). To change it, click the shortcut field in the Streaming sidebar section and press your preferred key combination. Press the shortcut again to start or stop streaming.If the shortcut conflicts with another HyperWhisper shortcut, a warning appears and the new binding is not saved until the conflict is resolved.
Choosing a Provider
Open the Engine section (visible when streaming is enabled) and pick a provider.| Provider | API key required | Vocabulary boosting | Model selection |
|---|---|---|---|
| HyperWhisper Cloud | No | Yes | No |
| Deepgram | Yes | Yes (explicit language only) | Yes — Nova 3 |
| ElevenLabs | Yes | No | No |
| OpenAI | Yes | No | No |
| xAI | Yes | No | No |
| Parakeet (On-Device) | No | No | V3 (Multilingual) / V2 (English) |
| Nemotron 3.5 (On-Device) | No | No | Multilingual / Latin |
Vocabulary entries you have added in the Vocabulary section are forwarded to HyperWhisper Cloud and Deepgram (when an explicit language is set). ElevenLabs, OpenAI, and xAI do not support vocabulary boosting via the streaming API — a warning appears if you have vocabulary entries and switch to one of those providers.
Deepgram Options
When Deepgram is selected, two additional settings appear.Model
Choose between Nova 3 General (default) and Nova 3 Medical. The medical model is tuned for healthcare terminology and clinical language.Fast Formatting
When enabled (the default), Deepgram returns smart-formatted results immediately without waiting for additional surrounding context. This minimises the delay before words appear on screen. When disabled, Deepgram waits for more context before finalising punctuation and number formatting, which produces slightly more accurate formatting at the cost of extra latency.On-Device Providers (macOS)
Parakeet
Parakeet is NVIDIA’s open-weight speech model, running locally via the integrated on-device engine.- Parakeet V3 (Multilingual) — supports 25 European languages; this is the default.
- Parakeet V2 (English) — English only, highest recall.
Nemotron 3.5
Nemotron 3.5 is NVIDIA’s streaming ASR model family, also running fully on-device.- Nemotron 3.5 (Multilingual) — approximately 40 languages including Chinese, Japanese, Korean, and Arabic; this is the default variant.
- Nemotron 3.5 (Latin) — approximately 6 Latin-script languages; faster.
Language
The Language setting tells the provider which language to expect. Setting an explicit language generally improves accuracy and, for Deepgram, is required for vocabulary boosting to work.- macOS
- Windows
The language picker appears in the Language section below the Engine card. The available choices depend on the selected provider. For cloud providers, choosing Automatic lets the server detect the language from your audio — note that vocabulary boosting is disabled when Automatic is selected.For Parakeet V2, the language is fixed to English and the picker is disabled. For Parakeet V3 and both Nemotron variants, only the languages supported by that specific model are shown.
