What custom endpoints do
Custom endpoints are an alternative post-processing provider. When a Mode has AI cleanup enabled and its provider is set to a custom endpoint, HyperWhisper POSTs the raw transcript to your server using the standard OpenAI chat-completions format and uses the reply as the cleaned-up output. They are not used for transcription itself — only for the AI cleanup/formatting pass that runs after speech recognition.Adding an endpoint
- macOS
- Windows
Find the OpenAI-compatible endpoints card
Scroll to the OpenAI-compatible endpoints card near the bottom of the page. Click Add endpoint.
Choose a provider tab
The sheet has three tabs:
- LMStudio — pre-fills the base URL
http://localhost:1234/v1and name “LMStudio”; fetches your running models automatically. - Ollama — pre-fills
http://localhost:11434and name “Ollama”; fetches your running models automatically. - Custom — for OpenRouter, other hosted APIs, or any other OpenAI-compatible server; you enter the base URL and model name manually.
Fill in the fields
| Field | What to enter |
|---|---|
| Name | A label you recognize, e.g. “My Ollama Server” or “OpenRouter GPT-4o” |
| Base URL | The base URL of your server (see URL format) |
| Model | The model identifier your server expects, e.g. llama3.2 or mistral-7b-instruct |
| API Key | Optional. Leave blank for local servers that don’t require authentication. |
Test the connection (recommended)
Click Test connection before saving. HyperWhisper sends a small request to your server and shows the response. A green indicator means the endpoint is reachable and responding correctly.
URL format
Enter the base URL of your server. For Ollama and LMStudio tabs, HyperWhisper appends the correct path automatically. For the Custom tab, see the platform notes below:| Provider tab | Path appended | Example full URL stored |
|---|---|---|
| Ollama | /v1/chat/completions | http://localhost:11434/v1/chat/completions |
| LMStudio | /chat/completions | http://localhost:1234/v1/chat/completions |
| Custom (Windows) | /chat/completions (if not already present) | https://openrouter.ai/api/v1/chat/completions |
| Custom (macOS) | (none — URL used verbatim) | https://openrouter.ai/api/v1/chat/completions |
/chat/completions if your URL does not already end with it. On macOS, the URL you enter is stored exactly as-is, so include the full path if your server requires it.
API key storage
API keys are stored securely and never in plain text:- macOS — keys are stored in the system Keychain.
- Windows — keys are stored in Windows Credential Manager.
Testing an endpoint
The test button is available both when adding a new endpoint and when editing an existing one. It sends a minimal request to your server:choices[0].message.content field. The last test result (pass or fail) is saved and shown next to the endpoint name so you can see at a glance whether it was working the last time you checked.
The test result is cleared automatically if you change the endpoint URL, since the prior result no longer applies to the new address.
Using a custom endpoint in a Mode
Once an endpoint is saved, it appears as a provider option when you edit a Mode’s post-processing settings. Select it the same way you would select any built-in AI provider. See Transcription Modes for how to configure post-processing on a Mode.Managing endpoints
Each endpoint in the list has three actions:| Action | What it does |
|---|---|
| Edit (pencil) | Change the name, URL, model, or API key |
| Duplicate | Creates a copy with a new name — useful for trying different models on the same server without re-entering the URL |
| Delete | Permanently removes the endpoint and its stored API key |
Deleting an endpoint removes it from the list and erases the stored API key. Any Modes that were using that endpoint will need to be updated to a different post-processing provider.
Common providers
Here are base URLs for popular OpenAI-compatible services. Use the Custom tab for all of these:| Provider | Base URL |
|---|---|
| OpenRouter | https://openrouter.ai/api/v1 |
| Ollama (local) | http://localhost:11434 |
| LM Studio (local) | http://localhost:1234/v1 |
| Together AI | https://api.together.xyz/v1 |
| Groq (OpenAI-compatible endpoint) | https://api.groq.com/openai/v1 |
