Back to docs

Auto-Editing

What is auto-editing?

Auto-editing is an optional post-processing step that runs after Whisper transcribes your dictated audio. Before your text is inserted, GPT-4 cleans it up by:

  • Removing filler words — um, uh, like, you know, sort of, kind of, basically, right, so, well
  • Fixing grammar errors introduced by natural speech patterns
  • Adding punctuation — commas, periods, and capitalisation where appropriate
  • Smoothing out false starts and repeated words

The result is polished, readable text suitable for documents, emails, or messages — without you having to say "period" or "comma" while speaking.

Enabling or disabling auto-editing

Auto-editing is enabled by default. To toggle it:

  1. Open the Settings tab.
  2. Find the Auto-Edit toggle.
  3. Toggle it on or off.
  4. Click Save Settings.

Changes take effect for the next dictation session.

When to disable auto-editing

Auto-editing is designed for natural dictation. You may want to disable it if:

  • You need the raw transcript without any changes (e.g., for verbatim records or debugging).
  • You are dictating code or structured content where filler word removal could corrupt the output.
  • GPT-4 is making unwanted edits to technical terminology or proper nouns.
  • You want to minimise API calls and costs (auto-editing uses an additional GPT-4 API call).

How auto-editing works

  1. Whisper returns the raw transcript text.
  2. The text is sent to GPT-4 with a prompt instructing it to clean up the transcription.
  3. GPT-4 returns the edited text.
  4. The edited text is inserted at the cursor.

This process adds a short delay (typically 1–3 seconds) compared to inserting the raw Whisper output directly.

API usage

Auto-editing uses a separate GPT-4 API call for each dictation session. This contributes to your OpenAI API usage. Check your OpenAI account dashboard to monitor costs if you dictate frequently.