By Denys Medvediev

Guide

Dictation software for bloggers

Most blog drafts are slow because typing is slow. Dictation software fixes the draft, not the editor: press a hotkey, talk, and clean text lands at your cursor in WordPress, Ghost, Substack, Medium, or Notion. No plugin to install, and it runs offline.

Last updated: June 2026

A laptop and a cup of coffee on a wooden desk by a window, the everyday writing spot of a blogger

Dictation software for bloggers turns a spoken first draft into typed text inside any editor. A system-wide tool like Whisper works through a hotkey: speak, release, and the words paste at the cursor in WordPress, Ghost, Substack, Medium, or Notion. No CMS plugin is needed, it runs offline, and the local tier is free.

A blog post is two jobs wearing one deadline. There's the thinking — what you actually want to say — and there's the typing, which is the slow part where ideas go to wait in line. I write a fair amount on the side, and the gap between "I know what this paragraph says" and "the paragraph is on the screen" was always typing. So I started talking the first draft instead.

People search for "dictation software for bloggers" and land on lists of apps that all want to live inside one editor — a WordPress plugin here, a Chrome extension there. The honest answer is simpler and a little boring. You don't want dictation inside your CMS. You want it on top of your whole machine, so the same hotkey fills a Gutenberg block, a Substack post, and the reply to the reader who emailed you. One tool, every editor.

Here's the thing the listicles skip. Your editor — Gutenberg, the Ghost composer, the Substack box, Medium, a Notion page — is a text field. Dictation that pastes at your cursor doesn't care which one it is. The CMS is not the integration. The cursor is.

So the real question isn't "which blogging platform has the best voice typing." None of them have great built-in dictation, and you don't need them to. The question is "which dictation tool do I run on top of all of them," and the answer turns on whether you want it free, offline, and the same in every editor. I'll walk the why, set one up in two minutes, show the cleanup pass that turns spoken mush into a draft, and tell you when to skip dictation entirely and reach for a different tool.

Why bloggers reach for dictation

A person typing at a laptop with a notebook and coffee nearby, mid-draft on a blog post

The draft is the bottleneck, not the edit. Most bloggers I know can outline a post in two minutes and then spend an hour turning the outline into sentences. Talking is faster than typing — for me it's roughly three to four times faster, and the research backs the rough shape of that, even if your exact mileage depends on your hands and your topic. The point of dictation isn't to skip writing. It's to get the messy first version out of your head and onto the page before you lose the thread.

There's a second reason that matters more the longer you've been doing this: your hands. Bloggers type a lot. A weekly newsletter, a couple of posts, the comments, the emails — it adds up to a quiet, daily volume of keystrokes. Dictation lets you rest the hands and keep producing, which is a productivity choice, not a medical one. I'm not going to tell you it fixes anything; I'll tell you that talking a draft is one fewer hour of typing, and on a heavy writing week that's the difference between shipping and not.

And the ideas don't queue up politely. The good line for the intro arrives while you're making coffee, not while you're sitting at the keyboard. With a hotkey, the cost of capturing it drops to near zero — you talk for ten seconds and the line is in your draft. The honest framing here is that voice gives you the words fast. The shaping — the headers, the pull quotes, the order of paragraphs — is still your job. Dictation just removes the part where you were going to lose the sentence to a slow keyboard.

Press a hotkey, talk, the text lands in your editor

This is the whole mechanic, and it's boring in the best way. You press a hotkey, you speak, you release, and the transcript pastes at your cursor — in whatever text field has focus. Whisper holds a short tail after you let go of the key, so your last word doesn't get clipped. Because it pastes at the OS cursor, a Gutenberg block, a Ghost paragraph, the Substack composer, and a Medium story are all just "any text box." Same hotkey, same behaviour, whichever tab you're in.

That's the part the landing pages overcomplicate. There's no plugin to install into WordPress, no extension to bolt onto your browser, no API token to paste into Ghost. Your cursor is in the editor, you talk, the words appear in the editor. A small capsule shows up while you speak so you know it's listening:

Cancel
The recording overlay: a small capsule that appears while you speak, so you know Whisper is catching every word.

The hotkey is the one thing worth getting right up front. On Windows it's Ctrl+Space; on Mac it's Command+Option, a modifier-only push-to-talk you hold while speaking and release to stop. Both are changeable in Settings if they clash with a shortcut you already use. (My younger daughter once told me a hotkey "didn't work" in her drawing app. It was a conflict, not a bug, which is how I learned the average person has no idea what a hotkey conflict even is. So every hotkey is customisable now.) If you've already set up dictation on Windows or on Mac, this is the same muscle memory pointed at your CMS.

Set it up in two minutes (Windows or Mac)

You need a Mac on Apple Silicon or a Windows 10-or-newer PC, a working microphone, and your blog editor open in a browser tab or its desktop app. The whole local pipeline is free for any signed-in account, with no payment method asked for at sign-up. Here's the sequence.

Step 1 — Install Whisper and sign in.

Download from the download page, install, and create a free account. No card. The whole local transcription pipeline opens right away.

You'll know it worked when the app's tray icon appears and the setup wizard offers to pick a model.

Step 2 — Pick a transcription path.

The app doesn't choose for you. You get three: Cloud (OpenAI, bring your own key), Local Parakeet, or Local Whisper. For drafting posts on your own machine, start local — more on that two sections down.

You'll know it worked when a model finishes downloading and shows as ready.

Step 3 — Confirm your hotkey.

Windows defaults to Ctrl+Space, Mac to Command+Option held as push-to-talk. On Mac, grant the Accessibility permission when prompted; without it, the paste-at-cursor can't reach your browser or editor.

You'll know it worked when a test recording pastes into any text field.

Step 4 — Put your cursor in your editor and talk.

Open your post in WordPress, Ghost, Substack, Medium, or Notion, click into the body, hold the hotkey, say a sentence, release. The transcript appears where the cursor is.

You'll know it worked when your spoken sentence is sitting in the editor as text.

Whisper
The real Whisper desktop app on the settings screen, with the Transcription and AI panels open.

The slow part is the model download, not the setup. Everything else is the four steps above. Once it's running, writing the draft stops being a typing task and starts being a talking task, and the editor never knows the difference.

voice to text on Windows · on Mac

Dictate into WordPress, Ghost, Substack, Medium, or Notion

Here's how it plays out per editor, because bloggers ask. In WordPress Gutenberg, click into a paragraph block and dictate; the words fill the block, and you hit Enter for the next block the way you always do. The Ghost composer is the same — cursor in the body, talk, text lands. Substack's post editor and Medium's story editor are plain rich-text boxes, so dictation drops straight in. Notion treats each block as a text field, so a hotkey fills the block your cursor sits in. None of these needed a plugin, an extension, or a setting changed inside the CMS. You installed one tool and every editor inherited it.

The reason this works everywhere is the reason it sounds too simple: the tool pastes at the operating-system cursor, not into a specific app's API. So the same key that fills a Gutenberg block also fills your email reply to a reader, your Slack message to an editor, and the commit message on the repo behind your site. One tool, every text field, on both Windows and Mac. You don't relearn anything when you switch from drafting to answering comments.

What dictation won't do is the formatting, and I'd rather say that plainly than let you find out at the worst moment. It gives you the words. The H2s, the bold, the pull quote, the block order — that's still you, with the keys and buttons you already use. Anyone selling you "say heading two and watch it format" is selling a demo, not a Tuesday. Get the prose down by voice, then shape the post the way you'd shape any draft. If you live mostly in one editor, the platform-specific guides for the Ghost editor and the Substack editor go deeper on the per-app quirks.

Local or cloud: which mode for drafting posts

For blogging, try local mode first. Your drafts are your drafts — the unpublished post, the newsletter you're half-sure about, the spicy take you might cut before it goes live. It would be a strange habit to route every rough sentence through a cloud server just to get it into your editor when your laptop already has a microphone and a CPU. If your Mac is Apple Silicon or your PC is from the last few years, local handles everyday drafting without complaint, and cloud becomes the escape hatch rather than the default.

Here's how the three paths differ, because the app makes you pick and I'd rather you pick well:

  • Local ParakeetNVIDIA's TDT engine, around 600 MB, and the fastest local option — 5 to 10 times faster than Whisper on CPU. Covers English plus 24 other European languages, 25 in total. No translate-to-English. If you write your blog in English or another European language, this is the quick, fully offline pick.
  • Local Whisperslower than Parakeet on the same machine, but the multilingual builds cover 99 languages and can translate to English. The English-only builds are English-only, not 99. Pick this if you blog in Chinese, Japanese, Korean, or need translation, which Parakeet can't do. The default English model is around 480 MB.
  • Cloud (OpenAI, BYOK)best accuracy and web access, using your own OpenAI key billed straight by OpenAI. Transcription runs on gpt-4o-mini-transcribe by default. Needs internet, so it's the one path that leaves your machine. The Cloud surface is part of Whisper Pro.

The boring truth is that for the kind of prose most bloggers write, local is plenty. Both local engines run fully on your machine with nothing sent to a server, which is the right default for unpublished work. Cloud earns its place when you want top-tier accuracy on a tricky recording or you want the model to pull a fact off the web mid-sentence. For a weekly writing habit, start local and only reach for cloud when local leaves you wanting.

Turning a spoken draft into clean prose

Raw dictation comes out as a run-on. You say "okay so the intro should explain why most blog drafts are slow and then get into the hotkey thing," and that's the unpunctuated wall any speech engine hands you, filler words and all. Cleaning it up is where the paths diverge.

Windows Voice Typing adds punctuation as you speak, and macOS Dictation handles basic punctuation when you say "comma" or "period." For heavier cleanup — stripping the "ums," fixing the run-ons, turning a spoken paragraph into something you'd actually paste into a post — Whisper can run an AI pass. Say the activation phrase "Hey whisper" and the text gets enhanced before it lands. On a local model that runs through Ollama; in cloud mode it's gpt-5-mini by default.

Thinking...
Raw

okay so the intro should explain why most blog drafts are slow and then um get into the hotkey thing and how it works in any editor you know

Cleaned

The intro should explain why most blog drafts are slow, then get into the hotkey and how it works in any editor.

This is the part that makes dictation usable for actual publishing rather than just notes. A cleaned spoken paragraph isn't a finished post — it's a finished first draft, which is the part that used to take the longest. You still read it back, cut the line that wandered, tighten the verbs, and do the editing pass every post needs. Voice gets you a clean draft fast; the editorial judgment is yours, the same as it would be on anything you typed. The honest version of the pitch is that it shortens the slow half of the job, not that it writes the post for you.

That same speak-then-clean flow pays off the moment you stop drafting and start everything else around a blog — you can also type faster with your voice in your email, your social posts, and the captions, all with the one hotkey you already set up.

When to skip dictation and use another tool

Two arrows painted on a road pointing in different directions, illustrating a choice between tools

Sometimes dictation isn't the tool for the job, and pretending otherwise would cost you an afternoon. The clearest case: you're not drafting, you're transcribing. If you recorded a podcast episode, an interview, or a long voice memo and you want the text of that recording, that's a different job. Dictation types what you say live; it doesn't ingest an existing audio file. For that, reach for a transcription service built to take a file in and give a transcript out — that's the right shape of tool, and it isn't this one.

The other case is the quick capture away from your desk. If a line for tomorrow's post arrives while you're on the bus, your phone keyboard already has a microphone — tap it, talk, done. Whisper is a desktop tool for Windows and macOS, so on a phone the keyboard mic is the practical route. And for genuinely short bits at your desk, the built-ins are free: on Windows, Windows key + H opens Voice Typing wherever your cursor is, though it routes through Microsoft's servers and needs internet, so it isn't offline. On Mac, Dictation lets you speak into any text field from System Settings under Keyboard, and on Apple Silicon general text can be processed on-device.

Reach for a dedicated, system-wide tool when the built-ins start hurting: long drafts, multilingual posts, offline privacy on Windows, or wanting one hotkey that behaves the same in your CMS, your email, and your editor. Below that bar, use what's free or use the right-shaped tool. I'm not going to tell you to fire up a dictation app to capture one line on a bus, and I'm not going to tell you it transcribes your podcast — because it doesn't, and you'd be back here annoyed by Thursday.

If most of your drafting really happens in one editor, the platform-specific walkthrough for voice to text in the Substack editor covers the per-app details this overview skips.

No blogging platform shipped great dictation, and after writing this I'm fairly sure none of them needs to. The cursor is the integration. Talk into the editor, get a clean draft, shape it with the keys and buttons you already know. I dictated most of this guide into a text box that wasn't my CMS, with a tool that doesn't care which box it is, then pasted the lot into the post you're reading. The typing I'd have done instead is the hour I got back.

Draft your next post by talking

Hold the hotkey, talk, release. The clean draft lands in whatever editor your cursor is in — WordPress, Ghost, Substack, Medium, Notion, and every other app too.

Free local mode for any signed-in account. No card required to start.

Photo of Denys Medvediev

Denys Medvediev

I'm the one who reads our support email, most probably by dictating the replies.