- iPhone & iPad: Settings → General → Keyboard → Enable Dictation. Tap the mic on the keyboard.
- Mac: System Settings → Keyboard → Dictation. Default shortcut is press Control twice.
- Windows 11: Press Windows + H in any text field. Click the mic icon. Microphone permission required first time.
- Android: Open Gboard, tap the mic icon next to the space bar. Grant permission.
- Privacy check: Toggle Airplane Mode and try again. If it still works, the model is on-device. If not, it is cloud.
- The 30-second answer per platform
- iPhone & iPad — exact setup steps
- Mac — exact setup steps
- Windows 11 — exact setup steps
- Android — exact setup steps
- Where your voice actually goes
- Voice commands every platform supports
- Accuracy benchmarks across platforms
- Making dictation work offline
- The system-wide upgrade for Mac users
- Troubleshooting when dictation breaks
- FAQ
- About the author
The 30-second answer per platform
If you came here for one specific device, jump straight to that section. The full sequence is below in a single scannable block.| Platform | Path to enable | How to invoke | On-device? |
|---|---|---|---|
| iPhone / iPad | Settings → General → Keyboard → Enable Dictation | Tap the mic on keyboard | Yes (XS+) |
| Mac | System Settings → Keyboard → Dictation | Press Control twice | Yes (Apple Silicon) |
| Windows 11 | Windows + H (in text field) | Click mic in panel | No (cloud) |
| Android | Gboard mic icon | Tap mic on keyboard | Pixel + many recent |
| Microsoft Word (any OS) | Home → Dictate | Click Dictate button | No (Azure) |
| Google Docs (browser) | Tools → Voice typing | Click mic icon | No (cloud) |
iPhone & iPad — exact setup steps
iPhone dictation has been on-device for English on iPhone XS and newer since iOS 16, and Apple keeps expanding language coverage every release. As of iOS 19, on-device dictation supports more than 30 languages.Open Settings → General → Keyboard
From the Home Screen, tap Settings. Scroll to General. Inside General, scroll to Keyboard. This is where every keyboard, autocorrect, and voice setting lives.
Toggle on Enable Dictation
Scroll to the bottom of the Keyboard screen. Tap the Enable Dictation toggle. iOS will prompt you to confirm — tap Enable Dictation in the dialog. The first time you do this, iOS may download a small on-device language model in the background. This usually takes under 30 seconds on Wi-Fi.
Open any app with a text field
Open Notes, Messages, Mail, Safari, or any app where you type. Tap into a text field so the keyboard appears.
Tap the microphone icon
The mic icon appears at the bottom-right of the keyboard, next to the keyboard switcher. Tap it. The keyboard slides down and a small soundwave indicator appears. Speak normally. The text appears in real time as you talk.
Tap the mic again to stop
Tap the same mic icon to end dictation. The keyboard returns. iOS automatically capitalizes sentences and adds periods if you pause long enough. To force a period mid-sentence, say "period." For commas, "comma." For new line, "new line."
Verify it is running on-device (iPhone XS or newer)
Open Control Center and tap Airplane Mode. Now try dictation again. If it still works, your model is local. If you get a "Dictation Unavailable" error, your iPhone is using Apple's server-based dictation. This applies to older devices and to languages that have not yet shipped on-device.
Switching languages
iOS dictation auto-detects your keyboard language. If you have multiple keyboards installed (Settings → General → Keyboard → Keyboards → Add New Keyboard), tap and hold the globe icon while dictation is active to switch.Using Siri to dictate
A second iOS path: hold the side button (or say "Hey Siri") and start with "Send a message to..." or "Make a note that says..." This routes through Siri's cloud and is a different system from keyboard dictation. The wording matters: keyboard dictation is what most people mean by "voice to text." ---Mac — exact setup steps
Mac dictation went fully on-device in macOS Ventura (2022) for English and most major languages. Before that, the Mac had a similar feature called "Enhanced Dictation" with download-and-run language packs. The 2022 rebuild made all of that automatic — and removed the old 30-second limit on dictation sessions.Open System Settings → Keyboard
Click the Apple menu in the top-left of your screen, then System Settings. In the sidebar, click Keyboard. (On macOS Monterey and earlier, the path is System Preferences → Keyboard.)
Scroll to Dictation
In the Keyboard pane, scroll to the Dictation section near the bottom. You will see a toggle, language picker, microphone source, and a shortcut field.
Toggle Dictation on
Click the toggle next to Dictation. macOS may prompt you to download a language model — about 200 to 300 MB depending on language. Click Enable in the dialog. The download is one-time.
Set the keyboard shortcut
The default shortcut is press the Control key twice. You can change it to Command twice, Option twice, or a custom combination via the dropdown. I prefer Right Option as a single-press shortcut, which requires a third-party tool — covered in the system-wide section later.
Test it in TextEdit
Open TextEdit (Cmd + Space, type "TextEdit"). Click in the document. Press Control twice. A small mic indicator appears at the cursor. Speak. The text appears live. Press Control twice again to stop, or just stop talking — Mac dictation auto-ends after a pause.
Verify on-device mode
Disconnect Wi-Fi (Control Center → Wi-Fi off) and try dictation again. If it still works, you are on the local model. If you get an error, your Mac is using Apple's cloud service. On Apple Silicon Macs from 2020 onward, English, German, French, Spanish, Italian, Japanese, Mandarin, and several other languages run locally.
Punctuation commands on Mac
The same commands as iOS work: "period," "comma," "question mark," "exclamation point," "new line," "new paragraph," "open quote," "close quote," "smiley face," "frowny face." On macOS Ventura and newer, dictation also adds basic punctuation automatically when it detects sentence boundaries — you can speak in long flowing sentences and the system inserts periods.What Mac dictation cannot do
Built-in Mac dictation does not support custom vocabulary, multi-speaker handling, formatting commands ("make this a list"), or transcription of pre-recorded audio files. For those, you need a dedicated app. See the best voice-to-text apps for Mac roundup for the comparison. ---Windows 11 — exact setup steps
Windows voice typing is the simplest of all four platforms to invoke — one keyboard shortcut works system-wide. The trade-off is privacy. As of Windows 11 23H2, Microsoft documents that voice typing routes through their cloud for transcription. The audio leaves your PC every time you press the shortcut.Click in any text field
Open Notepad, Word, Outlook, or any app with a text field. Click the field so it has focus.
Press Windows + H
Hold down the Windows key and press H. A small voice typing panel appears at the top of your screen with a microphone icon.
Grant microphone permission
The first time you press Windows + H, Windows asks for microphone access. Click Allow. If you missed the prompt, go to Settings → Privacy & security → Microphone and toggle on Microphone access and Let apps access your microphone.
Click the microphone icon
In the voice typing panel, click the microphone. It turns red. Speak. Text appears in the focused field. The panel shows a real-time transcription preview.
Toggle auto-punctuation
Click the gear icon in the voice typing panel. Toggle Auto punctuation on. Microsoft's auto-punctuation is roughly comparable to Apple's in 2026 — both insert reasonable periods and commas when you speak naturally.
Stop dictation
Say "stop listening," click the mic icon again, or press Windows + H to close the panel. Closing the panel ends the session.
Voice access vs voice typing
Windows 11 also has Voice access, a separate accessibility feature for navigating the entire OS by voice — opening apps, clicking buttons, scrolling. That is not voice typing. Voice typing is just for inserting text into fields. Voice access lives at Settings → Accessibility → Speech → Voice access. If you only want to type by voice, ignore Voice access and stick with Windows + H.Privacy in Windows voice typing
Microsoft's official documentation confirms that voice typing transcription happens in the cloud. The Windows Voice Typing FAQ states that audio is sent to Microsoft servers, processed, and the resulting text is returned to your machine. There is currently no on-device equivalent built into Windows 11 for general voice typing. ---Android — exact setup steps
Android voice typing lives in Gboard, Google's keyboard app. Most non-Pixel Android phones come with the manufacturer's keyboard pre-installed (Samsung Keyboard, OneUI Keyboard, etc.). For consistent voice-typing behaviour, install Gboard from the Play Store and set it as default.Install Gboard if you do not have it
Pixel phones come with Gboard. Samsung, OnePlus, Xiaomi, and other Android phones often ship with their own keyboards. To get Google's voice typing, install Gboard - the Google Keyboard from the Play Store. Then open Settings → System → Languages & input → On-screen keyboard → Manage keyboards and enable Gboard. Set it as default in the same screen.
Open any app with a text field
Tap a text field anywhere — Messages, Gmail, Notes, the Google search bar. Gboard opens.
Tap the microphone icon
The mic icon sits to the right of the space bar on Gboard. Tap and hold it briefly to start voice typing. The keyboard collapses and a "Speak now" indicator replaces it.
Grant microphone permission
If Gboard does not have microphone permission, Android will prompt you. Tap While using the app. If you missed the prompt, go to Settings → Apps → Gboard → Permissions → Microphone and set it to Allow.
Speak
Speak naturally. Gboard transcribes in real time. Pause for a couple of seconds and Gboard will stop listening — tap the mic again to resume. Or you can use voice commands: "send" to send a message in supported apps, "delete" to remove the last word, "comma," "period," "new line."
Enable offline voice typing (optional)
To run voice typing locally without a network round-trip, open Gboard Settings → Voice typing → Offline speech recognition. Tap your language to download the on-device model — usually 80 to 200 MB. After download, voice typing in that language works offline. Google's Gboard Help documents which languages support offline mode; on Pixel devices, every language Gboard supports is offline-capable.
Samsung Keyboard alternative
If you stay on Samsung Keyboard, the equivalent path is: tap the mic icon in the toolbar above the keyboard. Samsung uses a different speech engine, configurable in Samsung Keyboard settings → Voice input → Default voice input service. You can switch between Samsung's own engine and Google's. Google's engine is more accurate in 2026; Samsung's engine offers better integration with Galaxy AI features for native callers. ---Where your voice actually goes
This is the part the support pages skip. Built-in dictation looks identical to the user across platforms — you press a button, you talk, text appears. What happens between the microphone and the text differs dramatically, and the difference matters if you ever dictate sensitive content.The four privacy modes
- On-device (local-only): Audio never leaves the machine. The model runs on your CPU, GPU, or neural processor. Examples: Mac dictation on Apple Silicon for supported languages, iPhone XS+ dictation, Pixel voice typing offline pack, on-device Whisper apps like MetaWhisp.
- Cloud-only: Audio uploads to a vendor server, transcription happens remotely, text returns. Examples: Windows 11 voice typing, Google Docs Voice Typing, Microsoft Word Dictate, Otter.ai.
- Hybrid (cloud-with-fallback): Cloud preferred, on-device fallback when offline. Examples: older iPhone dictation, some Android keyboards.
- On-device-with-improvement: Local transcription, but the vendor asks permission to send corrected samples back to improve future models. Examples: Apple's "Improve Dictation" opt-in toggle.
How to verify your mode
The Airplane Mode test is the only reliable check. Toggle Airplane Mode on, then try to dictate. If it still works, your model is local. If you get an error or silence, the platform was relying on a cloud connection. This test cuts through marketing language. A platform that markets "private, on-device voice processing" but fails the Airplane Mode test was lying with a comma — it has on-device features, but voice was not one of them.What gets stored after the cloud trip
Each cloud provider has its own retention policy. Microsoft, Google, and Apple all publish privacy summaries. Microsoft's documentation states that voice typing audio is processed without permanent retention by default. Google's documentation states that Voice Typing audio is processed in real-time, but if you have Web & App Activity enabled in your Google account, transcripts may be saved. Apple's iOS dictation transcripts are not saved beyond the immediate session unless you opt into the "Improve Dictation" sample sharing. The honest summary: cloud dictation is fine for casual messages. It is not fine for medical notes, legal drafts, financial details, or any content covered by NDA, HIPAA, or attorney-client privilege. For those, you need on-device dictation. See the deep-dive on private voice-to-text for Mac for a detailed audit of what that takes. ---Voice commands every platform supports
Once dictation is on, the next productivity gain is learning the punctuation and editing commands. They differ slightly between vendors but the core set is universal.| What you say | What you get | iPhone | Mac | Windows | Android |
|---|---|---|---|---|---|
| "period" | . | Yes | Yes | Yes | Yes |
| "comma" | , | Yes | Yes | Yes | Yes |
| "question mark" | ? | Yes | Yes | Yes | Yes |
| "exclamation point" / "exclamation mark" | ! | Yes | Yes | Yes | Yes |
| "new line" | line break | Yes | Yes | Yes | Yes |
| "new paragraph" | blank line + line break | Yes | Yes | Yes | Sometimes |
| "open quote" / "close quote" | " " | Yes | Yes | Yes | Some |
| "smiley face" | :) | Yes | Yes | No | No |
| "all caps" + word | WORD | Yes | Yes | Caps lock | No |
| "delete that" / "scratch that" | removes last sentence | No | No | Yes | No |
| "stop listening" | ends dictation | No | No | Yes | Implicit |
The commands that matter
If you only learn three commands across platforms, learn these:- "period" — ends a sentence cleanly, even when auto-punctuation misses it.
- "new line" — inserts a line break for lists, addresses, or bullets you build by voice.
- "open quote" / "close quote" — lets you quote someone without going back to fix straight quotes vs curly quotes.
Accuracy benchmarks across platforms
The standard metric for voice-to-text quality is Word Error Rate (WER), the percentage of words in the transcript that do not match the reference. Lower is better.| System | Clean speech WER | Noisy / accented WER | On-device? |
|---|---|---|---|
| iPhone Dictation (iOS 19, on-device) | ~6–8% | ~12–18% | Yes |
| Mac Dictation (macOS 16, on-device) | ~6–8% | ~12–18% | Yes |
| Windows 11 Voice Typing | ~5–7% | ~10–15% | No |
| Google Voice Typing (Pixel offline) | ~5–7% | ~10–14% | Yes |
| Whisper large-v3-turbo (on-device) | ~4–6% | ~8–12% | Yes |
| Microsoft Word Dictate (Azure) | ~5–7% | ~10–14% | No |
| Otter.ai (cloud, multi-speaker) | ~4–6% | ~9–13% | No |
| Dragon Professional (one-speaker) | ~2–4% | ~6–10% | Yes |
Accuracy gotchas
- Names and proper nouns are the worst case. Built-in dictation has no awareness of your contacts, projects, or industry. "Alex" might come out as "Alec" or "Alex's" depending on context.
- Technical terms are mid-tier. "useState" becomes "use state." "TypeScript" becomes "type script." "Postgres" sometimes becomes "post grass."
- Code dictation is universally bad on built-in systems. None of them handle "open paren, var x equals one" gracefully. For coding by voice, the GitHub Copilot Voice and Whisper-based wrappers are dramatically better.
- Background noise hurts cloud systems less than on-device because cloud models are larger and trained on broader data. The trade-off is privacy.
Making dictation work offline
Offline dictation matters in three places: airplanes, secure facilities (hospitals, law offices, courthouses, government buildings), and rural areas with bad cell signal. None of the platform vendors makes the offline path obvious.iPhone offline setup
If you have an iPhone XS or newer running iOS 16 or later, on-device dictation is automatic for English and several other languages. No setup needed. To check coverage for your language, go to Settings → General → Keyboard → Dictation → Language. Languages running on-device show a small icon next to them.Mac offline setup
On Apple Silicon Macs running macOS Ventura or later, dictation runs on-device by default for major languages. To verify, System Settings → Keyboard → Dictation → Language. Languages with the on-device label work offline. Older Intel Macs require an internet connection for dictation.Windows 11 offline status
There is no offline path for Windows 11 voice typing as of 23H2. If you need offline voice-to-text on Windows, your options are: install a Whisper-based desktop app (free, open-source), use Dragon Professional ($300+ one-time), or run the Web Speech API in a browser tab pointing to a local server.Android offline setup
Open Gboard Settings → Voice typing → Offline speech recognition. Pick your language and download. The offline pack is 80 to 200 MB. After download, Gboard's mic icon will work offline. On Pixel devices the offline mode is the default — Pixel keeps voice processing local for privacy.Verifying offline really works
Toggle Airplane Mode on. Open a text field. Try dictation. Speak a long sentence. If the text appears, you are running locally. If you see an error or no text, your platform was secretly using cloud and the offline pack was either not installed or not active. ---The system-wide upgrade for Mac users
Built-in Mac dictation has a hard limit: you must press a shortcut every time, and the shortcut requires two presses of the same modifier (Control + Control by default). For occasional use this is fine. For daily use — replying to messages, drafting emails, taking notes — the friction adds up. I built MetaWhisp partly to fix this exact gap. It runs Whisper large-v3-turbo on the Apple Neural Engine and binds dictation to a single key press: hold Right Option, talk, release. Text appears wherever your cursor is. Word, Slack, your IDE, the URL bar, anywhere.What it adds over built-in Mac dictation
- Single-press hotkey: push-to-talk on Right Option. No double-press, no menu bar click. See the global hotkey feature for details.
- Whisper-grade accuracy: 4-6% WER on clean speech vs 6-8% for built-in dictation.
- System-wide: works in any text field, including ones where macOS dictation is sometimes disabled (banking apps, secure notes).
- 30+ languages with auto-detect: no need to manually switch keyboards mid-session.
- Punctuation and formatting: AI-driven smart formatting beyond just "comma" and "period."
- 100% on-device: verifiable with Little Snitch or Lulu — zero network traffic during transcription. See the on-device transcription page.
Limitations
- Mac-only. iPhone, iPad, Windows, and Android use whatever the OS provides.
- Apple Silicon required for the fast model — older Intel Macs run a smaller variant.
- 1.5 GB initial model download.
Troubleshooting when dictation breaks
Voice typing tends to break in predictable ways. Here is the quick triage list."The microphone icon is missing"
- iPhone: Verify Settings → General → Keyboard → Enable Dictation is on. Restart the iPhone if it is on but the icon is missing.
- Mac: System Settings → Keyboard → Dictation must be on. The mic only appears mid-dictation, not on the keyboard itself.
- Windows: The voice typing panel only opens with Windows + H, not from a keyboard icon.
- Android: Make sure Gboard is the active keyboard. Some apps disable voice input in privacy-sensitive fields.
"It says 'dictation unavailable'"
- Check internet connection if your platform is cloud-based.
- Check that the language pack downloaded successfully — sometimes the download silently fails on Wi-Fi changes.
- Restart the device. The dictation service occasionally hangs.
- On iPhone and Mac, sign out and back into iCloud. Dictation is tied to your Apple ID for some features.
"It misheard everything"
- Check microphone source. On Mac, System Settings → Keyboard → Dictation → Microphone Source. On Windows, Settings → System → Sound → Input. Make sure the right mic is selected.
- Reduce background noise. Built-in dictation degrades sharply above ~50 dB ambient noise.
- Speak closer to the mic. 6–12 inches is the sweet spot for most laptop microphones.
- Check for accent coverage. Some platforms support fewer accents than the language overall.
"It worked yesterday but stopped today"
- Operating system updates frequently reset privacy permissions. Re-grant microphone access.
- Check if a third-party app (Krisp, Loopback, BlackHole) is occupying the microphone exclusively.
- Restart the dictation service. On Mac:
killall com.apple.SpeechRecognitionCore.speechrecognitiondin Terminal. On Windows: restart the Speech service in services.msc.
"I want to disable it permanently"
- iPhone: Settings → General → Keyboard → toggle off Enable Dictation.
- Mac: System Settings → Keyboard → Dictation → toggle off. Click Delete to also remove the language model.
- Windows: Settings → Privacy & security → Speech → toggle off Online speech recognition.
- Android: Gboard Settings → Voice typing → toggle off "Use voice typing."
Frequently asked questions
How do I turn on voice to text?
On iPhone: Settings → General → Keyboard → Enable Dictation, then tap the mic on the keyboard. On Mac: System Settings → Keyboard → Dictation, then press Control twice. On Windows 11: press Windows + H in any text field. On Android: open Gboard, tap the mic icon next to the space bar. Each takes under a minute and is free.Is voice to text free on every device?
Yes. Apple includes dictation in iOS, iPadOS, and macOS at no charge. Microsoft includes voice typing in Windows 11 free. Google includes voice typing in Gboard free. There is no subscription required for the built-in features. Premium services like Microsoft Word's Dictate or Otter.ai add features like cloud transcription and multi-speaker labelling but are optional.Does voice to text work offline?
On iPhone XS or newer and Apple Silicon Macs, dictation runs on-device for English and several major languages — fully offline. On Pixel devices and Android phones with the offline pack downloaded, voice typing works offline. Windows 11 voice typing currently requires internet. To verify, toggle Airplane Mode and try dictation. If it still works, your model is local.Why is voice to text not working on my phone?
The most common causes: microphone permission was not granted, dictation was disabled in Settings, the offline language pack was not downloaded, the device is muted, or background noise is too high. On iPhone, restart after enabling dictation. On Android, clear Gboard cache. On Windows, check Privacy & Security → Microphone permissions.Is voice to text accurate enough for real work?
On clean speech in a quiet room, modern voice-to-text systems hit 5 to 10 percent word error rate. Apple, Microsoft, and Google built-in dictation are roughly comparable in 2026. For everyday email, notes, and messages, accuracy is high enough that fixing errors takes less time than typing the same text would have. For specialized vocabulary (medical, legal, code), dedicated tools like Dragon Professional or MetaWhisp outperform built-in dictation.What is the difference between dictation and voice typing?
Same feature, different vendor names. Apple calls it Dictation. Google calls it Voice Typing in Gboard. Microsoft calls it Voice Typing in Windows 11. All three convert speech into text in real time. The differences are in the underlying model and in privacy: Apple's runs on-device on modern hardware, Google's runs on-device on Pixel and recent phones with offline packs, Microsoft's runs through Azure cloud.Does voice to text send my voice to the cloud?
Sometimes. iPhone XS and newer plus Apple Silicon Macs run short-form dictation on-device for major languages. Windows 11 voice typing routes audio through Microsoft's cloud. Android Gboard runs on-device on Pixel devices and offline-pack-enabled devices, otherwise sends audio to Google. To verify privately, enable Airplane Mode and try dictation. Working dictation in Airplane Mode means local. Failing means cloud.Can I turn on voice to text in any app?
Yes, system-wide dictation works in any text field across the operating system. iOS dictation works wherever a keyboard appears. macOS dictation works in any text field. Windows 11 voice typing works in any focused text field. Android voice typing works through Gboard which appears in most text fields. Some banking and healthcare apps block third-party keyboards or dictation; those are intentional security restrictions.How do I turn off voice to text?
On iPhone: Settings → General → Keyboard → toggle off Enable Dictation. On Mac: System Settings → Keyboard → Dictation → toggle off. On Windows 11: voice typing closes when you close the panel; to disable globally, Settings → Privacy & security → Speech. On Android: Gboard Settings → Voice typing → toggle off. Disabling does not delete cached language packs unless you also clear them.What is the best voice to text app overall?
For Mac users who want system-wide on-device dictation, MetaWhisp running Whisper large-v3-turbo. For iPhone users, the built-in iOS dictation is excellent and free. For one-speaker professional dictation, Dragon Professional (Windows). For multi-speaker meeting transcription, Otter.ai. The "best" depends on whether privacy, accuracy, multi-speaker, or platform breadth matters most. See the best voice-to-text apps roundup for the side-by-side.About the author
Andrew Dyuzhov
CEO & Solo Founder, MetaWhisp
I run MetaWhisp, a free on-device voice-to-text app for Mac that runs Whisper large-v3-turbo on the Apple Neural Engine. I have spent four years studying every dictation system the major platforms ship — Apple, Microsoft, Google, Nuance, OpenAI. The setup paths in this guide are the ones I use myself when I move between an iPhone, a Mac, a Windows test machine, and an Android phone for testing.
If something here is wrong, the OS shipped a path change since publication, or you have a workflow that beats what I described, email me. I update this guide every quarter to track release changes.
What MetaWhisp adds for Mac users specifically:
- Single-press push-to-talk hotkey (Right Option). No double-tap, no menu bar.
- Whisper-grade accuracy (4-6% WER) vs the 6-8% built-in Mac dictation hits.
- Works in any app, including ones where macOS dictation is sometimes disabled.
- 30+ languages with auto-detect mid-sentence.
- Verifiably on-device — zero network calls during transcription, auditable with Little Snitch.
- Free forever for unlimited local use.
Follow the build journey on X (@hypersonq).
Mac users: get system-wide voice-to-text in 60 seconds
If the built-in Mac dictation is not enough, MetaWhisp gives you single-press push-to-talk in any app, runs entirely on-device, and is free forever for unlimited local use.
Download MetaWhisp