4
Platforms covered
<1min
Setup time per device
$0
Cost — built into every OS
5-10%
Modern WER on clean speech
AD
Andrew Dyuzhov
CEO & Solo Founder, MetaWhisp · @hypersonq
To turn on voice to text, open your device settings, find Keyboard or Dictation, and toggle it on. On iPhone the path is Settings → General → Keyboard → Enable Dictation. On Mac it is System Settings → Keyboard → Dictation. On Windows 11 you press Windows + H in any text field. On Android you tap the microphone icon next to the space bar in Gboard. Every platform builds it in. None charges for it. The whole thing takes less than a minute per device. What the support pages do not tell you is where your audio actually goes after you press the mic. Some platforms run the speech model on your device. Some upload your voice to a cloud server, transcribe it remotely, and send the text back. The instructions below cover both — how to enable dictation on every modern platform, and how to verify which mode you are in before you dictate something you would not want a third party to read. I run MetaWhisp, a free on-device voice-to-text app for Mac. I have spent four years studying every dictation system the major platforms ship. The differences matter more than any of the platform vendors will admit, and the setup paths drift each release because Apple, Google, and Microsoft keep moving the toggles around. This guide is current as of May 2026 — iOS 19, macOS 16, Windows 11 23H2, and Android 15.
The 60-second answer:
  • iPhone & iPad: Settings → General → Keyboard → Enable Dictation. Tap the mic on the keyboard.
  • Mac: System Settings → Keyboard → Dictation. Default shortcut is press Control twice.
  • Windows 11: Press Windows + H in any text field. Click the mic icon. Microphone permission required first time.
  • Android: Open Gboard, tap the mic icon next to the space bar. Grant permission.
  • Privacy check: Toggle Airplane Mode and try again. If it still works, the model is on-device. If not, it is cloud.
---

The 30-second answer per platform

If you came here for one specific device, jump straight to that section. The full sequence is below in a single scannable block.
PlatformPath to enableHow to invokeOn-device?
iPhone / iPadSettings → General → Keyboard → Enable DictationTap the mic on keyboardYes (XS+)
MacSystem Settings → Keyboard → DictationPress Control twiceYes (Apple Silicon)
Windows 11Windows + H (in text field)Click mic in panelNo (cloud)
AndroidGboard mic iconTap mic on keyboardPixel + many recent
Microsoft Word (any OS)Home → DictateClick Dictate buttonNo (Azure)
Google Docs (browser)Tools → Voice typingClick mic iconNo (cloud)
The pattern across vendors: classic OS-level dictation is increasingly on-device. Anything that runs in a browser or a third-party app that you did not pre-install (Microsoft Word's Dictate, Google Docs Voice typing, Otter.ai, Krisp) is cloud-based. The cloud route gives slightly better punctuation and formatting on long-form audio. The on-device route never sends your audio off your device. ---

iPhone & iPad — exact setup steps

iPhone dictation has been on-device for English on iPhone XS and newer since iOS 16, and Apple keeps expanding language coverage every release. As of iOS 19, on-device dictation supports more than 30 languages.
1

Open Settings → General → Keyboard

From the Home Screen, tap Settings. Scroll to General. Inside General, scroll to Keyboard. This is where every keyboard, autocorrect, and voice setting lives.

2

Toggle on Enable Dictation

Scroll to the bottom of the Keyboard screen. Tap the Enable Dictation toggle. iOS will prompt you to confirm — tap Enable Dictation in the dialog. The first time you do this, iOS may download a small on-device language model in the background. This usually takes under 30 seconds on Wi-Fi.

3

Open any app with a text field

Open Notes, Messages, Mail, Safari, or any app where you type. Tap into a text field so the keyboard appears.

4

Tap the microphone icon

The mic icon appears at the bottom-right of the keyboard, next to the keyboard switcher. Tap it. The keyboard slides down and a small soundwave indicator appears. Speak normally. The text appears in real time as you talk.

5

Tap the mic again to stop

Tap the same mic icon to end dictation. The keyboard returns. iOS automatically capitalizes sentences and adds periods if you pause long enough. To force a period mid-sentence, say "period." For commas, "comma." For new line, "new line."

6

Verify it is running on-device (iPhone XS or newer)

Open Control Center and tap Airplane Mode. Now try dictation again. If it still works, your model is local. If you get a "Dictation Unavailable" error, your iPhone is using Apple's server-based dictation. This applies to older devices and to languages that have not yet shipped on-device.

Switching languages

iOS dictation auto-detects your keyboard language. If you have multiple keyboards installed (Settings → General → Keyboard → Keyboards → Add New Keyboard), tap and hold the globe icon while dictation is active to switch.

Using Siri to dictate

A second iOS path: hold the side button (or say "Hey Siri") and start with "Send a message to..." or "Make a note that says..." This routes through Siri's cloud and is a different system from keyboard dictation. The wording matters: keyboard dictation is what most people mean by "voice to text." ---

Mac — exact setup steps

Mac dictation went fully on-device in macOS Ventura (2022) for English and most major languages. Before that, the Mac had a similar feature called "Enhanced Dictation" with download-and-run language packs. The 2022 rebuild made all of that automatic — and removed the old 30-second limit on dictation sessions.
1

Open System Settings → Keyboard

Click the Apple menu in the top-left of your screen, then System Settings. In the sidebar, click Keyboard. (On macOS Monterey and earlier, the path is System Preferences → Keyboard.)

2

Scroll to Dictation

In the Keyboard pane, scroll to the Dictation section near the bottom. You will see a toggle, language picker, microphone source, and a shortcut field.

3

Toggle Dictation on

Click the toggle next to Dictation. macOS may prompt you to download a language model — about 200 to 300 MB depending on language. Click Enable in the dialog. The download is one-time.

4

Set the keyboard shortcut

The default shortcut is press the Control key twice. You can change it to Command twice, Option twice, or a custom combination via the dropdown. I prefer Right Option as a single-press shortcut, which requires a third-party tool — covered in the system-wide section later.

5

Test it in TextEdit

Open TextEdit (Cmd + Space, type "TextEdit"). Click in the document. Press Control twice. A small mic indicator appears at the cursor. Speak. The text appears live. Press Control twice again to stop, or just stop talking — Mac dictation auto-ends after a pause.

6

Verify on-device mode

Disconnect Wi-Fi (Control Center → Wi-Fi off) and try dictation again. If it still works, you are on the local model. If you get an error, your Mac is using Apple's cloud service. On Apple Silicon Macs from 2020 onward, English, German, French, Spanish, Italian, Japanese, Mandarin, and several other languages run locally.

Punctuation commands on Mac

The same commands as iOS work: "period," "comma," "question mark," "exclamation point," "new line," "new paragraph," "open quote," "close quote," "smiley face," "frowny face." On macOS Ventura and newer, dictation also adds basic punctuation automatically when it detects sentence boundaries — you can speak in long flowing sentences and the system inserts periods.

What Mac dictation cannot do

Built-in Mac dictation does not support custom vocabulary, multi-speaker handling, formatting commands ("make this a list"), or transcription of pre-recorded audio files. For those, you need a dedicated app. See the best voice-to-text apps for Mac roundup for the comparison. ---

Windows 11 — exact setup steps

Windows voice typing is the simplest of all four platforms to invoke — one keyboard shortcut works system-wide. The trade-off is privacy. As of Windows 11 23H2, Microsoft documents that voice typing routes through their cloud for transcription. The audio leaves your PC every time you press the shortcut.
1

Click in any text field

Open Notepad, Word, Outlook, or any app with a text field. Click the field so it has focus.

2

Press Windows + H

Hold down the Windows key and press H. A small voice typing panel appears at the top of your screen with a microphone icon.

3

Grant microphone permission

The first time you press Windows + H, Windows asks for microphone access. Click Allow. If you missed the prompt, go to Settings → Privacy & security → Microphone and toggle on Microphone access and Let apps access your microphone.

4

Click the microphone icon

In the voice typing panel, click the microphone. It turns red. Speak. Text appears in the focused field. The panel shows a real-time transcription preview.

5

Toggle auto-punctuation

Click the gear icon in the voice typing panel. Toggle Auto punctuation on. Microsoft's auto-punctuation is roughly comparable to Apple's in 2026 — both insert reasonable periods and commas when you speak naturally.

6

Stop dictation

Say "stop listening," click the mic icon again, or press Windows + H to close the panel. Closing the panel ends the session.

Voice access vs voice typing

Windows 11 also has Voice access, a separate accessibility feature for navigating the entire OS by voice — opening apps, clicking buttons, scrolling. That is not voice typing. Voice typing is just for inserting text into fields. Voice access lives at Settings → Accessibility → Speech → Voice access. If you only want to type by voice, ignore Voice access and stick with Windows + H.

Privacy in Windows voice typing

Microsoft's official documentation confirms that voice typing transcription happens in the cloud. The Windows Voice Typing FAQ states that audio is sent to Microsoft servers, processed, and the resulting text is returned to your machine. There is currently no on-device equivalent built into Windows 11 for general voice typing. ---

Android — exact setup steps

Android voice typing lives in Gboard, Google's keyboard app. Most non-Pixel Android phones come with the manufacturer's keyboard pre-installed (Samsung Keyboard, OneUI Keyboard, etc.). For consistent voice-typing behaviour, install Gboard from the Play Store and set it as default.
1

Install Gboard if you do not have it

Pixel phones come with Gboard. Samsung, OnePlus, Xiaomi, and other Android phones often ship with their own keyboards. To get Google's voice typing, install Gboard - the Google Keyboard from the Play Store. Then open Settings → System → Languages & input → On-screen keyboard → Manage keyboards and enable Gboard. Set it as default in the same screen.

2

Open any app with a text field

Tap a text field anywhere — Messages, Gmail, Notes, the Google search bar. Gboard opens.

3

Tap the microphone icon

The mic icon sits to the right of the space bar on Gboard. Tap and hold it briefly to start voice typing. The keyboard collapses and a "Speak now" indicator replaces it.

4

Grant microphone permission

If Gboard does not have microphone permission, Android will prompt you. Tap While using the app. If you missed the prompt, go to Settings → Apps → Gboard → Permissions → Microphone and set it to Allow.

5

Speak

Speak naturally. Gboard transcribes in real time. Pause for a couple of seconds and Gboard will stop listening — tap the mic again to resume. Or you can use voice commands: "send" to send a message in supported apps, "delete" to remove the last word, "comma," "period," "new line."

6

Enable offline voice typing (optional)

To run voice typing locally without a network round-trip, open Gboard Settings → Voice typingOffline speech recognition. Tap your language to download the on-device model — usually 80 to 200 MB. After download, voice typing in that language works offline. Google's Gboard Help documents which languages support offline mode; on Pixel devices, every language Gboard supports is offline-capable.

Samsung Keyboard alternative

If you stay on Samsung Keyboard, the equivalent path is: tap the mic icon in the toolbar above the keyboard. Samsung uses a different speech engine, configurable in Samsung Keyboard settings → Voice input → Default voice input service. You can switch between Samsung's own engine and Google's. Google's engine is more accurate in 2026; Samsung's engine offers better integration with Galaxy AI features for native callers. ---

Where your voice actually goes

This is the part the support pages skip. Built-in dictation looks identical to the user across platforms — you press a button, you talk, text appears. What happens between the microphone and the text differs dramatically, and the difference matters if you ever dictate sensitive content.

The four privacy modes

  1. On-device (local-only): Audio never leaves the machine. The model runs on your CPU, GPU, or neural processor. Examples: Mac dictation on Apple Silicon for supported languages, iPhone XS+ dictation, Pixel voice typing offline pack, on-device Whisper apps like MetaWhisp.
  2. Cloud-only: Audio uploads to a vendor server, transcription happens remotely, text returns. Examples: Windows 11 voice typing, Google Docs Voice Typing, Microsoft Word Dictate, Otter.ai.
  3. Hybrid (cloud-with-fallback): Cloud preferred, on-device fallback when offline. Examples: older iPhone dictation, some Android keyboards.
  4. On-device-with-improvement: Local transcription, but the vendor asks permission to send corrected samples back to improve future models. Examples: Apple's "Improve Dictation" opt-in toggle.

How to verify your mode

The Airplane Mode test is the only reliable check. Toggle Airplane Mode on, then try to dictate. If it still works, your model is local. If you get an error or silence, the platform was relying on a cloud connection. This test cuts through marketing language. A platform that markets "private, on-device voice processing" but fails the Airplane Mode test was lying with a comma — it has on-device features, but voice was not one of them.

What gets stored after the cloud trip

Each cloud provider has its own retention policy. Microsoft, Google, and Apple all publish privacy summaries. Microsoft's documentation states that voice typing audio is processed without permanent retention by default. Google's documentation states that Voice Typing audio is processed in real-time, but if you have Web & App Activity enabled in your Google account, transcripts may be saved. Apple's iOS dictation transcripts are not saved beyond the immediate session unless you opt into the "Improve Dictation" sample sharing. The honest summary: cloud dictation is fine for casual messages. It is not fine for medical notes, legal drafts, financial details, or any content covered by NDA, HIPAA, or attorney-client privilege. For those, you need on-device dictation. See the deep-dive on private voice-to-text for Mac for a detailed audit of what that takes. ---

Voice commands every platform supports

Once dictation is on, the next productivity gain is learning the punctuation and editing commands. They differ slightly between vendors but the core set is universal.
What you sayWhat you getiPhoneMacWindowsAndroid
"period".YesYesYesYes
"comma",YesYesYesYes
"question mark"?YesYesYesYes
"exclamation point" / "exclamation mark"!YesYesYesYes
"new line"line breakYesYesYesYes
"new paragraph"blank line + line breakYesYesYesSometimes
"open quote" / "close quote"" "YesYesYesSome
"smiley face":)YesYesNoNo
"all caps" + wordWORDYesYesCaps lockNo
"delete that" / "scratch that"removes last sentenceNoNoYesNo
"stop listening"ends dictationNoNoYesImplicit

The commands that matter

If you only learn three commands across platforms, learn these:
  1. "period" — ends a sentence cleanly, even when auto-punctuation misses it.
  2. "new line" — inserts a line break for lists, addresses, or bullets you build by voice.
  3. "open quote" / "close quote" — lets you quote someone without going back to fix straight quotes vs curly quotes.
The advanced commands ("delete that," "all caps," "select last word") are useful but inconsistent. Practice the three above, you will already be twice as fast as a typist on most short-form text. ---

Accuracy benchmarks across platforms

The standard metric for voice-to-text quality is Word Error Rate (WER), the percentage of words in the transcript that do not match the reference. Lower is better.
SystemClean speech WERNoisy / accented WEROn-device?
iPhone Dictation (iOS 19, on-device)~6–8%~12–18%Yes
Mac Dictation (macOS 16, on-device)~6–8%~12–18%Yes
Windows 11 Voice Typing~5–7%~10–15%No
Google Voice Typing (Pixel offline)~5–7%~10–14%Yes
Whisper large-v3-turbo (on-device)~4–6%~8–12%Yes
Microsoft Word Dictate (Azure)~5–7%~10–14%No
Otter.ai (cloud, multi-speaker)~4–6%~9–13%No
Dragon Professional (one-speaker)~2–4%~6–10%Yes
These ranges come from public benchmarks on standard test sets (LibriSpeech, Common Voice, CHiME) plus my own informal testing. Your actual numbers depend on your accent, your microphone, your background noise, and your vocabulary (technical jargon hits accuracy hard).

Accuracy gotchas

OpenAI's Whisper paper documents that the large-v3 model was trained on 680,000 hours of multilingual audio. That breadth is why open-source Whisper-based tools tend to outperform every built-in system on noisy or accented speech. The trade-off is the 1.5 GB model download and a slower inference time on older hardware. ---

Making dictation work offline

Offline dictation matters in three places: airplanes, secure facilities (hospitals, law offices, courthouses, government buildings), and rural areas with bad cell signal. None of the platform vendors makes the offline path obvious.

iPhone offline setup

If you have an iPhone XS or newer running iOS 16 or later, on-device dictation is automatic for English and several other languages. No setup needed. To check coverage for your language, go to Settings → General → Keyboard → Dictation → Language. Languages running on-device show a small icon next to them.

Mac offline setup

On Apple Silicon Macs running macOS Ventura or later, dictation runs on-device by default for major languages. To verify, System Settings → Keyboard → Dictation → Language. Languages with the on-device label work offline. Older Intel Macs require an internet connection for dictation.

Windows 11 offline status

There is no offline path for Windows 11 voice typing as of 23H2. If you need offline voice-to-text on Windows, your options are: install a Whisper-based desktop app (free, open-source), use Dragon Professional ($300+ one-time), or run the Web Speech API in a browser tab pointing to a local server.

Android offline setup

Open Gboard Settings → Voice typing → Offline speech recognition. Pick your language and download. The offline pack is 80 to 200 MB. After download, Gboard's mic icon will work offline. On Pixel devices the offline mode is the default — Pixel keeps voice processing local for privacy.

Verifying offline really works

Toggle Airplane Mode on. Open a text field. Try dictation. Speak a long sentence. If the text appears, you are running locally. If you see an error or no text, your platform was secretly using cloud and the offline pack was either not installed or not active. ---

The system-wide upgrade for Mac users

Built-in Mac dictation has a hard limit: you must press a shortcut every time, and the shortcut requires two presses of the same modifier (Control + Control by default). For occasional use this is fine. For daily use — replying to messages, drafting emails, taking notes — the friction adds up. I built MetaWhisp partly to fix this exact gap. It runs Whisper large-v3-turbo on the Apple Neural Engine and binds dictation to a single key press: hold Right Option, talk, release. Text appears wherever your cursor is. Word, Slack, your IDE, the URL bar, anywhere.

What it adds over built-in Mac dictation

Limitations

MetaWhisp is free to download. There is a paid upgrade for cloud sync of presets and a longer history, but the core dictation is free forever. ---

Troubleshooting when dictation breaks

Voice typing tends to break in predictable ways. Here is the quick triage list.

"The microphone icon is missing"

"It says 'dictation unavailable'"

"It misheard everything"

"It worked yesterday but stopped today"

"I want to disable it permanently"

---

Frequently asked questions

How do I turn on voice to text? On iPhone: Settings → General → Keyboard → Enable Dictation, then tap the mic on the keyboard. On Mac: System Settings → Keyboard → Dictation, then press Control twice. On Windows 11: press Windows + H in any text field. On Android: open Gboard, tap the mic icon next to the space bar. Each takes under a minute and is free.
Is voice to text free on every device? Yes. Apple includes dictation in iOS, iPadOS, and macOS at no charge. Microsoft includes voice typing in Windows 11 free. Google includes voice typing in Gboard free. There is no subscription required for the built-in features. Premium services like Microsoft Word's Dictate or Otter.ai add features like cloud transcription and multi-speaker labelling but are optional.
Does voice to text work offline? On iPhone XS or newer and Apple Silicon Macs, dictation runs on-device for English and several major languages — fully offline. On Pixel devices and Android phones with the offline pack downloaded, voice typing works offline. Windows 11 voice typing currently requires internet. To verify, toggle Airplane Mode and try dictation. If it still works, your model is local.
Why is voice to text not working on my phone? The most common causes: microphone permission was not granted, dictation was disabled in Settings, the offline language pack was not downloaded, the device is muted, or background noise is too high. On iPhone, restart after enabling dictation. On Android, clear Gboard cache. On Windows, check Privacy & Security → Microphone permissions.
Is voice to text accurate enough for real work? On clean speech in a quiet room, modern voice-to-text systems hit 5 to 10 percent word error rate. Apple, Microsoft, and Google built-in dictation are roughly comparable in 2026. For everyday email, notes, and messages, accuracy is high enough that fixing errors takes less time than typing the same text would have. For specialized vocabulary (medical, legal, code), dedicated tools like Dragon Professional or MetaWhisp outperform built-in dictation.
What is the difference between dictation and voice typing? Same feature, different vendor names. Apple calls it Dictation. Google calls it Voice Typing in Gboard. Microsoft calls it Voice Typing in Windows 11. All three convert speech into text in real time. The differences are in the underlying model and in privacy: Apple's runs on-device on modern hardware, Google's runs on-device on Pixel and recent phones with offline packs, Microsoft's runs through Azure cloud.
Does voice to text send my voice to the cloud? Sometimes. iPhone XS and newer plus Apple Silicon Macs run short-form dictation on-device for major languages. Windows 11 voice typing routes audio through Microsoft's cloud. Android Gboard runs on-device on Pixel devices and offline-pack-enabled devices, otherwise sends audio to Google. To verify privately, enable Airplane Mode and try dictation. Working dictation in Airplane Mode means local. Failing means cloud.
Can I turn on voice to text in any app? Yes, system-wide dictation works in any text field across the operating system. iOS dictation works wherever a keyboard appears. macOS dictation works in any text field. Windows 11 voice typing works in any focused text field. Android voice typing works through Gboard which appears in most text fields. Some banking and healthcare apps block third-party keyboards or dictation; those are intentional security restrictions.
How do I turn off voice to text? On iPhone: Settings → General → Keyboard → toggle off Enable Dictation. On Mac: System Settings → Keyboard → Dictation → toggle off. On Windows 11: voice typing closes when you close the panel; to disable globally, Settings → Privacy & security → Speech. On Android: Gboard Settings → Voice typing → toggle off. Disabling does not delete cached language packs unless you also clear them.
What is the best voice to text app overall? For Mac users who want system-wide on-device dictation, MetaWhisp running Whisper large-v3-turbo. For iPhone users, the built-in iOS dictation is excellent and free. For one-speaker professional dictation, Dragon Professional (Windows). For multi-speaker meeting transcription, Otter.ai. The "best" depends on whether privacy, accuracy, multi-speaker, or platform breadth matters most. See the best voice-to-text apps roundup for the side-by-side.
---

About the author

AD

Andrew Dyuzhov

CEO & Solo Founder, MetaWhisp

I run MetaWhisp, a free on-device voice-to-text app for Mac that runs Whisper large-v3-turbo on the Apple Neural Engine. I have spent four years studying every dictation system the major platforms ship — Apple, Microsoft, Google, Nuance, OpenAI. The setup paths in this guide are the ones I use myself when I move between an iPhone, a Mac, a Windows test machine, and an Android phone for testing.

If something here is wrong, the OS shipped a path change since publication, or you have a workflow that beats what I described, email me. I update this guide every quarter to track release changes.

What MetaWhisp adds for Mac users specifically:

  • Single-press push-to-talk hotkey (Right Option). No double-tap, no menu bar.
  • Whisper-grade accuracy (4-6% WER) vs the 6-8% built-in Mac dictation hits.
  • Works in any app, including ones where macOS dictation is sometimes disabled.
  • 30+ languages with auto-detect mid-sentence.
  • Verifiably on-device — zero network calls during transcription, auditable with Little Snitch.
  • Free forever for unlimited local use.

Follow the build journey on X (@hypersonq).

---

Mac users: get system-wide voice-to-text in 60 seconds

If the built-in Mac dictation is not enough, MetaWhisp gives you single-press push-to-talk in any app, runs entirely on-device, and is free forever for unlimited local use.

Download MetaWhisp
↓ Download this guide as PDF
Related reading: