- On-device voice-to-text is HIPAA-compliant by default. If audio never leaves your Mac, there is no PHI transmission, so no BAA with any vendor is required.
- Dragon Medical One costs $99–$199/month. The underlying compute costs a fraction of that. Most of the price is legacy vertical pricing power, not value.
- Whisper large-v3-turbo (on-device) achieves 5–7% word error rate on general medical dictation. Dragon Medical is 3–5% on specialized vocabulary. The gap closes fast with custom terms.
- No EHR integration required. Voice-to-text with auto-paste works in Epic, Cerner, Athenahealth, or any text field in any app.
- Ambient AI scribes (Freed, Heidi) are different. They automate conversation-to-SOAP but require cloud + BAA. Pure dictation gives you control; ambient gives you automation. Pick based on your priorities.
- The real state of medical dictation in 2026
- HIPAA for voice tools, demystified
- The BAA math: why on-device simplifies everything
- On-device vs. cloud for medical dictation
- Dragon Medical, Freed AI, Heidi, MetaWhisp — honest comparison
- SOAP notes with voice: complete workflow
- EHR integration: Epic, Cerner, Athenahealth
- The real cost of Dragon Medical (and why it's high)
- How to set up HIPAA voice dictation in 15 minutes
- Real-world physician workflows
- FAQ
- About the author
The real state of medical dictation in 2026
Three things have shifted dramatically since 2023, and most compliance policies haven't caught up yet.Shift 1: On-device AI models became viable for medical work
Before 2023, accurate speech recognition required cloud infrastructure. The models were too large to run locally. That's why Dragon Medical, M*Modal, and Nuance built their entire businesses around cloud-processed audio with BAAs, server-side PHI handling, and enterprise IT workflows. Then OpenAI released Whisper (September 2022), then Whisper large-v3 (November 2023), then the distilled Whisper large-v3-turbo (October 2024). Apple released M-series chips with dedicated Neural Engines powerful enough to run these models in real-time. The result: a laptop built in 2023 or later can run production-grade medical speech recognition locally, with no cloud dependency. The technical argument for cloud-based medical dictation evaporated in a single hardware generation.Shift 2: HIPAA enforcement tightened around cloud vendors
The Office for Civil Rights (OCR) has been quietly auditing Business Associate Agreements more aggressively since 2024. Cloud vendors that process PHI are now under sharper scrutiny. Small practices in particular struggle because:- Most vendors require enterprise-level BAAs that small practices can't easily negotiate
- BAAs transfer liability but don't eliminate it — a breach at your vendor is still your problem
- Audit logs from cloud vendors are often not granular enough for OCR investigations
Shift 3: Ambient AI scribes exploded, but at a privacy cost
Freed AI, Heidi Health, Nabla, Suki, DeepScribe, Abridge — all launched or scaled dramatically in 2023–2025. They solve a real problem (automatic conversation-to-note generation during visits). But they all require:- Continuous microphone access during patient visits
- Audio + transcript upload to cloud servers
- Active BAA with the vendor
- Subscription fees typically $99–$299/month
HIPAA for voice tools, demystified
HIPAA breaks into three rules that matter for voice dictation:The Privacy Rule (who can see PHI)
Limits how covered entities (physicians, hospitals, health plans) and their Business Associates (vendors who process PHI) can use and disclose Protected Health Information. If your voice tool never sees PHI, this rule doesn't apply to your vendor.
The Security Rule (technical safeguards)
Requires "administrative, physical, and technical safeguards" for electronic PHI (ePHI). If ePHI exists on your Mac, you need encryption at rest (FileVault), authentication (your password/Touch ID), and audit controls. Your Mac provides all of this by default. If ePHI exists on a vendor's server, the vendor must also meet Security Rule requirements — typically via SOC 2 Type II, HITRUST, or similar audits.
The Breach Notification Rule
Requires notification of breaches involving 500+ patients within 60 days, plus reporting to OCR. If PHI never leaves your device, a breach scenario involves only your device — which is covered by your practice's general policies (device encryption, lost-device protocols, etc.).
- "HIPAA-compliant with BAA" — the vendor signs a BAA accepting Business Associate obligations. They handle PHI on your behalf. You remain the covered entity. This is how Dragon Medical, Freed AI, Heidi, etc. operate.
- "HIPAA-compliant by architecture" — the product is designed so that PHI never reaches the vendor. No BAA needed because no Business Associate relationship exists. This is how MetaWhisp (on-device mode) operates.
- "Not for PHI use" — the vendor explicitly disclaims medical use. Apple Dictation (default Siri-enhanced mode) is in this category. OpenAI's ChatGPT is in this category for voice input.
The BAA math: why on-device simplifies everything
I want to show you exactly why on-device is legally simpler. Here are the two compliance models side by side:| Step | Cloud voice-to-text (Dragon, Freed, Heidi) | On-device (MetaWhisp, Whisper Transcription) |
|---|---|---|
| 1. Vendor classification | Business Associate (handles PHI) | Not a Business Associate (never sees PHI) |
| 2. BAA requirement | Required before any PHI use | Not required |
| 3. Vendor Security Rule audit | SOC 2 / HITRUST required; practice must verify | N/A (no vendor-side PHI) |
| 4. Breach notification chain | Vendor → You → Patients → OCR | Just your device (standard IT protocols) |
| 5. Audit log requirements | Vendor must provide PHI access logs | Your Mac's standard logs suffice |
| 6. Data portability/deletion | Vendor must support on request | You own all data locally |
| 7. Subpoena/discovery | Vendor can be compelled separately | Only your device in scope |
On-device vs. cloud for medical dictation: real tradeoffs
Cloud isn't always wrong for medical work. Here's where each wins.| Dimension | On-device | Cloud |
|---|---|---|
| HIPAA posture | No BAA, no vendor exposure | BAA required, vendor in compliance chain |
| Per-minute cost | $0 after model download | $99–$199/user/month retail |
| Offline use (poor clinic wifi, rural sites) | Works anywhere | Requires internet |
| Speed on M3/M4 Mac | 200–500ms round-trip | 400–1500ms (network + queue) |
| General medical accuracy (WER) | 5–7% (Whisper-turbo) | 3–5% (Dragon Medical) |
| Highly specialized vocabulary | Improves with custom terms | 20+ years vertical training |
| Multi-provider shared templates | Manual sync | Centralized in vendor cloud |
| Real-time ambient listening during visits | Possible but higher battery | Purpose-built apps exist (Freed, Heidi) |
| Audit trail for enterprise compliance | Your device logs only | Vendor provides centralized logs |
| Staff turnover / offboarding | No vendor access to remove | Must manage vendor accounts |
My honest recommendation
For solo practitioners, small practices, and specialties where privacy is paramount (mental health, reproductive health, HIV care, addiction medicine, adolescent care): on-device wins on every axis that matters. The accuracy gap on general dictation is 2–3 percentage points — clinically insignificant for most cases and closable with custom vocabulary. For large hospital systems that need centralized audit logs, multi-provider template sharing, real-time ambient scribe, and have existing Dragon contracts with enterprise IT teams: cloud tools may fit your infrastructure better. The compliance burden is manageable with the right vendor relationship. For everything in between: try on-device first. It's free. Switch back if it doesn't work for your specific specialty. ---Dragon Medical, Freed AI, Heidi Health, MetaWhisp — honest comparison
Let me name specific products and tell you what they do well and what they don't.Dragon Medical One (Nuance / Microsoft)
- What it does: Real-time cloud speech recognition purpose-built for medical vocabulary. The category incumbent.
- Price: $99–$199/user/month depending on specialty module and volume
- HIPAA: BAA with Microsoft required
- Strengths: Best-in-class accuracy on highly specialized vocabulary. Deep EHR integrations. 20+ years of medical vertical data.
- Weaknesses: Most expensive option. Cloud-dependent. Integration and change management overhead. Pricing power comes from switching costs, not ongoing innovation.
- When it wins: Large hospital systems, highly specialized practices (interventional radiology, surgery), practices with established Dragon workflows and dedicated IT.
Freed AI
- What it does: Ambient AI scribe — listens to patient visits, automatically generates SOAP notes. Not pure dictation.
- Price: $99/month
- HIPAA: BAA available
- Strengths: Automates note generation. Saves 1–2 hours per day for busy clinicians. Good for primary care with repetitive visit patterns.
- Weaknesses: Cloud-only. Requires continuous ambient listening. May feel intrusive to patients in certain specialties. Less control over final note structure.
- When it wins: Primary care and outpatient with high patient volume, willing to trade privacy (with BAA) for automation.
Heidi Health
- What it does: Similar to Freed AI. Ambient scribe with template support.
- Price: Free tier + paid plans $99–$199/month
- HIPAA: BAA available
- Strengths: Has a free tier (limited). Template system is flexible.
- Weaknesses: Same cloud tradeoffs as Freed. Accuracy varies by specialty.
- When it wins: Clinicians who want to try ambient scribe without upfront commitment.
Apple Dictation (built into macOS)
- What it does: macOS native dictation. Siri-enhanced mode.
- Price: Free (included with macOS)
- HIPAA: Apple does not offer a HIPAA BAA for Siri/Dictation. Not suitable for PHI.
- Strengths: Free, built-in, on-device for enhanced dictation on Apple Silicon.
- Weaknesses: Lower accuracy on medical terminology. No custom vocabulary. Limited to standard text fields (doesn't work reliably in Epic web client, many EHR portals).
- When it wins: Non-PHI dictation (administrative letters, CME notes, personal writing).
MetaWhisp (the one I built)
- What it does: On-device real-time voice-to-text with auto-paste into any app. Core is free, optional cloud tier.
- Price: $0 for unlimited on-device use. $30/year optional cloud.
- HIPAA: On-device mode requires no BAA (PHI never transmitted). Cloud mode has BAA available on request for Pro users.
- Strengths: Free for core use. HIPAA-compliant by architecture. Works in any EHR via auto-paste. Custom vocabulary. Learns your corrections. No vendor dependency.
- Weaknesses: Not specialty-fine-tuned like Dragon. No ambient scribe mode (dictation only). No centralized multi-provider admin console. Built by a solo founder — smaller team than enterprise competitors.
- When it wins: Solo practices, small clinics, specialties handling sensitive data, physicians who want $0 compliance cost, clinicians who already use Apple Silicon Macs.
Full comparison table
| Tool | On-device? | Price | BAA needed? | Mode | Best for |
|---|---|---|---|---|---|
| MetaWhisp | Yes (default) | $0 local / $30/yr cloud | No (on-device) | Dictation | Solo, small, privacy-sensitive |
| Dragon Medical One | No | $99–$199/mo | Yes | Dictation | Hospital systems, super-specialties |
| Freed AI | No | $99/mo | Yes | Ambient scribe | High-volume primary care |
| Heidi Health | No | $0–$199/mo | Yes | Ambient scribe | Trying ambient without commitment |
| Nabla Copilot | No | $119/mo | Yes | Ambient scribe | Enterprise ambient deployment |
| Apple Dictation | Partial | Free | Not for PHI | Dictation | Non-clinical writing only |
| Whisper Transcription | Yes | Free | No | File-based | Recorded interviews, not real-time |
SOAP notes with voice: complete workflow
Here's the workflow used by the physicians I interviewed and by the users who've adopted MetaWhisp for medical dictation. Adjust to your specialty.Option A: Between-visit dictation (recommended starting point)
Finish the visit. Walk the patient out.
Return to your office or workstation. Open the patient's chart in your EHR.
Press your MetaWhisp hotkey. Dictate the S.
"Subjective. Mrs. Johnson is a 58-year-old female presenting with a two-week history of exertional dyspnea and new-onset cough productive of clear sputum. Denies fever, chills, hemoptysis." Release the hotkey. Text pastes into the note field.
Dictate the O.
"Objective. Vital signs stable. Temperature 98.4. Blood pressure 132/78. Lungs with bibasilar crackles, no wheezing. Cardiac exam with regular rate and rhythm, no murmurs." Release, paste.
Dictate A and P.
"Assessment. Likely community-acquired pneumonia. Plan. Chest X-ray, CBC with differential, start amoxicillin 500 milligrams three times daily for seven days, follow up in one week or sooner if worsening." Release, paste.
Review, click Sign.
Total time per note: 45–90 seconds for a typical outpatient visit. Same as typing but without finger fatigue, and you can walk while dictating if your setup allows.
Option B: End-of-day batch (for procedure-heavy or complex days)
Keep brief shorthand notes during the day. End of clinic, use voice to expand each into full SOAP. Pros: uninterrupted patient time. Cons: details fade if you batch too many.Option C: Ambient-style real-time narration during visit
Narrate findings aloud during the exam. Patient acceptance varies by specialty. Works well in procedural specialties (dermatology skin exams, ophthalmology), less well in sensitive conversations (psych, OB/GYN). A brief explanation typically resolves concerns: "I'm using voice-to-text on my laptop so I can focus on you instead of typing. Nothing is recorded or sent anywhere — it all stays on this computer." Most patients appreciate the transparency. ---EHR integration: Epic, Cerner, Athenahealth, eClinicalWorks
One of the biggest misconceptions is that voice-to-text requires vendor-specific EHR integration. For on-device tools using auto-paste, it doesn't.How auto-paste works
When you press the MetaWhisp hotkey, speak, and release:- The app transcribes your audio locally (200–500ms on M3/M4)
- It places the resulting text on your system clipboard via NSPasteboard
- It simulates a Cmd+V keystroke via the macOS Accessibility API
- The text pastes into whatever field has focus — Epic, Cerner, Athenahealth, or any other app
- Your prior clipboard contents are restored
By EHR, what to expect
| EHR | Works with MetaWhisp? | Notes |
|---|---|---|
| Epic (Hyperspace / Web) | Yes | Works in all note fields, SmartPhrases, and free-text sections. No admin approval needed (user-level tool). |
| Cerner / Oracle Health | Yes | Works in PowerChart notes and documentation fields. |
| Athenahealth (web) | Yes | Works in all web-based note fields. |
| eClinicalWorks | Yes | Works throughout the app including progress notes. |
| NextGen | Yes | Standard text fields accept pasted input. |
| Practice Fusion | Yes | Web-based; all note fields work. |
| DrChrono | Yes | Both iOS and web versions accept pasted text. |
| Kareo / Tebra | Yes | Text fields throughout the app. |
What it does NOT do
MetaWhisp is a dictation engine, not an EHR automation platform. It won't:- Auto-fill templates or SmartPhrases (though you can dictate into them)
- Place orders or prescriptions automatically
- Create structured data (ICD-10 codes, CPT codes) from narrative
- Replace ambient scribe AI (Freed, Heidi) for conversation-to-note automation
The real cost of Dragon Medical (and why it's high)
I did the math. Here's what you're paying for with Dragon Medical vs. what it actually costs to run.Dragon Medical One pricing (2026)
- Individual: $99–$199/user/month depending on specialty module
- Enterprise: typically $800–$1,800/user/year with volume discount
- Plus: training, initial setup, EHR integration fees (one-time $200–$1,000)
- Plus: ongoing support contracts
What's in that price
- Compute: Whisper-scale real-time speech recognition on cloud GPUs costs roughly $2/user/month at scale (see our pricing analysis).
- Specialty vocabulary: 20+ years of medical text fine-tuning. This is real value for rare vocabularies. Maybe worth $10–$20/user/month.
- EHR integration engineering: Dragon has deep hooks into Epic and Cerner. Worth $5–$10/user/month for large systems; worth $0 for practices that don't need it.
- Support and training: Enterprise accounts get dedicated CSMs. Worth maybe $10/user/month for large deployments, $2/user/month for small.
- Margin + legacy pricing power: Everything above adds to maybe $30–$40/user/month actual value delivered. The rest ($60–$160/month) is pricing power from being the incumbent with high switching costs.
Five-year cost comparison
Assume 1 physician, $99/month Dragon subscription:| Option | Year 1 | 5-year total | Difference |
|---|---|---|---|
| Dragon Medical One ($99/mo) | $1,188 | $5,940 | — |
| Dragon Medical Pro ($199/mo) | $2,388 | $11,940 | +$6,000 |
| MetaWhisp (on-device) | $0 | $0 | −$5,940 |
| MetaWhisp + optional cloud ($30/yr) | $30 | $150 | −$5,790 |
How to set up HIPAA voice dictation in 15 minutes
Confirm Mac compatibility
You need Apple Silicon (M1 or later) running macOS 14 Sonoma or later. Intel Macs cannot run the local model fast enough for real-time dictation. For medical use, I recommend M2 Pro or better for the best speed.
Download and install MetaWhisp
Visit metawhisp.com/download. Drag to Applications. First launch triggers a macOS security check on Apple Silicon — right-click and Open to approve the app.
Grant permissions
macOS will request: Microphone (to capture audio), Accessibility (to paste text into any app), and Input Monitoring (for the global hotkey). All three are required. Grant in System Settings → Privacy & Security.
Download the on-device model
On first launch, MetaWhisp downloads Whisper large-v3-turbo (~1.5 GB). This is the only network activity the app performs. After this one-time download, zero network calls occur during transcription. Airplane mode test: works.
Configure hotkey for clinical use
Default is Right Option. For medical dictation, I recommend push-to-talk mode (hold to record, release to transcribe). This prevents accidental recording between patients. You can also assign a different key if Right Option conflicts with your keyboard shortcuts.
Add your specialty vocabulary
Open Settings → Vocabulary. Add 20–50 terms you dictate frequently: medications you prescribe, procedure codes, abbreviations, patient population terms. Examples for primary care:
- Medications: amoxicillin, azithromycin, metformin, lisinopril, levothyroxine
- Conditions: dyspnea, hemoptysis, lymphadenopathy, hypertension, diabetes
- Codes/abbrevs: CBC, CMP, TSH, A1c, PCP, STEMI
Add more as you notice misrecognitions during actual use.
Test in your EHR
Open your EHR normally. Click into a note field. Press and hold your hotkey. Dictate a full sentence. Release. Text should paste directly into the field. If it doesn't paste, verify Accessibility permission is granted.
Establish your workflow
Start with between-visit dictation for a week. Measure: time saved vs. typing, accuracy per 100 words, any terms that need to be added to custom vocabulary. Most physicians are faster than typing within 2–3 days.
Verify HIPAA posture with your compliance officer
For documentation, provide your compliance officer:
- A copy of this guide (or relevant sections)
- MetaWhisp's privacy policy stating no network calls during transcription
- A demonstration using Little Snitch or macOS Network Monitor showing zero egress during dictation
Most compliance officers approve within a week once they verify the architecture.
Real-world physician workflows
These are composite profiles drawn from conversations with physicians using on-device voice-to-text. Names changed, details generalized. If you're a doctor using MetaWhisp and want to be quoted by name, email me — I'll update this section.Previously paid $119/month for Dragon Medical. Switched to MetaWhisp after his practice administrator flagged the subscription as his biggest recurring software cost.
Workflow: Between-visit dictation in Athenahealth web client. Custom vocabulary with 43 terms specific to his patient population (rural Ohio, lots of diabetes and COPD management). Uses Rewrite mode for patient-facing letters, Raw mode for clinical notes.
Outcome: Saving $1,428/year. Clinical accuracy comparable for common conditions. Added specialty terms in the first two weeks as he encountered them.
Refused cloud-based tools on principle — psych notes include sensitive content (substance use, trauma, medication adherence) that she does not want on any vendor's server, BAA or not.
Workflow: Narrative notes are ~400–800 words per visit. She dictates between visits in Practice Fusion, using Correct mode to fix grammar and punctuation. No custom vocabulary needed — Whisper handles psych terminology (DSM-5 diagnoses, SSRIs, psychotherapy modalities) cleanly.
Outcome: Dictation speed roughly 3× typing. Notes complete before next patient arrives. Full control over data.
Uses real-time ambient dictation during skin exams. Narrates findings as he examines: location, size, morphology, differential.
Workflow: Single hotkey press, dictate a lesion, paste into note. Template system in NextGen fills in the structural parts; voice-to-text handles the clinical descriptions.
Outcome: Previously spent 90 minutes at end-of-clinic catching up on notes. Now walks out the door at the same time as his last patient.
Works in an area with unreliable satellite internet. Dragon Medical was useless during outages — which happened weekly.
Workflow: MetaWhisp works offline indefinitely after the initial model download. Dictates full visits in the field, notes sync to Epic when connectivity returns.
Outcome: Went from "cloud tools don't work here" to "voice-to-text just works, period."
A note from a physician interview (doctor name anonymized on request)
"The thing nobody explained to me when I bought Dragon Medical was that the BAA doesn't protect me if they get breached. It just distributes the liability. With an on-device tool, there is nobody to breach. That's the thing that flipped my mind."
Frequently asked questions
Is on-device voice-to-text HIPAA-compliant?
Yes, on-device voice-to-text is inherently HIPAA-compliant when used correctly. HIPAA's Security Rule applies to Protected Health Information (PHI) in transit and at rest. If audio and transcripts never leave your device — no cloud upload, no third-party server — there is no PHI transmission to any Business Associate, which means no Business Associate Agreement (BAA) is required. You remain the sole custodian of the data. This is the cleanest compliance model available for voice dictation.Do I need a Business Associate Agreement (BAA) for MetaWhisp?
No BAA is needed when using MetaWhisp in its default on-device mode. A BAA is only required when PHI is shared with a Business Associate. Because MetaWhisp processes all audio locally on your Mac and stores nothing on our servers, we never receive or transmit PHI. No BAA is legally required for this mode. For the optional cloud tier, BAAs are available on request for Pro users.What's the difference between Dragon Medical and MetaWhisp?
Dragon Medical One is cloud-based, requires a BAA, costs $99–$199/month per user, and has 20+ years of medical vocabulary fine-tuning. MetaWhisp is on-device (no BAA needed), costs $0 for unlimited local use, and uses Whisper large-v3-turbo which is newer but shows comparable accuracy on general medical dictation. Dragon wins on highly specialized vocabularies (interventional radiology, rare disease). MetaWhisp wins on price, privacy, and offline use.Can MetaWhisp handle SOAP notes and medical terminology?
Yes. Whisper large-v3-turbo was trained on diverse audio including medical content. It handles standard medical terminology (anatomy, common medications, procedures) accurately out of the box. For highly specialized vocabulary, MetaWhisp has a custom vocabulary feature. Accuracy improves as the app learns from your corrections over time.Will MetaWhisp work with my EHR?
Yes, if your EHR accepts keyboard input — and they all do. MetaWhisp uses macOS Accessibility APIs to paste text into whatever field has focus. This works in Epic, Cerner, Athenahealth, eClinicalWorks, NextGen, Practice Fusion, DrChrono, Kareo/Tebra, and any other EHR. No integration setup required.How accurate is on-device transcription for medical dictation?
Whisper large-v3-turbo achieves roughly 5–7% word error rate on general medical dictation with clean audio. Dragon Medical One averages 3–5% on specialized medical vocabulary. After MetaWhisp learns your personal vocabulary, the gap closes for most common specialties. For sub-specialties with heavy niche vocabulary (interventional radiology, rare disease clinics), Dragon still has an edge.What does it cost to switch from Dragon Medical to MetaWhisp?
Approximately zero in direct cost. Dragon Medical One costs $99–$199/month per user. MetaWhisp is free for on-device use, or $30/year optional cloud. Training time is under 30 minutes. You keep your EHR, your workflow, your macros. Break-even is immediate. Typical savings: $1,188–$2,388 per physician per year.Is there a free HIPAA-compliant alternative to Dragon Medical?
Yes. MetaWhisp is free for unlimited on-device voice-to-text and is HIPAA-compliant by design. Apple's built-in Dictation is also free but lower accuracy on medical terms and not intended for PHI use per Apple's own documentation. Whisper Transcription is free for file-based workflows but lacks real-time paste into EHRs.Can I use voice-to-text during patient visits?
Yes, with two common approaches. (1) Between-visit dictation: 30–60 seconds per patient in your office between visits. Feels natural, saves hours at end-of-day. (2) Real-time ambient narration during visits: you narrate findings, patient hears and accepts with brief explanation. Works better in procedural specialties than sensitive conversations. For fully automated conversation-to-SOAP, use an ambient scribe like Freed or Heidi (different tool, different tradeoffs).Does MetaWhisp support specialty-specific vocabularies?
MetaWhisp's base model (Whisper large-v3-turbo) includes broad medical vocabulary from training data. You extend it via custom dictionary: add medication names, procedure codes, abbreviations specific to your practice. Examples: cardiology (STEMI, NSTEMI, PCI, cardiomyopathies), psychiatry (DSM-5 diagnoses, SSRIs, CBT terminology), orthopedics (procedure codes, anatomy, implant names). Users typically add 20–50 terms in the first week.What about Apple Dictation — isn't that already HIPAA-compliant?
No. Apple does not offer a HIPAA BAA for Siri or Apple Dictation. Their documentation explicitly states these services are not intended for PHI. Additionally, when "Enhanced Dictation" is disabled (default on older Macs), audio is sent to Apple servers for processing. For PHI use, you need either a true on-device tool (MetaWhisp, Whisper Transcription) or a medical-specific tool with a BAA (Dragon Medical, Freed AI).Can my compliance officer verify MetaWhisp is truly on-device?
Yes. Three verification methods: (1) Run the app in airplane mode — it works because the model is local. (2) Use Little Snitch or macOS Network Monitor during dictation — zero network activity will be recorded. (3) Read our privacy documentation which specifies zero network calls during transcription. For enterprise compliance, we can provide additional documentation on request.What happens if I switch specialties or add a new medication to my practice?
Add the new terms to your custom vocabulary in MetaWhisp settings. The model will use them as recognition hints. You can add, remove, or edit terms anytime. No retraining required, no waiting period.Does MetaWhisp store any data about my dictations?
No. Audio is captured into RAM, processed by the model, transcribed, and the RAM buffer is released. Transcripts are placed on your clipboard (which is your Mac's system clipboard, not anything MetaWhisp stores), pasted into your app, and then the clipboard is restored to its prior contents. MetaWhisp does not log, store, or transmit any audio or transcripts.Can multiple physicians in my practice use one MetaWhisp license?
Each physician needs their own installation on their own Mac (the app is free), but custom vocabularies and preferences are local per-machine. For Pro cloud features, licensing is per-user. For small practices (under 10 physicians), MetaWhisp free tier covers everything needed. For larger deployments, contact us for team pricing.About the author
Andrew Dyuzhov
CEO & Solo Founder, MetaWhisp
I'm not a doctor. I'm a solo founder who built a voice-to-text app, got a physician email asking about HIPAA compliance, and spent the next month learning medical dictation end-to-end.
This guide is the result. I'm actively interviewing physicians to make it better. If you practice medicine and want to contribute a real-world perspective — on any tool, not just MetaWhisp — email me.
MetaWhisp in short:
- Built by one person (me)
- 100% on-device by default — audio never leaves your Mac, which is why it's HIPAA-compliant by architecture
- Free forever for on-device use; $30/year optional cloud tier
- Works in any EHR via auto-paste
- Custom vocabulary, learns from corrections, no BAA needed for on-device mode
If something in this guide is wrong, inaccurate, or missing context, tell me. I would rather fix a mistake than look like I know what I'm doing.
Follow along on X (@hypersonq) — I post about product decisions, medical tool research, and what I'm learning from physician interviews.
Try MetaWhisp free for medical dictation
On-device, HIPAA-compliant by architecture, no BAA needed, no subscription. Works with every major EHR via auto-paste. Installs in under a minute.
Download MetaWhisp free →- How to Choose a Private Voice-to-Text App for Mac in 2026 — full framework for evaluating voice tools, not just medical
- Meeting Transcription Without a Bot — HIPAA-adjacent approach for recorded consultations and referral calls
- What Is Whisper large-v3-turbo? — technical deep-dive on the model powering MetaWhisp
- On-Device Transcription — how MetaWhisp processes speech locally using WhisperKit
- MetaWhisp Privacy Policy — official documentation for compliance review