Free Real-Time Speech Translation for macOS and Windows – My Translator

A free desktop app that translates system audio and mic input in real time. Supports 70+ languages, TTS narration, and offline mode for Apple Silicon.

My Translator is a free, open-source real-time speech translation app that captures audio from your system or microphone, transcribes it, and displays the translation on the screen.

The app is fully free for Apple Silicon Mac users. The built-in offline local mode runs entirely on-device using MLX, Whisper, and Gemma. No API key and internet connection required.

For users who want cloud-quality translation, My Translator connects directly to the Soniox speech recognition API at roughly $0.12 per hour of audio. A $10 deposit covers 80-plus hours of active use.

Text-to-speech narration is available through three AI providers, one of which (Microsoft Edge TTS) is completely free and requires no API key. So macOS users on Apple Silicon can run the full feature set at zero ongoing cost.

Features

  • Captures system audio and microphone input simultaneously.
  • Transcribes and translates audio with approximately 2–3 seconds of latency in cloud mode.
  • Supports 70+ source languages with translation into any target language.
  • Runs an experimental local offline mode on Apple Silicon using MLX, Whisper, and Gemma, with no internet required.
  • Detects multiple speakers automatically and labels them in the transcript.
  • Displays output in a dual-panel view: source text on the left, translation on the right, with independent scrolling for each panel.
  • Reads translations aloud through one of three TTS providers: Edge TTS (free), Google Chirp 3 HD, or ElevenLabs.
  • Saves transcripts locally as Markdown or plain text files, one file per session.
  • Accepts custom translation terms so domain-specific vocabulary (medical, legal, religious) renders consistently.

Use Cases

  • Follow a foreign-language lecture, webinar, or YouTube video in real time.
  • Translate a live Zoom or Teams meeting where participants speak a language you do not read fluently.
  • Use the dual-panel view to study a language by reading both the source text and the translation side by side as a podcast plays.
  • Enable TTS narration during a presentation so the translated speech plays aloud for an audience listening in a different language.

How To Use It

Install on macOS

1. Download the .dmg file from the Releases page.

  • My.Translator_x.x.x_aarch64.dmg for Apple Silicon (M1/M2/M3/M4)
  • My.Translator_x.x.x_x64.dmg for Intel Mac

2. Open the .dmg, drag My Translator to your Applications folder, and eject the disk image.

3. On first launch, macOS will ask for Screen & System Audio Recording permission. Click Open System Settings, find My Translator in the list, toggle the switch on, then click Quit & Reopen when macOS prompts you.

Install on Windows

1. Download the .exe installer from the Releases page. Choose x64 for Intel/AMD machines or arm64 for Surface Pro X and Snapdragon devices.

2. Windows SmartScreen will display a warning because the app does not yet hold a Windows code-signing certificate. Click More info, then click Run anyway to proceed through the setup wizard. Accept the default install location and check Run My Translator on the final screen.

3. If the app does not launch, install the Microsoft Edge WebView2 Runtime separately.

Get a Soniox API Key

Create a free account at console.soniox.com. Add a payment method under Billing, deposit a minimum of $10, then go to API Keys and click Create API Key. Copy the key; it starts with soniox_.

Configure the App

Open Settings with ⌘ , (macOS) or Ctrl+, (Windows). Paste your Soniox API key into the API key field. Set the source language and target language, then choose your translation engine:

ModeLatencyQualityCostInternet Required
Soniox API (Cloud)~2–3s9/10~$0.12/hrYes
Local MLX (Offline)~10s7/10FreeNo

Local MLX mode requires Apple Silicon (M1 or later) and about 5 GB of free disk space. Models download automatically on first use.

Click Save & Close.

Start Translating

Press ⌘ Enter (macOS) or Ctrl+Enter (Windows) to start. Play audio on your computer. Translations appear in the overlay in real time.

Switch between view modes with the panel button at the bottom-right corner of the overlay (hover to reveal it):

  • Single (default): Translation text only.
  • Dual: Source text and translation side by side.

Adjust font size with the A-/A+ buttons (also revealed on hover).

Set Up TTS Narration

TTS is off by default. Open Settings, go to the TTS tab, and pick a provider:

ProviderCostSpeed ControlSetup Required
Edge TTSFreeYes (-50% to +100%)None
Google Chirp 3 HDFree up to 1M chars/mo, then $30/1MYes (0.5x–2.0x)Google Cloud API key
ElevenLabs~$5–$22/moNoElevenLabs API key

Edge TTS setup (no key needed):

  1. Settings → TTS tab → Provider: Edge TTS.
  2. Pick a voice from the list.
  3. Click Save & Close.
  4. Toggle TTS on from the main screen with the TTS button or ⌘ T.

Google Chirp 3 HD setup:

  1. Go to console.cloud.google.com and create a project.
  2. Enable the Text-to-Speech API at console.cloud.google.com/apis/library/texttospeech.googleapis.com.
  3. Go to console.cloud.google.com/apis/credentials -> Create Credentials -> API Key.
  4. Restrict the key to Cloud Text-to-Speech API only (recommended).
  5. Paste the key in Settings -> TTS -> Google Chirp 3 HD -> Save & Close.

ElevenLabs setup:

  1. Create an account at elevenlabs.io and subscribe to the Starter plan ($5/month).
  2. Go to profile icon → API Keys → Create API Key.
  3. Paste the key in Settings → TTS → ElevenLabs → Save & Close.

Add Custom Translation Terms

Open Settings -> Translation -> Translation terms. Enter terms in Original = Translation format, one per line:

Original sin = ...
Christ = ...
Pneumonia = ...

The app applies these substitutions consistently across every session.

Keyboard Shortcuts

macOS

ShortcutAction
⌘ EnterStart / Stop
⌘ ,Open Settings
EscClose Settings
⌘ 1Switch to System Audio
⌘ 2Switch to Microphone
⌘ TToggle TTS narration

Windows

ShortcutAction
Ctrl+EnterStart / Stop
Ctrl+,Open Settings
EscClose Settings
Ctrl+TToggle TTS narration

Pros

  • Processes audio with low latency around two to three seconds.
  • Captures system audio directly.
  • You retain complete control over your API keys.
  • Edge TTS requires zero setup and zero cost.

Cons

  • The cloud translation mode requires a paid Soniox API account.
  • The offline local mode requires Apple Silicon.

Related Resources

FAQs

Q: Does My Translator record or store my audio on any external server?
A: No. The app connects directly to the Soniox API from your device. Transcripts save as local Markdown or text files on your own disk only.

Q: Can I use this app to translate Zoom or Google Meet calls?
A: Yes. Set the audio source to System Audio in Settings. The app captures everything playing through your Mac or Windows audio output.

Q: Why does the application show a “No API key” error?
A: You must generate a Soniox API key from their developer console and paste it into the general settings tab. The application requires this key to connect to the translation engine.

Q: How do I fix missing translation text on macOS?
A: You need to enable Screen and System Audio Recording in your macOS System Settings.

Q: Can I use this for live conversations in person?
A: Yes. Switch the audio source to Microphone and place the app near the speaker. For best results, use an external USB microphone positioned close to the speaker.

Leave a Reply

Your email address will not be published. Required fields are marked *

Get the latest & top AI tools sent directly to your email.

Subscribe now to explore the latest & top AI tools and resources, all in one convenient newsletter. No spam, we promise!