Skip to content

A lightweight macOS menu bar app for quick on-device audio transcription using Parakeet v3 Multilingual. Press a hotkey, record your thoughts, and get instant text that's automatically inserted in your active app.

License

Notifications You must be signed in to change notification settings

jonathanglasmeyer/FluidVoice

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

134 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

FluidVoice

FluidVoice Icon

Swift License CoreML MLX

macOS menu bar app for audio transcription that works system-wide in any application.

Two recording modes:

  • Toggle Mode: Press your shortcut (Right Option or your favorite modifier like Fn) to start, press again to stop
  • Hold to Speak: Hold your shortcut key, speak, release to transcribe

Text appears instantly in your current application—no copy/paste needed.

Status: Core features are functional. Planned enhancements tracked in docs/features/.

Key Advantages

Complete Privacy

  • 100% offline - Audio never leaves your device
  • No data collection - Zero telemetry, zero tracking
  • Open source - Verify for yourself what the code does

Intelligent Multilingual Support

  • Automatic language detection - Switch between languages mid-sentence
  • 25+ European languages with Parakeet v3 (German, French, Spanish, Italian, etc.)
  • No manual language switching - Just speak, FluidVoice adapts

Ultra-Fast Performance

  • Sub-second transcription: 0.1-0.3 second response times
  • 10x faster than traditional approaches with optimized model loading
  • Instant response - Models preloaded and ready
  • Real-time feel - Near-zero latency for short audio clips

Intelligent Vocabulary Correction

  • Ultra-fast pattern matching: 3-5ms correction (vs 1500-3000ms for LLM-based approaches)
  • Privacy-first processing: All vocabulary correction happens locally, zero network requests
  • Default presets: Includes common technical terms - "api" → "API", "github" → "GitHub", "typescript" → "TypeScript"
  • Domain adaptable: Easily customize for any field - legal, medical, scientific, or business terminology
  • JSONC configuration: Developer-friendly config with inline comments at ~/.config/fluidvoice/vocabulary.jsonc
  • Live reload: Edit vocabulary while FluidVoice runs - changes apply instantly

Features & Development

Local Transcription

  • Parakeet v3 Multilingual: NVIDIA's 600M parameter model optimized for speed and accuracy
    • 25 European languages with automatic detection
    • Daemon mode for zero cold-start latency
    • Sub-second transcription on Apple Silicon
    • Runs entirely offline via MLX

Documentation

Alternative Products

Looking for other voice transcription tools? Check out ALTERNATIVES.md for a detailed comparison of 8 competing products including VoiceInk, Spokenly, SuperWhisper, and more.

Requirements

  • macOS 14.0 (Sonoma) or later

Installation

Build from Source

  1. Install Xcode Command Line Tools: xcode-select --install
  2. Clone repository:
    git clone https://github.com/mazdak/FluidVoice.git
    cd FluidVoice
  3. Setup code signing (required for microphone access):
  4. Build: ./build.sh
  5. Run: Open FluidVoice.app

Why code signing? macOS requires signed apps for microphone permissions. Self-signed certificates work perfectly and cost nothing.

Development Workflow

Using just commands:

just                # List all available commands
just dev            # Build and run development version
just release        # Build release version and install to /Applications
just test           # Run tests
just logs           # Stream app logs
just kill           # Kill running app processes

Troubleshooting

"Unidentified Developer" Warning

  • Right-click the app and select "Open" instead of double-clicking
  • Click "Open" in the security dialog

Microphone Permission Issues

  • Go to System Settings → Privacy & Security → Microphone
  • Ensure FluidVoice is enabled
  • If permissions don't work after rebuilding, reset them:
    tccutil reset Microphone com.fluidvoice.app
  • Then restart the app and grant permission again

Parakeet Setup Issues

  • Click "Download Parakeet v3 Model" in settings
  • Check that download completed (~600MB)

Contributing

See CLAUDE.md for development setup and guidelines.

Acknowledgments

Based on mazdak/AudioWhisper. Built with SwiftUI, AppKit, MLX, and Parakeet v3. MIT License.

About

A lightweight macOS menu bar app for quick on-device audio transcription using Parakeet v3 Multilingual. Press a hotkey, record your thoughts, and get instant text that's automatically inserted in your active app.

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Languages

  • Swift 93.9%
  • Python 3.3%
  • Shell 2.6%
  • Just 0.2%