- High-quality speech recognition using WhisperKit
- Multiple model sizes (216MB to 955MB) for different accuracy/speed trade-offs
- Support for multiple languages with auto-detection
- Real-time waveform visualization during recording
- Local LLM processing for grammar correction and text improvement
- Multiple AI models including Gemma, Llama, Qwen, and Mistral
- Custom prompts for different use cases:
- Grammar fixing and email formatting
- Language translation
- Custom text processing workflows
- 100% local processing - your voice never leaves your device
- No cloud services, no data collection
- Open source - audit the code yourself
- Secure sandboxed environment
- Global hotkey support (Option+Space by default)
- Auto-copy to clipboard
- Auto-paste functionality
- Auto-enter for instant message sending
- Menu bar integration
- Auto-stop recording after 10 minutes
- Beautiful dark-themed interface
- Real-time recording visualization
- Comprehensive onboarding guide
- Easy model management and downloads
- Customizable shortcuts and prompts
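The auto-stop behavior above can be sketched with a one-shot timer; `recorder` and `stopRecording()` are hypothetical stand-ins for the app's actual recording API:

```swift
import Foundation

// Sketch: stop recording automatically after 10 minutes.
let autoStopSeconds: TimeInterval = 10 * 60
let autoStopTimer = Timer.scheduledTimer(withTimeInterval: autoStopSeconds, repeats: false) { _ in
    recorder.stopRecording() // hypothetical AudioRecorder call
}
// If the user stops manually first, cancel the timer:
// autoStopTimer.invalidate()
```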
- macOS 14.0 or later
- 20GB free disk space (for AI models)
- Microphone access permission
- Accessibility permissions (for global hotkeys)
- Apple Events permissions (for clipboard operations)
- Visit whisperclip.com
- Download the latest release
- Drag WhisperClip.app to your Applications folder
- Follow the setup guide for permissions
```bash
# Clone the repository
git clone https://github.com/cydanix/whisperclip.git
cd whisperclip

# Build the app
./build.sh

# For development
./local_build.sh Debug
./local_run.sh Debug
```
- Launch WhisperClip from Applications or menu bar
- Grant permissions when prompted (microphone, accessibility)
- Download AI models through the setup guide
- Press Option+Space (or click Record) to start recording
- Press again to stop - text will be automatically copied to clipboard
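The auto-copy step boils down to placing the transcript on the system pasteboard. A minimal sketch using AppKit's `NSPasteboard` (the `transcript` value is a stand-in for the real transcription result):

```swift
import AppKit

// Put the transcribed text on the clipboard, replacing previous contents.
let transcript = "Hello from WhisperClip" // stand-in for the actual result
let pasteboard = NSPasteboard.general
pasteboard.clearContents()
pasteboard.setString(transcript, forType: .string)
```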
- Change hotkey: Settings → Hotkey preferences
- Add custom prompts: Settings → Prompts → Add new prompt
- Switch AI models: Setup Guide → Download different models
- Configure auto-actions: Settings → Enable auto-paste/auto-enter
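A global hotkey like Option+Space can be observed with a system-wide event monitor; this is a sketch only (the `toggleRecording()` handler is hypothetical), and it requires the Accessibility permission mentioned above:

```swift
import AppKit

// Listen for Option+Space anywhere on the system (keyCode 49 == Space).
NSEvent.addGlobalMonitorForEvents(matching: .keyDown) { event in
    if event.keyCode == 49 && event.modifierFlags.contains(.option) {
        toggleRecording() // hypothetical app function
    }
}
```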
- OpenAI Whisper Small (216MB) - Fast, good quality
- OpenAI Whisper Large v3 Turbo (632MB) - Best balance
- Distil Whisper Large v3 Turbo (600MB) - Optimized speed
- OpenAI Whisper Large v2 Turbo (955MB) - Maximum accuracy
- Gemma 2 (2B/9B) - Google's efficient models
- Llama 3/3.2 (3B/8B) - Meta's powerful models
- Qwen 2.5/3 (1.5B-8B) - Alibaba's multilingual models
- Mistral 7B - Mistral AI's high-quality model
- Phi 3.5 Mini - Microsoft's compact model
- DeepSeek R1 - Advanced reasoning model
All models run locally using MLX for Apple Silicon optimization.
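On-device transcription with WhisperKit looks roughly like the following. This is illustrative: the model identifier and call shapes follow WhisperKit's public API, but check the library's current README for exact signatures.

```swift
import WhisperKit

// Load a Whisper model and transcribe a local recording, fully on-device.
let pipe = try await WhisperKit(model: "openai_whisper-small")
let results = try await pipe.transcribe(audioPath: "recording.wav")
print(results.map(\.text).joined(separator: " "))
```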
WhisperClip is designed with privacy as its cornerstone:
- Local Processing Only: All voice recognition and AI processing happens on your device
- No Network Requests: Except for downloading models from Hugging Face
- No Analytics: No usage tracking, no telemetry, no data collection
- Open Source: Full transparency - inspect the code yourself
- Sandboxed: Runs in Apple's secure app sandbox
- Encrypted Storage: AI models stored securely on device
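The sandboxing and permission model above corresponds to App Sandbox entitlements roughly like these (the keys are standard macOS entitlements; the exact set the app ships may differ):

```xml
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE plist PUBLIC "-//Apple//DTD PLIST 1.0//EN" "http://www.apple.com/DTDs/PropertyList-1.0.dtd">
<plist version="1.0">
<dict>
    <!-- Run inside Apple's App Sandbox -->
    <key>com.apple.security.app-sandbox</key>
    <true/>
    <!-- Microphone access for recording -->
    <key>com.apple.security.device.audio-input</key>
    <true/>
    <!-- Outgoing connections only, for model downloads from Hugging Face -->
    <key>com.apple.security.network.client</key>
    <true/>
</dict>
</plist>
```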
```
Sources/
├── WhisperClip.swift    # Main app entry point
├── ContentView.swift    # Main UI interface
├── AudioRecorder.swift  # Voice recording logic
├── VoiceToText*.swift   # Transcription engine
├── LLM*.swift           # AI text enhancement
├── ModelStorage.swift   # Model management
├── SettingsStore.swift  # User preferences
└── HotkeyManager.swift  # Global shortcuts
```
- WhisperKit: Argmax's Whisper implementation, optimized for Apple platforms
- MLX: Apple Silicon ML framework
- MLX-Swift-Examples: LLM implementations
- Hub: Hugging Face model downloads
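These dependencies would appear in a Swift Package Manager manifest roughly as follows. The repository URLs are the upstream projects; the manifest shape and version pins are illustrative, not the project's actual `Package.swift`:

```swift
// swift-tools-version:5.9
import PackageDescription

let package = Package(
    name: "WhisperClip",
    platforms: [.macOS(.v14)],
    dependencies: [
        .package(url: "https://github.com/argmaxinc/WhisperKit.git", branch: "main"),
        .package(url: "https://github.com/ml-explore/mlx-swift-examples.git", branch: "main"),
        // The Hub module for model downloads ships in swift-transformers
        .package(url: "https://github.com/huggingface/swift-transformers.git", branch: "main"),
    ],
    targets: [
        .executableTarget(name: "WhisperClip", dependencies: [
            .product(name: "WhisperKit", package: "WhisperKit"),
        ])
    ]
)
```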
```bash
# Debug build
./local_build.sh Debug

# Release build with code signing
./build.sh

# Notarization (requires Apple Developer account)
./notarize.sh
```
We welcome contributions! Please see our contributing guidelines:
- Fork the repository
- Create a feature branch: `git checkout -b feature/amazing-feature`
- Make your changes and add tests
- Commit your changes: `git commit -m 'Add amazing feature'`
- Push to the branch: `git push origin feature/amazing-feature`
- Open a Pull Request
- New AI model integrations
- UI/UX improvements
- Performance optimizations
- Language support
- Accessibility features
- Documentation improvements
WhisperClip is licensed under the MIT License - see the LICENSE file for details.
This means you can:
- ✅ Use commercially
- ✅ Modify and distribute
- ✅ Use privately
- ✅ Fork and create derivatives
Attribution required: Please include the original license notice.
WhisperClip is developed by Cydanix LLC.
- Website: whisperclip.com
- Support: support@cydanix.com
- Version: 1.0.43
- Argmax - WhisperKit speech recognition framework
- Apple - MLX framework
- OpenAI - Original Whisper models
- Hugging Face - Model hosting and Hub library
- ML Community - Open source AI models (Gemma, Llama, Qwen, etc.)
Made with ❤️ for privacy-conscious users
⭐ Star this repo if you find it useful!