Local speech-to-text with MLX Whisper (Apple Silicon optimized, no API key).
Installation
$npx agent-skills-cli install @openclaw/mlx-whisper
Claude Code
Cursor
Copilot
Codex
Antigravity
Details
Repositoryopenclaw/skills
Pathskills/kevin37li/mlx-whisper/SKILL.md
Branchmain
Scoped Name@openclaw/mlx-whisper
Usage
After installing, this skill will be available to your AI coding assistant.
Verify installation:
npx agent-skills-cli listSkill Instructions
name: mlx-whisper version: 1.0.0 description: Local speech-to-text with MLX Whisper (Apple Silicon optimized, no API key). homepage: https://github.com/ml-explore/mlx-examples/tree/main/whisper metadata: {"clawdbot":{"emoji":"🍎","requires":{"bins":["mlx_whisper"]},"install":[{"id":"pip","kind":"pip","package":"mlx-whisper","bins":["mlx_whisper"],"label":"Install mlx-whisper (pip)"}]}}
MLX Whisper
Local speech-to-text using Apple MLX, optimized for Apple Silicon Macs.
Quick Start
mlx_whisper /path/to/audio.mp3 --model mlx-community/whisper-large-v3-turbo
Common Usage
# Transcribe to text file
mlx_whisper audio.m4a -f txt -o ./output
# Transcribe with language hint
mlx_whisper audio.mp3 --language en --model mlx-community/whisper-large-v3-turbo
# Generate subtitles (SRT)
mlx_whisper video.mp4 -f srt -o ./subs
# Translate to English
mlx_whisper foreign.mp3 --task translate
Models (download on first use)
| Model | Size | Speed | Quality |
|---|---|---|---|
| mlx-community/whisper-tiny | ~75MB | Fastest | Basic |
| mlx-community/whisper-base | ~140MB | Fast | Good |
| mlx-community/whisper-small | ~470MB | Medium | Better |
| mlx-community/whisper-medium | ~1.5GB | Slower | Great |
| mlx-community/whisper-large-v3 | ~3GB | Slowest | Best |
| mlx-community/whisper-large-v3-turbo | ~1.6GB | Fast | Excellent (Recommended) |
Notes
- Requires Apple Silicon Mac (M1/M2/M3/M4)
- Models cache to
~/.cache/huggingface/ - Default model is
mlx-community/whisper-tiny; use--model mlx-community/whisper-large-v3-turbofor best results
More by openclaw
View allsongsee
350,505Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.
camsnap
350,505Capture frames or clips from RTSP/ONVIF cameras.
openhue
350,505Control Philips Hue lights and scenes via the OpenHue CLI.
mcporter
350,505Use the mcporter CLI to list, configure, auth, and call MCP servers/tools directly (HTTP or stdio), including ad-hoc servers, config edits, and CLI/type generation.
