Extract audio from video files to WAV format. Use when you need to analyze audio from video, prepare audio for energy calculation, or convert video audio to standard format for processing.
Installation
Details
Usage
After installing, this skill will be available to your AI coding assistant.
Verify installation:
npx agent-skills-cli listSkill Instructions
name: audio-extractor description: Extract audio from video files to WAV format. Use when you need to analyze audio from video, prepare audio for energy calculation, or convert video audio to standard format for processing.
Audio Extractor
Extracts audio from video files to WAV format for further analysis. Converts to mono 16kHz PCM format optimized for speech/energy analysis.
Use Cases
- Extracting audio for speech analysis
- Preparing audio for energy calculation
- Converting video audio to standard format
Usage
python3 /root/.claude/skills/audio-extractor/scripts/extract_audio.py \
--video /path/to/video.mp4 \
--output /path/to/audio.wav
Parameters
--video: Path to input video file--output: Path to output WAV file--sample-rate: Audio sample rate in Hz (default: 16000)--duration: Optional duration limit in seconds (default: full video)
Output Format
- Format: WAV (PCM 16-bit signed)
- Channels: Mono
- Sample rate: 16000 Hz (default)
Dependencies
- ffmpeg
Example
# Extract first 10 minutes of audio
python3 /root/.claude/skills/audio-extractor/scripts/extract_audio.py \
--video lecture.mp4 \
--duration 600 \
--output audio.wav
Notes
- Output is always mono for consistent analysis
- 16kHz sample rate is sufficient for speech analysis and reduces file size
- Supports any video format that ffmpeg can read
More by benchflow-ai
View allDetect pauses and silence in audio using local dynamic thresholds. Use when you need to find natural pauses in lectures, board-writing silences, or breaks between sections. Uses local context comparison to avoid false positives from volume variation.
Calculate per-second RMS energy from audio files. Use when you need to analyze audio volume patterns, prepare data for silence/pause detection, or create an energy profile for audio analysis tasks.
Process videos by removing segments and concatenating remaining parts. Use when you need to remove detected pauses/openings from videos, create highlight reels, or batch process segment removals using ffmpeg filter_complex.
Combine multiple segment detection results into a unified list. Use when you need to merge segments from different detectors, prepare removal lists for video processing, or consolidate detection outputs.
