audio-extractor

@benchflow-ai/audio-extractor

benchflow-ai

1,359

317 forks

Updated 6/17/2026

View on GitHub

Extract audio from video files to WAV format. Use when you need to analyze audio from video, prepare audio for energy calculation, or convert video audio to standard format for processing.

Installation

$npx agent-skills-cli install @benchflow-ai/audio-extractor

Claude Code

Cursor

Copilot

Codex

Antigravity

Details

Repositorybenchflow-ai/skillsbench

Pathtasks/video-silence-remover/environment/skills/audio-extractor/SKILL.md

Branchmain

Scoped Name@benchflow-ai/audio-extractor

Usage

After installing, this skill will be available to your AI coding assistant.

Verify installation:

npx agent-skills-cli list

Skill Instructions

name: audio-extractor description: Extract audio from video files to WAV format. Use when you need to analyze audio from video, prepare audio for energy calculation, or convert video audio to standard format for processing.

Audio Extractor

Extracts audio from video files to WAV format for further analysis. Converts to mono 16kHz PCM format optimized for speech/energy analysis.

Use Cases

Extracting audio for speech analysis
Preparing audio for energy calculation
Converting video audio to standard format

Usage

python3 /root/.claude/skills/audio-extractor/scripts/extract_audio.py \
    --video /path/to/video.mp4 \
    --output /path/to/audio.wav

Parameters

--video: Path to input video file
--output: Path to output WAV file
--sample-rate: Audio sample rate in Hz (default: 16000)
--duration: Optional duration limit in seconds (default: full video)

Output Format

Format: WAV (PCM 16-bit signed)
Channels: Mono
Sample rate: 16000 Hz (default)

Dependencies

ffmpeg

Example

# Extract first 10 minutes of audio
python3 /root/.claude/skills/audio-extractor/scripts/extract_audio.py \
    --video lecture.mp4 \
    --duration 600 \
    --output audio.wav

Notes

Output is always mono for consistent analysis
16kHz sample rate is sufficient for speech analysis and reduces file size
Supports any video format that ffmpeg can read

More by benchflow-ai

View all

pause-detector

1,359

Detect pauses and silence in audio using local dynamic thresholds. Use when you need to find natural pauses in lectures, board-writing silences, or breaks between sections. Uses local context comparison to avoid false positives from volume variation.

energy-calculator

1,359

Calculate per-second RMS energy from audio files. Use when you need to analyze audio volume patterns, prepare data for silence/pause detection, or create an energy profile for audio analysis tasks.

video-processor

1,359

Process videos by removing segments and concatenating remaining parts. Use when you need to remove detected pauses/openings from videos, create highlight reels, or batch process segment removals using ffmpeg filter_complex.

segment-combiner

1,359

Combine multiple segment detection results into a unified list. Use when you need to merge segments from different detectors, prepare removal lists for video processing, or consolidate detection outputs.