mcp.film

mcp.film / Transcription & Captions / Sanzaru (Whisper MCP)

Sanzaru (Whisper MCP)

Communityby Community (TJC-LP)paidverified Jun 11, 2026

OpenAI Whisper/GPT-4o transcription, audio chat, and TTS in one local MCP server.

Rate it: no ratings yet

What it does

Community MCP server wrapping OpenAI's audio APIs: Whisper and GPT-4o transcription with enhancement templates, interactive 'chat with audio' analysis, format conversion/compression, and OpenAI TTS. A filmmaker-agent can transcribe and interrogate dailies, then generate scratch narration, all against a single OpenAI key.

Connect

Claude Code
claude mcp add sanzaru -e OPENAI_API_KEY=YOUR_KEY -e SANZARU_MEDIA_PATH=/absolute/path/to/media -- uvx "sanzaru[all]"
Auth: api_key · env OPENAI_API_KEY — https://platform.openai.com (API keys)

Tools you'll see

transcribe_audiotranscribe_with_enhancementchat_with_audiocreate_audioconvert_audiocompress_audio

Field notes

Successor to the widely-listed (now deprecated) arcaputo3/mcp-server-whisper. v0.6.2, Apr 2026, MIT. For fully local/offline transcription: jwulff/whisper-mcp and SmartLittleApps/local-stt-mcp (whisper.cpp, Apple Silicon, diarization).

Pairs well with