FAQ - MCP Server Whisper by arcaputo3

Supported formats are flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, and webm for transcription, and mp3 and wav for chat.
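As a rough illustration of the format lists above, the following sketch checks a file's extension against them before any upload is attempted. The constant and function names are hypothetical and are not part of the server's API; only the extensions themselves come from this FAQ.

```python
from pathlib import Path

# Extensions copied from the FAQ answer; the names below are illustrative only.
TRANSCRIPTION_FORMATS = {"flac", "mp3", "mp4", "mpeg", "mpga", "m4a", "ogg", "wav", "webm"}
CHAT_FORMATS = {"mp3", "wav"}


def supported_for(path: str) -> set[str]:
    """Return which uses ('transcription', 'chat') accept this file's extension."""
    ext = Path(path).suffix.lower().lstrip(".")
    uses = set()
    if ext in TRANSCRIPTION_FORMATS:
        uses.add("transcription")
    if ext in CHAT_FORMATS:
        uses.add("chat")
    return uses
```

A check like this only looks at the file name, so it cannot detect a file whose contents do not match its extension.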

Files larger than 25 MB are automatically compressed to meet the API's upload limit.
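The size rule can be pictured as a simple threshold test. This is a minimal sketch, not the server's actual logic: the function names are hypothetical, and it assumes "25MB" means 25 × 1024 × 1024 bytes, which the FAQ does not specify.

```python
import os

# Assumption: the 25MB limit is interpreted as 25 MiB; the FAQ does not say which.
MAX_UPLOAD_BYTES = 25 * 1024 * 1024


def needs_compression(size_bytes: int, limit: int = MAX_UPLOAD_BYTES) -> bool:
    """Return True when a file of this size exceeds the upload limit."""
    return size_bytes > limit


def file_needs_compression(path: str) -> bool:
    """Same check, reading the size from a file on disk."""
    return needs_compression(os.path.getsize(path))
```

How the compression itself is performed is not described in this FAQ, so it is left out of the sketch.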

Yes, text-to-speech generation is customizable: you can choose the voice, adjust the speed, and provide specific instructions.

The server supports OpenAI's whisper-1, gpt-4o-transcribe, and gpt-4o-mini-transcribe models.

Yes, the server can process multiple audio files in parallel as a batch.
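Parallel batch processing of this kind is commonly built on concurrent tasks with a cap on how many run at once. The sketch below shows that general pattern with asyncio; it is not the server's implementation. `fake_transcribe` is a hypothetical stand-in for a real transcription call, and `max_concurrency` is an assumed parameter.

```python
import asyncio


async def fake_transcribe(path: str) -> str:
    """Hypothetical stand-in for a real transcription request."""
    await asyncio.sleep(0.01)  # simulate network latency
    return f"transcript of {path}"


async def transcribe_batch(paths, worker=fake_transcribe, max_concurrency=4):
    """Run the worker on every path concurrently, at most max_concurrency at a time."""
    semaphore = asyncio.Semaphore(max_concurrency)

    async def run_one(path):
        async with semaphore:
            return await worker(path)

    # gather returns results in the same order as the input paths
    return await asyncio.gather(*(run_one(p) for p in paths))
```

Calling `asyncio.run(transcribe_batch(["a.mp3", "b.wav"]))` returns one result per input file, in input order. The concurrency cap keeps a large batch from issuing every request at once.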