elevenlabs-mcp
The ElevenLabs MCP server allows integration of advanced Text to Speech and audio processing APIs with MCP clients, enabling features like speech generation, voice cloning, and audio transcription. It is designed for compatibility with a variety of clients including Claude Desktop and others.
Overview
Official ElevenLabs Model Context Protocol (MCP) server for interacting with Text to Speech and audio processing APIs. The server facilitates MCP clients such as Claude Desktop, Cursor, Windsurf, and OpenAI Agents to generate speech, clone voices, transcribe audio, and more. Features include configuring for various clients, API key management, and voice transformation. Additional functions are provided through optional environment variables and client settings.
Quickstart with Claude Desktop
- Obtain an API key from ElevenLabs.
- Install
uv
package manager. - Configure Claude Desktop with necessary settings.
Other MCP clients
Instructions for setting up other MCP clients like Cursor and Windsurf.
Example usage
- Create AI agents with specific voice styles.
- Convert recordings to different voice characters.
- Create soundscapes and transcribe audio with voice identification.
Optional features
Allows customization of file paths using environment variables.
Contributing
Details on how to clone the repo, set up a virtual environment, and run tests locally.
Troubleshooting
Solutions for timeouts and errors related to the command path.