Skip to content

Conversation

Orinks
Copy link

@Orinks Orinks commented Aug 1, 2025

Summary

This PR adds the official ElevenLabs MCP server to the zenagents-library. The ElevenLabs MCP server provides comprehensive text-to-speech capabilities through the ElevenLabs API.

Server Details

Features

The ElevenLabs MCP server provides 14 different tools for comprehensive TTS functionality:

Core Tools

  • generate_speech - Generate speech audio from text using specified voice and model
  • get_voices - List all available voices for speech generation
  • get_models - List all available speech generation models

Voice Management

  • clone_voice - Clone a voice from provided audio samples
  • get_voice_settings - Get settings for a specific voice
  • edit_voice_settings - Modify settings for a specific voice
  • get_voice_samples - Get audio samples for a specific voice
  • delete_voice - Delete a custom voice

Account & History Management

  • get_user_info - Get current user account information and usage details
  • get_user_subscription - Get user subscription details and limits
  • get_history - Get speech generation history
  • get_history_item - Get details for a specific history item
  • delete_history_item - Delete a specific history item
  • download_history_items - Download audio files from generation history

Installation & Configuration

pip install elevenlabs-mcp
python -m elevenlabs_mcp

Requires ELEVENLABS_API_KEY environment variable.

Verification

  • ✅ Official server maintained by ElevenLabs
  • ✅ Comprehensive documentation available
  • ✅ Active development and support
  • ✅ Follows MCP server standards
  • ✅ Well-structured API with clear tool descriptions

This addition enhances the library with professional-grade text-to-speech capabilities, voice cloning, and comprehensive audio generation management.

This commit adds the official ElevenLabs MCP server to the zenagents-library. The ElevenLabs MCP server provides comprehensive text-to-speech capabilities including:

- Speech generation from text using various voices and models
- Voice management (list, clone, edit, delete voices)
- User account and subscription information
- Generation history management
- Voice settings and samples access

The server is officially maintained by ElevenLabs and requires an ElevenLabs API key for authentication. It's written in Python and can be installed via pip.

Features:
- 14 different tools for comprehensive TTS functionality
- Official and verified server
- Python-based implementation
- Comprehensive voice cloning capabilities
- Full generation history management
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant