Skip to content

Conversation

Orinks
Copy link

@Orinks Orinks commented Aug 1, 2025

Summary

This PR adds the community ElevenLabs Sound Effects MCP server to the zenagents-library, complementing the official ElevenLabs TTS server. This provides users with comprehensive audio generation capabilities - both speech synthesis and sound effects creation.

Server Details

Features

Core Functionality

  • generate_sound_effect - Generate sound effects from text descriptions and save as MP3 files

Capabilities

  • Text-to-Sound-Effect Generation: Create any sound effect by describing it in text
    • "Thunder and lightning storm"
    • "Car engine starting"
    • "Ocean waves crashing"
    • "Footsteps on gravel"
    • "Birds chirping in forest"

Technical Features

  • Automatic File Management: Generated sounds saved as MP3 files
  • Configurable Output: Customizable save directory (default: ./sound_effects/)
  • Duration Control: Up to 22 seconds per sound effect
  • Prompt Influence: Adjust literal vs. creative interpretation
  • Smart Naming: Automatic filename generation from text prompts

Installation & Setup

# Clone and build from source
git clone https://github.com/zym9863/elevenlabs-sound-effect-server
cd elevenlabs-sound-effect-server
npm install
npm run build

# Run the server
node dist/index.js

Requirements:

  • Node.js ≥18.0.0
  • ElevenLabs API key (set as ELEVENLABS_API_KEY environment variable)

Complementary Servers

This server pairs perfectly with the official ElevenLabs MCP server:

Server Purpose Tools Output
Official ElevenLabs MCP Text-to-Speech 14 comprehensive tools Speech/Voice audio
Community Sound Effects Sound Effects 1 focused tool Environmental/ambient sounds

Together, they provide complete AI audio generation capabilities through ElevenLabs' platform.

Community Contribution

  • ✅ Well-maintained community project
  • ✅ Clear documentation and examples
  • ✅ Simple, focused functionality
  • ✅ Follows MCP server standards
  • ✅ Complements existing official server
  • ⚠️ Community project (not officially verified by ElevenLabs)

This addition enhances the library by providing sound effects generation capabilities, enabling users to create both speech and environmental audio content through ElevenLabs' AI platform.

Orinks added 2 commits July 31, 2025 21:03
This commit adds the official ElevenLabs MCP server to the zenagents-library. The ElevenLabs MCP server provides comprehensive text-to-speech capabilities including:

- Speech generation from text using various voices and models
- Voice management (list, clone, edit, delete voices)
- User account and subscription information
- Generation history management
- Voice settings and samples access

The server is officially maintained by ElevenLabs and requires an ElevenLabs API key for authentication. It's written in Python and can be installed via pip.

Features:
- 14 different tools for comprehensive TTS functionality
- Official and verified server
- Python-based implementation
- Comprehensive voice cloning capabilities
- Full generation history management
This commit adds the community ElevenLabs Sound Effects MCP server alongside the official ElevenLabs TTS server. This provides users with both text-to-speech and sound effects generation capabilities.

The ElevenLabs Sound Effects MCP server provides:

- Sound effect generation from text descriptions
- Automatic MP3 file saving to local filesystem
- Support for various sound types (ambient, mechanical, natural, etc.)
- Configurable duration and prompt influence
- Simple single-tool interface for ease of use

Key Features:
- Community-maintained TypeScript implementation
- Uses ElevenLabs Sound Effects API
- Automatic filename generation from prompts
- Local file storage with configurable directory
- Supports up to 22-second sound effects

Installation requires cloning the repository and building from source:
```bash
git clone https://github.com/zym9863/elevenlabs-sound-effect-server
cd elevenlabs-sound-effect-server
npm install && npm run build
```

This complements the official ElevenLabs MCP server by providing sound effects generation capabilities, giving users access to both speech synthesis and environmental/ambient sound creation through ElevenLabs' AI audio platform.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant