MMAudio MCP
Enables AI-powered video-to-audio and text-to-audio generation using MMAudio's API. Create synchronized audio from video content or generate audio from text descriptions with configurable parameters.
README
MMAudio MCP
AI-powered video-to-audio and text-to-audio generation using MMAudio's advanced AI technology. This Desktop Extension implements the Model Context Protocol (MCP) to provide seamless integration with Cursor, Claude Desktop, and other MCP-compatible clients.
🌟 Features
- Video-to-Audio Generation: Transform video content into synchronized audio with AI-powered analysis
- Text-to-Audio Generation: Create professional audio content from text descriptions
- API Key Validation: Verify MMAudio API credentials and check account status
- MCP Protocol: Full Model Context Protocol implementation for universal compatibility
- Error Handling: Comprehensive error handling with detailed feedback
- Type Safety: Full TypeScript support with Zod schema validation
- Configurable: Flexible configuration options for different use cases
🚀 Quick Start
Prerequisites
- Node.js >= 18.0.0
- MMAudio API key (Get one here)
- MCP-compatible client (Cursor, Claude Desktop, etc.)
Installation
-
Clone or download this extension:
cd mcp npm install -
Configure your API key:
# Set environment variable export MMAUDIO_API_KEY="sk-your-api-key-here" # Or create a config.js file (copy from config.example.js) cp config.example.js config.js # Edit config.js with your API key -
Test the installation:
npm start -
Link the package:
npm link
Usage with Cursor
-
Add to your Cursor MCP configuration:
{ "mcpServers": { "mmaudio": { "command": "npx", "args": ["-y", "mmaudio-mcp@latest"], "env": { "MMAUDIO_API_KEY": "sk-your-api-key-here" } } } } -
Restart Cursor and the MMAudio tools will be available in your AI assistant.
Usage with Claude Desktop
-
Add to your Claude Desktop configuration (
claude_desktop_config.json):{ "mcpServers": { "mmaudio": { "command": "npx", "args": ["-y", "mmaudio-mcp@latest"], "env": { "MMAUDIO_API_KEY": "sk-your-api-key-here" } } } } -
Restart Claude Desktop to load the extension.
🛠️ Available Tools
1. Video-to-Audio Generation
Generate AI-powered audio from video content.
Tool Name: video_to_audio
Parameters:
video_url(required): URL of the video fileprompt(required): Description of the audio you want to generatenegative_prompt(optional): What to avoid in the generated audioduration(optional): Audio duration in seconds (1-30, default: 8)num_steps(optional): Number of inference steps (1-50, default: 25)cfg_strength(optional): Guidance strength (1-10, default: 4.5)seed(optional): Random seed for reproducible results
Example:
Generate audio for this video: https://example.com/video.mp4 with the prompt "peaceful forest sounds with birds chirping and gentle wind"
2. Text-to-Audio Generation
Create audio content from text descriptions.
Tool Name: text_to_audio
Parameters:
prompt(required): Description of the audio you want to generateduration(optional): Audio duration in seconds (1-30, default: 8)num_steps(optional): Number of inference steps (1-50, default: 25)cfg_strength(optional): Guidance strength (1-10, default: 4.5)negative_prompt(optional): What to avoid in the generated audioseed(optional): Random seed for reproducible results
Example:
Create audio with the description "coffee shop ambiance with gentle chatter and espresso machine sounds"
3. API Key Validation
Validate your MMAudio API key and check account status.
Tool Name: validate_api_key
Parameters:
api_key(optional): API key to validate (uses configured key if not provided)
Example:
Validate my MMAudio API key
⚙️ Configuration
Environment Variables
| Variable | Description | Required | Default |
|---|---|---|---|
MMAUDIO_API_KEY |
Your MMAudio API key | Yes | - |
MMAUDIO_BASE_URL |
Base URL for MMAudio API | No | https://mmaudio.net |
MMAUDIO_TIMEOUT |
Request timeout in milliseconds | No | 60000 |
LOG_LEVEL |
Log level (error, warn, info, debug) | No | error |
DEBUG |
Enable debug mode | No | false |
Configuration File
You can also use a config.js file for configuration:
export const config = {
apiKey: "sk-your-api-key-here",
baseUrl: "https://mmaudio.net",
timeout: 60000,
// ... other options
};
📝 Example Usage
Video-to-Audio Example
// In your MCP client (Cursor, Claude Desktop, etc.)
"Please generate audio for this video URL: https://example.com/nature_video.mp4";
"I want forest sounds with birds chirping and a gentle breeze";
"Duration should be 10 seconds";
Text-to-Audio Example
// Create ambient audio
"Generate 15 seconds of coffee shop ambiance with gentle background chatter";
// Create sound effects
"Create the sound of rain falling on a wooden roof for 8 seconds";
// Create atmospheric audio
"Generate futuristic sci-fi ambient sounds for a space station";
🔧 Development
Project Structure
mcp/
├── server/index.js # Main MCP server implementation
├── package.json # Node.js dependencies and scripts
├── manifest.json # DXT extension manifest
├── config.example.js # Configuration example
└── README.md # This file
Testing
-
Test the server directly:
npm start -
Test with a simple MCP client:
echo '{"jsonrpc": "2.0", "method": "tools/list", "id": 1}' | npm start
Development Mode
Run the server in development mode with debugging:
npm run dev
This enables additional logging and the Node.js inspector for debugging.
🚨 Troubleshooting
Common Issues
1. "API key is required" Error
Problem: The extension can't find your API key.
Solution:
- Ensure you've set the
MMAUDIO_API_KEYenvironment variable - Or create a
config.jsfile with your API key - Verify the API key is correct and active
2. "Connection refused" or Network Errors
Problem: Can't connect to MMAudio API.
Solution:
- Check your internet connection
- Verify the base URL is correct
- Check if there are any firewall restrictions
3. "Insufficient credits" Error
Problem: Your MMAudio account doesn't have enough credits.
Solution:
- Check your account balance at mmaudio.net/dashboard
- Purchase additional credits if needed
- Use the
validate_api_keytool to check your account status
4. MCP Client Not Detecting Extension
Problem: Your MCP client (Cursor, Claude Desktop) doesn't show MMAudio tools.
Solution:
- Verify the configuration path in your MCP client settings
- Ensure Node.js is in your PATH
- Check the server logs for errors
- Restart your MCP client
Debug Mode
Enable debug mode for detailed logging:
DEBUG=true MMAUDIO_API_KEY=your-key node server/index.js
Logs
The server logs to stderr. You can redirect logs to a file:
node server/index.js 2> mmaudio.log
📄 API Reference
Response Format
All tools return responses in this format:
{
"content": [
{
"type": "text",
"text": "{\"success\": true, \"message\": \"...\", \"result\": {...}}"
}
]
}
Success Response Example
{
"success": true,
"message": "Audio generated successfully from text",
"result": {
"audio_url": "https://example.com/generated_audio.wav",
"content_type": "audio/wav",
"file_name": "generated_audio.wav",
"file_size": 1024000,
"duration": 8,
"prompt": "coffee shop ambiance"
}
}
Error Response Example
{
"success": false,
"error": "Invalid API key. Please check your MMAudio API key.",
"code": "INVALID_REQUEST"
}
🔒 Security
- API keys are handled securely and not logged
- All requests use HTTPS
- The extension runs in a sandboxed environment
- No sensitive data is stored locally
📖 Related Documentation
- MMAudio API Documentation
- Model Context Protocol Specification
- Cursor MCP Integration Guide
- Claude Desktop MCP Setup
🤝 Contributing
- Fork the repository
- Create a feature branch
- Make your changes
- Add tests if applicable
- Submit a pull request
📝 License
This project is licensed under the MIT License - see the LICENSE file for details.
💬 Support
Made with ❤️ by the MMAudio Team
推荐服务器
Baidu Map
百度地图核心API现已全面兼容MCP协议,是国内首家兼容MCP协议的地图服务商。
Playwright MCP Server
一个模型上下文协议服务器,它使大型语言模型能够通过结构化的可访问性快照与网页进行交互,而无需视觉模型或屏幕截图。
Magic Component Platform (MCP)
一个由人工智能驱动的工具,可以从自然语言描述生成现代化的用户界面组件,并与流行的集成开发环境(IDE)集成,从而简化用户界面开发流程。
Audiense Insights MCP Server
通过模型上下文协议启用与 Audiense Insights 账户的交互,从而促进营销洞察和受众数据的提取和分析,包括人口统计信息、行为和影响者互动。
VeyraX
一个单一的 MCP 工具,连接你所有喜爱的工具:Gmail、日历以及其他 40 多个工具。
graphlit-mcp-server
模型上下文协议 (MCP) 服务器实现了 MCP 客户端与 Graphlit 服务之间的集成。 除了网络爬取之外,还可以将任何内容(从 Slack 到 Gmail 再到播客订阅源)导入到 Graphlit 项目中,然后从 MCP 客户端检索相关内容。
Kagi MCP Server
一个 MCP 服务器,集成了 Kagi 搜索功能和 Claude AI,使 Claude 能够在回答需要最新信息的问题时执行实时网络搜索。
e2b-mcp-server
使用 MCP 通过 e2b 运行代码。
Neon MCP Server
用于与 Neon 管理 API 和数据库交互的 MCP 服务器
Exa MCP Server
模型上下文协议(MCP)服务器允许像 Claude 这样的 AI 助手使用 Exa AI 搜索 API 进行网络搜索。这种设置允许 AI 模型以安全和受控的方式获取实时的网络信息。