TEN Agent
TEN Agent is a conversational AI powered by TEN, integrating Gemini 2.0 Multimodal Live API, OpenAI Realtime API, RTC, and more. It offers real-time capabilities to see, hear, and speak, along with advanced tools like weather checks, web search, and RAG. - 1313057/TEN-Agent
README
<div align="center">
<a href="https://trendshift.io/repositories/11978" target="_blank"><img src="https://trendshift.io/api/badge/repositories/11978" alt="TEN-framework%2FTEN-Agent | Trendshift" style="width: 250px; height: 55px;" width="250" height="55"/></a>
<a href="https://github.com/TEN-framework/ten-agent/blob/main/README.md"><img alt="README in English" src="https://img.shields.io/badge/English-lightgrey"></a> <a href="https://github.com/ten-framework/ten-agent/blob/main/docs/readmes/README-CN.md"><img alt="简体中文操作指南" src="https://img.shields.io/badge/简体中文-lightgrey"></a> <a href="https://github.com/ten-framework/ten-agent/blob/main/docs/readmes/README-JP.md"><img alt="日本語のREADME" src="https://img.shields.io/badge/日本語-lightgrey"></a> <a href="https://github.com/ten-framework/ten-agent/blob/main/docs/readmes/README-KR.md"><img alt="README in 한국어" src="https://img.shields.io/badge/한국어-lightgrey"></a> <a href="https://github.com/ten-framework/ten-agent/blob/main/docs/readmes/README-ES.md"><img alt="README en Español" src="https://img.shields.io/badge/Español-lightgrey"></a> <a href="https://github.com/ten-framework/ten-agent/blob/main/docs/readmes/README-FR.md"><img alt="README en Français" src="https://img.shields.io/badge/Français-lightgrey"></a> <a href="https://github.com/ten-framework/ten-agent/blob/main/docs/readmes/README-IT.md"><img alt="README in Italiano" src="https://img.shields.io/badge/Italiano-lightgrey"></a>
Getting Started <span> • </span> Create Extensions <span> • </span> TEN Framework Repository
</div>
<br>
<h2>🌟 Gemini Multimodal Live API Extension with RTC</h2>
<!--
-->
Try Google Gemini Multimodal Live API with realtime vision and realtime screenshare detection capabilities, it is a ready-to-use extension, along with powerful tools like Weather Check and Web Search integrated perfectly into TEN Agent.
<br> <h2>TEN Agent Usecases</h2>
<br> <h2>Ready-to-use Extensions</h2>
<br> <h2>TEN Agent Playground in Local Environment</h2>
Prerequisites
Category | Requirements |
---|---|
Keys | • Agora App ID and App Certificate (free minutes every month) <br>• OpenAI API key<br>• Deepgram ASR (free credits available with signup)<br>• FishAudio TTS (free credits available with signup) |
Installation | • Docker / Docker Compose<br>• Node.js(LTS) v18 |
Minimum System Requirements | • CPU >= 2 Core<br>• RAM >= 4 GB |
<br>
macOS: Docker setting on Apple Silicon
For Apple Silicon Macs, uncheck "Use Rosetta for x86/amd64 emulation" in Docker settings. Note: This may result in slower build times on ARM, but performance will be normal when deployed to x64 servers.
<br>
Next step
1. Create .env
file
cp ./.env.example ./.env
2. Setup Agora App ID and App Certificate in .env
AGORA_APP_ID=
AGORA_APP_CERTIFICATE=
3. Start agent development containers
docker compose up -d
4. Enter container
docker exec -it ten_agent_dev bash
5. Build agent
task use
6. Start the web server
task run
7. Edit playground settings
Open the playground at localhost:3000 to configure your agent.
- Select a graph type (e.g. Voice Agent, Realtime Agent)
- Choose a corresponding module
- Select an extension and configure its API key settings
Running Gemini Realtime Extension
Open the playground at localhost:3000.
- Select voice_assistant_realtime graph
- Choose Gemini Realtime module
- Select v2v extension and enter Gemini API key
<br> <h2>TEN Agent Components</h2>
<br> <h2>Stay Tuned</h2>
Before we get started, be sure to star our repository and get instant notifications for all new releases!
<br> <h2>Join Community</h2>
- Discord: Ideal for sharing your applications and engaging with the community.
- GitHub Discussion: Perfect for providing feedback and asking questions.
- GitHub Issues: Best for reporting bugs and proposing new features. Refer to our contribution guidelines for more details.
- X: Great for sharing your agents and interacting with the community.
<br> <h2>Star History</h2>
<br> <h2>Code Contributors</h2>
<br> <h2>Contribution Guidelines</h2>
Contributions are welcome! Please read the contribution guidelines first.
<br> <h2>License</h2>
This project is licensed under the Apache 2.0 License - see the LICENSE file for details.
推荐服务器
Playwright MCP Server
一个模型上下文协议服务器,它使大型语言模型能够通过结构化的可访问性快照与网页进行交互,而无需视觉模型或屏幕截图。
Neon MCP Server
用于与 Neon 管理 API 和数据库交互的 MCP 服务器
mcp-server-qdrant
这个仓库展示了如何为向量搜索引擎 Qdrant 创建一个 MCP (Managed Control Plane) 服务器的示例。
AIO-MCP Server
🚀 集成了 AI 搜索、RAG 和多服务(GitLab/Jira/Confluence/YouTube)的一体化 MCP 服务器,旨在增强 AI 驱动的开发工作流程。来自 Folk。
https://github.com/Streen9/react-mcp
react-mcp 与 Claude Desktop 集成,能够根据用户提示创建和修改 React 应用程序。
MCP Atlassian
适用于 Atlassian Cloud 产品(Confluence 和 Jira)的 Model Context Protocol (MCP) 服务器。此集成专为 Atlassian Cloud 实例设计,不支持 Atlassian Server 或 Data Center 部署。

any-chat-completions-mcp
将 Claude 与任何 OpenAI SDK 兼容的聊天完成 API 集成 - OpenAI、Perplexity、Groq、xAI、PyroPrompts 等。
MySQL MCP Server
允许人工智能助手通过受控界面列出表格、读取数据和执行 SQL 查询,从而使数据库探索和分析更安全、更有条理。
browser-use MCP server
一个由人工智能驱动的浏览器自动化服务器,它实现了模型上下文协议,从而能够使用自然语言控制网页浏览器,以执行诸如导航、表单填写和视觉交互等任务。
mcp-codex-keeper
作为开发知识的守护者,为 AI 助手提供精心策划的最新文档和最佳实践访问权限。