TEN Agent

TEN Agent

TEN Agent is a conversational AI powered by TEN, integrating Gemini 2.0 Multimodal Live API, OpenAI Realtime API, RTC, and more. It offers real-time capabilities to see, hear, and speak, along with advanced tools like weather checks, web search, and RAG. - 1313057/TEN-Agent

AI集成系统
访问服务器

README

TEN Agent banner

<div align="center">

Follow on X Discussion posts Commits Issues closed PRs Welcome GitHub license

Discord TEN Community

<a href="https://trendshift.io/repositories/11978" target="_blank"><img src="https://trendshift.io/api/badge/repositories/11978" alt="TEN-framework%2FTEN-Agent | Trendshift" style="width: 250px; height: 55px;" width="250" height="55"/></a>

GitHub watchers GitHub forks GitHub stars

<a href="https://github.com/TEN-framework/ten-agent/blob/main/README.md"><img alt="README in English" src="https://img.shields.io/badge/English-lightgrey"></a> <a href="https://github.com/ten-framework/ten-agent/blob/main/docs/readmes/README-CN.md"><img alt="简体中文操作指南" src="https://img.shields.io/badge/简体中文-lightgrey"></a> <a href="https://github.com/ten-framework/ten-agent/blob/main/docs/readmes/README-JP.md"><img alt="日本語のREADME" src="https://img.shields.io/badge/日本語-lightgrey"></a> <a href="https://github.com/ten-framework/ten-agent/blob/main/docs/readmes/README-KR.md"><img alt="README in 한국어" src="https://img.shields.io/badge/한국어-lightgrey"></a> <a href="https://github.com/ten-framework/ten-agent/blob/main/docs/readmes/README-ES.md"><img alt="README en Español" src="https://img.shields.io/badge/Español-lightgrey"></a> <a href="https://github.com/ten-framework/ten-agent/blob/main/docs/readmes/README-FR.md"><img alt="README en Français" src="https://img.shields.io/badge/Français-lightgrey"></a> <a href="https://github.com/ten-framework/ten-agent/blob/main/docs/readmes/README-IT.md"><img alt="README in Italiano" src="https://img.shields.io/badge/Italiano-lightgrey"></a>

Getting Started <span>  •  </span> Create Extensions <span>  •  </span> TEN Framework Repository

</div>

<br> <h2>🌟 Gemini Multimodal Live API Extension with RTC</h2> <!-- Usecases -->

Usecases

agent.theten.ai

Try Google Gemini Multimodal Live API with realtime vision and realtime screenshare detection capabilities, it is a ready-to-use extension, along with powerful tools like Weather Check and Web Search integrated perfectly into TEN Agent.

<br> <h2>TEN Agent Usecases</h2>

Usecases

<br> <h2>Ready-to-use Extensions</h2>

Ready-to-use Extensions

<br> <h2>TEN Agent Playground in Local Environment</h2>

Prerequisites

Category Requirements
Keys • Agora App ID and App Certificate (free minutes every month) <br>• OpenAI API key<br>• Deepgram ASR (free credits available with signup)<br>• FishAudio TTS (free credits available with signup)
Installation Docker / Docker Compose<br>• Node.js(LTS) v18
Minimum System Requirements • CPU >= 2 Core<br>• RAM >= 4 GB

<br>

macOS: Docker setting on Apple Silicon

For Apple Silicon Macs, uncheck "Use Rosetta for x86/amd64 emulation" in Docker settings. Note: This may result in slower build times on ARM, but performance will be normal when deployed to x64 servers.

Docker Setting

<br>

Next step

1. Create .env file

cp ./.env.example ./.env

2. Setup Agora App ID and App Certificate in .env

AGORA_APP_ID=
AGORA_APP_CERTIFICATE=

3. Start agent development containers

docker compose up -d

4. Enter container

docker exec -it ten_agent_dev bash

5. Build agent

task use

6. Start the web server

task run

7. Edit playground settings

Open the playground at localhost:3000 to configure your agent.

  1. Select a graph type (e.g. Voice Agent, Realtime Agent)
  2. Choose a corresponding module
  3. Select an extension and configure its API key settings

Module Example

Running Gemini Realtime Extension

Open the playground at localhost:3000.

  1. Select voice_assistant_realtime graph
  2. Choose Gemini Realtime module
  3. Select v2v extension and enter Gemini API key

Gemini Realtime Playground

<br> <h2>TEN Agent Components</h2>

Components Diagram

<br> <h2>Stay Tuned</h2>

Before we get started, be sure to star our repository and get instant notifications for all new releases!

TEN star us gif

<br> <h2>Join Community</h2>

  • Discord: Ideal for sharing your applications and engaging with the community.
  • GitHub Discussion: Perfect for providing feedback and asking questions.
  • GitHub Issues: Best for reporting bugs and proposing new features. Refer to our contribution guidelines for more details.
  • X: Great for sharing your agents and interacting with the community.

<br> <h2>Star History</h2>

Star History Chart

<br> <h2>Code Contributors</h2>

TEN

<br> <h2>Contribution Guidelines</h2>

Contributions are welcome! Please read the contribution guidelines first.

<br> <h2>License</h2>

This project is licensed under the Apache 2.0 License - see the LICENSE file for details.

推荐服务器

Playwright MCP Server

Playwright MCP Server

一个模型上下文协议服务器,它使大型语言模型能够通过结构化的可访问性快照与网页进行交互,而无需视觉模型或屏幕截图。

官方
精选
TypeScript
Neon MCP Server

Neon MCP Server

用于与 Neon 管理 API 和数据库交互的 MCP 服务器

官方
精选
mcp-server-qdrant

mcp-server-qdrant

这个仓库展示了如何为向量搜索引擎 Qdrant 创建一个 MCP (Managed Control Plane) 服务器的示例。

官方
精选
AIO-MCP Server

AIO-MCP Server

🚀 集成了 AI 搜索、RAG 和多服务(GitLab/Jira/Confluence/YouTube)的一体化 MCP 服务器,旨在增强 AI 驱动的开发工作流程。来自 Folk。

精选
本地
https://github.com/Streen9/react-mcp

https://github.com/Streen9/react-mcp

react-mcp 与 Claude Desktop 集成,能够根据用户提示创建和修改 React 应用程序。

精选
本地
MCP Atlassian

MCP Atlassian

适用于 Atlassian Cloud 产品(Confluence 和 Jira)的 Model Context Protocol (MCP) 服务器。此集成专为 Atlassian Cloud 实例设计,不支持 Atlassian Server 或 Data Center 部署。

精选
any-chat-completions-mcp

any-chat-completions-mcp

将 Claude 与任何 OpenAI SDK 兼容的聊天完成 API 集成 - OpenAI、Perplexity、Groq、xAI、PyroPrompts 等。

精选
MySQL MCP Server

MySQL MCP Server

允许人工智能助手通过受控界面列出表格、读取数据和执行 SQL 查询,从而使数据库探索和分析更安全、更有条理。

精选
browser-use MCP server

browser-use MCP server

一个由人工智能驱动的浏览器自动化服务器,它实现了模型上下文协议,从而能够使用自然语言控制网页浏览器,以执行诸如导航、表单填写和视觉交互等任务。

精选
mcp-codex-keeper

mcp-codex-keeper

作为开发知识的守护者,为 AI 助手提供精心策划的最新文档和最佳实践访问权限。

精选