Zilliz MCP Server

Zilliz MCP Server

Enables AI agents to interact with Milvus vector databases and Zilliz Cloud through natural language, allowing users to create clusters, manage collections, insert vector data, and perform semantic searches directly from their AI assistants.

Category
访问服务器

README

Zilliz-MCP-Server

1. Zilliz-MCP-Server Overview

Model Context Protocol (MCP) is a standardized framework that enables AI applications to securely connect to external data sources and tools in real-time. It acts as a universal interface between AI models and various systems, allowing AI assistants to access current information and perform actions beyond text generation.

Zilliz MCP Server enables AI agents to seamlessly interact with Milvus, a popular open-source vector database, and Zilliz Cloud, the fully managed version of Milvus. Through this integration, your AI assistants can create collections, insert vector data, and perform semantic searches directly within their conversations—no manual database management required. Zilliz MCP Server seamlessly integrates with popular AI-powered coding tools like Cursor, Claude, Windsurf, and other MCP-compatible editors, enabling developers to build vector search capabilities directly within their development workflow.

2. Demos

Demo 1: Create a free Milvus cluster with a simple natural language prompt

Create Free Cluster GIF Instead of navigating through web interfaces or complex setup processes, you can get a free, fully functional vector database cluster up and running it directly from Claude, Cursor or any other MCP-compatible AI coding assistants with the Zilliz MCP Server.

The Zilliz MCP server automatically:

  • Provisions a free Milvus cluster using Zilliz Cloud
  • Handles authentication and configuration
  • Returns connection details for immediate use

No need to leave your AI chat interface or manually set up infrastructure. Just ask in natural language and get a working vector database within seconds. This works in any MCP-enabled environment, including Claude's web interface as shown above.

Demo 2: Monitor cluster performance without leaving your chat

Checkout My Cluster GIF Once you have a cluster running, you can inspect its status and generate visualizations directly through natural language:

The Zilliz MCP server:

  • Retrieves real-time cluster metrics and collection details
  • Fetches performance data (CPU computation, capacity usage)
  • Generates visualizations on demand
  • All without writing queries or accessing monitoring dashboards

This demonstrates how you can monitor and analyze your vector database infrastructure conversationally, making cluster management as simple as asking questions in plain English.

Demo 3: Perform semantic search directly from your chat

Talk with your Data GIF Search your vector collections using natural language without writing any code:

The Zilliz MCP server:

  • Performs semantic search across your specified collection
  • Returns relevant results with similarity scores
  • All through conversational interface - no SDK calls or query syntax required

This shows how vector search becomes as simple as describing what you're looking for. Perfect for testing search queries, exploring your data, or building search functionality without leaving your development environment.

3. Requirements

  • Python: 3.10 or higher.
  • uv: A fast Python package installer and resolver. If you don't have it, run brew install uv on OSX, or just run curl -LsSf https://astral.sh/uv/install.sh | sh .
  • A Zilliz Cloud account: sign up for free if you haven't tried Zilliz Cloud before. If you already have one, sign in here.
  • Zilliz Cloud API Key: You'll need an API key to interact with Zilliz Cloud. You can get one by following the instructions here: Manage API Keys.

4. Usage

You can start the server in two ways:

If you are completely new to MCP and Zilliz, we recommend following our step-by-step guide in Step By Step User Guide which will walk you through the complete setup process.

4.1. Standard I/O (StdIO)

This method is useful when the agent and the MCP server are running on the same machine and you want the agent to manage the server's lifecycle directly. The agent communicates with the server over its standard input and output streams.

Configure your agent's MCP JSON file like this:

{
  "mcpServers": {
    "zilliz-mcp-server": {
      "command": "uvx",
      "args": ["zilliz-mcp-server"],
      "env": {
          "ZILLIZ_CLOUD_TOKEN": "your-token-here"
      }
    }
  }
}

Note: Make sure to replace /path/to/your/zilliz-mcp-server with the actual absolute path to the project directory.

4.2. Streamable HTTP

This method runs the server as a standalone HTTP service. This is useful for development and for agents that can communicate over HTTP.

First, clone or download the project repository.

git clone https://github.com/zilliztech/zilliz-mcp-server.git
cd zilliz-mcp-server

Next, create a .env file from the example and fill in your Zilliz Cloud API key.

cp example.env .env

Now, open .env and add your API key:

ZILLIZ_API_KEY="your_api_key_here"

It 'll start the MCP server as a standalone HTTP service

uv run src/zilliz_mcp_server/server.py --transport streamable-http

After starting the server, you can configure your MCP client to connect to it. If the server is running correctly, the available tools will appear in your client's tool list (e.g., in Cursor or Claude).

You can then configure your agent or MCP client to connect to it using a configuration like this:

{
  "mcpServers": {
    "zilliz-mcp-server": {
      "url": "http://localhost:8000/mcp",
      "transport": "streamable-http",
      "description": "Zilliz Cloud and Milvus MCP Server"
    }
  }
}

5. Available Tools

The server exposes two categories of tools for your AI agents.

Zilliz Control Plane Tools

These tools are for managing your Zilliz Cloud resources.

Tool Name Description
list_projects List all projects in your Zilliz Cloud account.
list_clusters List all clusters within your projects.
create_free_cluster Create a new, free-tier Milvus cluster.
describe_cluster Get detailed information about a specific cluster.
suspend_cluster Suspend a running cluster to save costs.
resume_cluster Resume a suspended cluster.
query_cluster_metrics Query various performance metrics for a cluster.

Milvus Data Plane Tools

These tools are for interacting with the data inside a Milvus cluster.

Tool Name Description
list_databases List all databases within a specific cluster.
list_collections List all collections within a database.
create_collection Create a new collection with a specified schema.
describe_collection Get detailed information about a collection, including its schema.
insert_entities Insert entities (data records with vectors) into a collection.
delete_entities Delete entities from a collection based on IDs or a filter expression.
search Perform a vector similarity search on a collection.
query Query entities based on a scalar filter expression.
hybrid_search Perform a hybrid search combining vector similarity and scalar filters.

推荐服务器

Baidu Map

Baidu Map

百度地图核心API现已全面兼容MCP协议,是国内首家兼容MCP协议的地图服务商。

官方
精选
JavaScript
Playwright MCP Server

Playwright MCP Server

一个模型上下文协议服务器,它使大型语言模型能够通过结构化的可访问性快照与网页进行交互,而无需视觉模型或屏幕截图。

官方
精选
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

一个由人工智能驱动的工具,可以从自然语言描述生成现代化的用户界面组件,并与流行的集成开发环境(IDE)集成,从而简化用户界面开发流程。

官方
精选
本地
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

通过模型上下文协议启用与 Audiense Insights 账户的交互,从而促进营销洞察和受众数据的提取和分析,包括人口统计信息、行为和影响者互动。

官方
精选
本地
TypeScript
VeyraX

VeyraX

一个单一的 MCP 工具,连接你所有喜爱的工具:Gmail、日历以及其他 40 多个工具。

官方
精选
本地
graphlit-mcp-server

graphlit-mcp-server

模型上下文协议 (MCP) 服务器实现了 MCP 客户端与 Graphlit 服务之间的集成。 除了网络爬取之外,还可以将任何内容(从 Slack 到 Gmail 再到播客订阅源)导入到 Graphlit 项目中,然后从 MCP 客户端检索相关内容。

官方
精选
TypeScript
Kagi MCP Server

Kagi MCP Server

一个 MCP 服务器,集成了 Kagi 搜索功能和 Claude AI,使 Claude 能够在回答需要最新信息的问题时执行实时网络搜索。

官方
精选
Python
e2b-mcp-server

e2b-mcp-server

使用 MCP 通过 e2b 运行代码。

官方
精选
Neon MCP Server

Neon MCP Server

用于与 Neon 管理 API 和数据库交互的 MCP 服务器

官方
精选
Exa MCP Server

Exa MCP Server

模型上下文协议(MCP)服务器允许像 Claude 这样的 AI 助手使用 Exa AI 搜索 API 进行网络搜索。这种设置允许 AI 模型以安全和受控的方式获取实时的网络信息。

官方
精选