st-k8s MCP Server
Exposes Kubernetes cluster management tools to LLMs so they can query pods, deployments, logs, and metrics, and manage port forwards, in natural language.
ST-K8s
View and chat with your Kubernetes cluster and container log files.
brew tap bhf/st-k8s
brew install st-k8s
st-k8s
Features a dashboard (with a K9s-inspired dark theme and keyboard navigation), a REST API, port forwarding management, resource monitoring, and an MCP server. The in-browser AI chat is powered by the Copilot SDK, any OpenAI API-compatible provider, or local WebLLM models (which require WebGPU support).

Uses GitHub Projects for planning and tracking.
Keyboard Navigation
The dashboard supports K9s-style keyboard navigation. Press : to open the command palette and navigate between resources using commands or aliases:
- :pods or :po
- :deployments or :deploy
- :services or :svc
- ...and many more standard K8s shortcuts.
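The alias lookup described above can be sketched as a small resolver. This is only an illustration, not the project's actual code: the names `RESOURCE_ALIASES` and `resolvePaletteCommand` are hypothetical, and the table holds just the three alias pairs listed above.

```typescript
// Hypothetical alias table: only the three pairs named in this README.
const RESOURCE_ALIASES = new Map<string, string>([
  ["po", "pods"],
  ["deploy", "deployments"],
  ["svc", "services"],
]);

// Canonical resource names are the values of the alias table.
const KNOWN_RESOURCES = new Set<string>(Array.from(RESOURCE_ALIASES.values()));

// Resolve palette input such as ":po" to a canonical resource name.
// Returns null when the input is not a ":" command or is not recognised.
function resolvePaletteCommand(input: string): string | null {
  const trimmed = input.trim().toLowerCase();
  if (!trimmed.startsWith(":")) return null;
  const word = trimmed.slice(1);
  if (KNOWN_RESOURCES.has(word)) return word;
  return RESOURCE_ALIASES.get(word) ?? null;
}
```

With this sketch, `resolvePaletteCommand(":po")` and `resolvePaletteCommand(":pods")` both resolve to `"pods"`, while input without the leading colon resolves to `null`.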

Log Viewer
View, copy and download streaming logs.

Port Forwarding
Manage Kubernetes port forwarding sessions directly from the dashboard or through AI chat. Supports both Pods and Services.
- Dynamic Config: Specify target ports and local interface bindings.
- Service Mapping: Automatically resolves Service targets to active Pods.
- Agentic Control: Start or stop forwards using natural language through the Copilot integration or MCP server.
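To illustrate the Service Mapping step, here is a minimal sketch of how a Service's label selector might be resolved to an active Pod. It assumes a simplified pod shape (`PodSummary`) and hypothetical function names; how ST-K8s actually picks a Pod is not documented here.

```typescript
// Simplified, hypothetical view of a pod; real Kubernetes objects carry far more fields.
interface PodSummary {
  podName: string;
  labels: Record<string, string>;
  phase: string; // e.g. "Running", "Pending"
  ready: boolean;
}

// A pod backs a Service when every selector key/value pair appears in its labels.
function matchesSelector(
  selector: Record<string, string>,
  labels: Record<string, string>,
): boolean {
  return Object.keys(selector).every((key) => labels[key] === selector[key]);
}

// Pick the first running, ready pod that matches the Service selector.
// Returns null when the selector is empty or no such pod exists.
function pickPodForService(
  selector: Record<string, string>,
  pods: PodSummary[],
): PodSummary | null {
  if (Object.keys(selector).length === 0) return null;
  const candidate = pods.find(
    (pod) => pod.phase === "Running" && pod.ready && matchesSelector(selector, pod.labels),
  );
  return candidate ?? null;
}
```

Choosing the first ready match keeps the sketch simple; a real implementation might prefer the newest Pod or retry when the chosen Pod goes away.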

Resource Monitoring
Monitor CPU and memory usage for Nodes and Pods directly in the dashboard using interactive charts. Requires the Kubernetes Metrics Server to be installed in your cluster.
- Real-time Data: Fetches live metrics from the Kubernetes Metrics Server.
- Node Metrics: View cluster-wide resource utilization across all nodes.
- Pod Metrics: Inspect resource consumption for individual pods in any namespace.
- Visual Charts: Interactive Recharts-based visualizations for easier performance analysis.
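Kubernetes reports CPU and memory as quantity strings (for example `250m` or `128Mi`), which have to be converted to numbers before they can be charted. The sketch below shows one way to do that; the function names are hypothetical, and it covers only the common suffixes rather than the full Kubernetes quantity grammar.

```typescript
// Convert a CPU quantity to millicores.
// Handles plain cores ("2"), millicores ("250m"), microcores ("1500u") and nanocores ("500000000n").
function cpuToMillicores(quantity: string): number {
  const match = /^(\d+(?:\.\d+)?)(n|u|m)?$/.exec(quantity.trim());
  if (!match) throw new Error(`Unrecognised CPU quantity: ${quantity}`);
  const amount = Number(match[1]);
  switch (match[2]) {
    case "n":
      return amount / 1e6;
    case "u":
      return amount / 1e3;
    case "m":
      return amount;
    default:
      return amount * 1e3; // no suffix means whole cores
  }
}

// Multipliers for the common memory suffixes (decimal and binary).
const MEMORY_UNITS = new Map<string, number>([
  ["", 1],
  ["k", 1e3],
  ["M", 1e6],
  ["G", 1e9],
  ["T", 1e12],
  ["Ki", 1024],
  ["Mi", 1024 * 1024],
  ["Gi", 1024 * 1024 * 1024],
  ["Ti", 1024 * 1024 * 1024 * 1024],
]);

// Convert a memory quantity such as "128Mi" to bytes.
function memoryToBytes(quantity: string): number {
  const match = /^(\d+(?:\.\d+)?)([A-Za-z]*)$/.exec(quantity.trim());
  const factor = match ? MEMORY_UNITS.get(match[2] ?? "") : undefined;
  if (!match || factor === undefined) {
    throw new Error(`Unrecognised memory quantity: ${quantity}`);
  }
  return Number(match[1]) * factor;
}
```

For example, `cpuToMillicores("250m")` gives `250` and `memoryToBytes("128Mi")` gives `134217728`.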
Hardware Acceleration & WebGPU
ST-K8s supports local AI models running directly in your browser using WebLLM. This requires WebGPU and hardware acceleration to be enabled.
Google Chrome / Chromium
- Ensure you are on a recent version of Chrome.
- Enable WebGPU: Paste chrome://flags/#enable-unsafe-webgpu into your address bar and set it to Enabled.
- Enable Vulkan (Linux/Windows): Paste chrome://flags/#enable-vulkan and set it to Enabled.
- Relaunch Chrome.
Mozilla Firefox
- Type about:config in the address bar.
- Search for dom.webgpu.enabled and set it to true.
- Search for gfx.webgpu.force-enabled and set it to true if WebGPU doesn't work by default.
- macOS users may also need to ensure gfx.webrender.all is true.
Verification
You can verify WebGPU support by visiting webgpu.github.io/webgpu-samples. If the samples run, ST-K8s will be able to load local models.
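The same check can be done programmatically. The sketch below uses the standard `navigator.gpu.requestAdapter()` entry point of the WebGPU API, but takes the navigator as a parameter so it can also run outside a browser; `NavigatorLike` and `hasWebGpu` are hypothetical names, and this is not necessarily how ST-K8s performs its own detection.

```typescript
// Minimal structural types so the sketch does not depend on WebGPU type definitions.
interface GpuLike {
  requestAdapter(): Promise<object | null>;
}
interface NavigatorLike {
  gpu?: GpuLike;
}

// Resolves to true only when the WebGPU entry point exists and an adapter is returned.
async function hasWebGpu(nav: NavigatorLike): Promise<boolean> {
  if (!nav.gpu) return false; // API not exposed (flag disabled or unsupported browser)
  try {
    const adapter = await nav.gpu.requestAdapter();
    return adapter !== null && adapter !== undefined;
  } catch {
    return false; // treat adapter request failures as "not available"
  }
}
```

In a browser, this would be called as `hasWebGpu(navigator as NavigatorLike)`; a `false` result suggests the flags above still need to be enabled.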
How to Run
Using Homebrew (macOS/Linux)
The easiest way to install and run st-k8s is via Homebrew:
brew tap bhf/st-k8s
brew install st-k8s
st-k8s
From Source
To use the browser-based chat feature, make sure you install the Copilot CLI.
git clone https://github.com/bhf/st-k8s
cd st-k8s
npm run build
npm run start
Using the st-k8s CLI
You can install the project as a global CLI to run the app using the st-k8s command.
# From the repo root — install globally (or publish and install from a registry)
npm install -g .
# During development, link the local package to make `st-k8s` available globally
npm link
# Then launch the app with the CLI (it will build if no build exists)
st-k8s
Notes:
- npm install -g . requires appropriate permissions (use sudo on some systems).
- npm link is useful when iterating locally; run it once from the repo root.
- The st-k8s command will attempt to use a Next.js standalone server if present (from next build), otherwise it runs npm run start.
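The launch decision in the last note can be sketched as a pure function. This is an illustration only: `planLaunch` and `LaunchPlan` are hypothetical names, and the path `.next/standalone/server.js` assumes the default location Next.js uses for its standalone output, which may differ from what the CLI actually checks.

```typescript
interface LaunchPlan {
  command: string;
  args: string[];
}

// Decide how to start the app.
// `fileExists` is injected so the decision can be exercised without touching the disk.
function planLaunch(repoRoot: string, fileExists: (path: string) => boolean): LaunchPlan {
  // Assumed default location of the Next.js standalone server entry point.
  const standaloneServer = `${repoRoot}/.next/standalone/server.js`;
  if (fileExists(standaloneServer)) {
    return { command: "node", args: [standaloneServer] };
  }
  // Fall back to the package script, as the note above describes.
  return { command: "npm", args: ["run", "start"] };
}
```

In a Node.js script, `fileExists` could simply be `fs.existsSync`, and the resulting plan could be handed to `child_process.spawn`.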
Running Tests
This project uses Vitest for testing.
# Run all tests
npm test
# Run tests in watch mode
npm test -- --watch
# Run tests with coverage
npm run test:coverage
End-to-End Tests
This project uses Playwright for End-to-End testing.
# Run E2E tests
npm run test:e2e
API
The OpenAPI (Swagger) spec is available at http://localhost:3000/openapi.json after starting the server, or in the public folder.
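One quick way to explore the spec is to list its operations. The sketch below relies only on the standard OpenAPI layout (`paths`, then route, then HTTP method); `listOperations` is a hypothetical helper, and no specific ST-K8s endpoints are assumed.

```typescript
// Only the part of an OpenAPI document that this sketch needs.
interface OpenApiDoc {
  paths?: Record<string, Record<string, unknown>>;
}

// HTTP methods that OpenAPI allows as keys of a path item.
const HTTP_METHODS = ["get", "put", "post", "delete", "options", "head", "patch", "trace"];

// Return one "METHOD /route" string per operation, with routes sorted alphabetically.
function listOperations(doc: OpenApiDoc): string[] {
  const operations: string[] = [];
  const paths = doc.paths ?? {};
  for (const route of Object.keys(paths).sort()) {
    const pathItem = paths[route] ?? {};
    for (const method of HTTP_METHODS) {
      // Skip non-operation keys such as "parameters" or "summary".
      if (method in pathItem) operations.push(`${method.toUpperCase()} ${route}`);
    }
  }
  return operations;
}
```

With the server running, the document could be loaded with `fetch("http://localhost:3000/openapi.json").then((res) => res.json())` and passed to `listOperations`.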
Model Context Protocol (MCP) Server
This project includes an MCP server that exposes Kubernetes tools to LLMs over stdio. Here are some example uses:
- List of pods
- Rank containers by their memory requests and limits
- Summary of the last events in the namespace
- Get the last 100 lines of logs for a specific pod


Features
Exposes Kubernetes operations as tools. All are read-only except start_port_forward and stop_port_forward, which start and stop port-forwarding sessions:
- list_namespaces
- list_pods
- list_deployments
- list_services
- list_daemonsets
- list_replicasets
- list_statefulsets
- list_ingresses
- list_endpoints
- list_events
- list_pvcs
- list_nodes
- list_configmaps
- list_jobs
- list_cronjobs
- list_serviceaccounts
- list_roles
- list_rolebindings
- get_pod_logs
- list_port_forwards
- start_port_forward
- stop_port_forward
- get_node_metrics
- get_pod_metrics
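For readers curious about what an MCP client sends when it invokes one of these tools, the sketch below builds a `tools/call` request in the JSON-RPC 2.0 shape that MCP uses and frames it as a single line for the stdio transport. The helper names are hypothetical, and the argument name `namespace` in the usage note is an assumption, since the tools' input schemas are not listed here. A real client must also complete the MCP `initialize` handshake before calling tools.

```typescript
// Shape of an MCP tool invocation (JSON-RPC 2.0, method "tools/call").
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

// Build a request for the named tool with the given arguments.
function buildToolCall(
  requestId: number,
  toolName: string,
  toolArgs: Record<string, unknown> = {},
): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id: requestId,
    method: "tools/call",
    params: { name: toolName, arguments: toolArgs },
  };
}

// The stdio transport expects one JSON message per line, so serialise compactly
// (JSON.stringify without indentation emits no raw newlines) and append "\n".
function frameForStdio(request: ToolCallRequest): string {
  return JSON.stringify(request) + "\n";
}
```

For example, `frameForStdio(buildToolCall(1, "list_pods", { namespace: "default" }))` yields one line that could be written to the server's standard input, assuming `list_pods` accepts a `namespace` argument.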
Running the MCP Server
Make sure to authenticate your kubectl context in your preferred way before running the MCP server.
You can run the MCP server directly using:
npm run mcp
You can also run it from VSCode or any MCP-compatible client by configuring it as shown below.
Configuring for VSCode
Add the following to your mcp.json:
{
  "servers": {
    "k8s-tools": {
      "command": "npm",
      "args": ["run", "mcp"],
      "cwd": "/absolute/path/to/st-k8s",
      "disabled": false,
      "autoApprove": []
    }
  }
}
Make sure to replace /absolute/path/to/st-k8s with the actual path to this repository on your machine.
LLM Integration Techniques
This project uses several LLM-based techniques to enhance the development lifecycle and user experience. These artifacts are located in the .github directory:
- Agents: Domain-specific personas which embody specialized knowledge for consistent code generation.
- Instructions: Contextual guidelines that enforce coding standards and architectural patterns.
- Skills: Reusable capabilities that allow the model to perform complex tasks.
- Prompts: Curated prompt templates ensuring high-quality, reproducible outputs for specific tasks.
High Level Architecture

Accessibility
We are committed to making the dashboard accessible to all users. Please refer to our Accessibility Statement and Guidelines for details on current status, findings, and remediation plans.
Security
We take security seriously. Please refer to our Security Review for details on our security posture, findings, and recommendations.