switchyard
Routes coding tasks across multiple AI CLIs (Copilot, Claude Code, Gemini, etc.) with cost-aware tier routing and parallel wave orchestration.
README
<h1 align="center">Threnody</h1> <h3 align="center">Local-first MCP meta-harness — host executes, Threnody coordinates swarms, memory, and learning</h3>
<p align="center"><sub> MCP coordination · self-learning agents · swarm orchestration · cross-session memory · optional delegation </sub></p>
<p align="center"> <a href="LICENSE"><img src="https://img.shields.io/badge/License-Apache_2.0-blue.svg" alt="License"></a> <a href="https://github.com/timjensgrossinger/threnody/actions/workflows/ci.yml"><img src="https://img.shields.io/github/actions/workflow/status/timjensgrossinger/threnody/ci.yml?branch=main" alt="CI"></a> <img src="https://img.shields.io/badge/python-3.10%20%E2%80%93%203.13-blue" alt="Python"> <img src="https://img.shields.io/badge/MCP-stdio-green" alt="MCP"> <a href="CHANGELOG.md"><img src="https://img.shields.io/badge/release-v0.2.0--alpha.1-orange" alt="Release"></a> </p>
<p align="center"> <strong>Plan in Threnody.</strong> Route, decompose, and swarm via MCP.<br> <strong>Execute in the host.</strong> Agent/Task subagents from <code>host_spawn</code> / <code>host_spawn_waves</code>.<br> <strong>Utility delegation (opt-in).</strong> <code>execute_subtask</code> to OpenCode, Aider, or local endpoints — never host→host. </p>
Install in 2 minutes
curl -fsSL https://raw.githubusercontent.com/timjensgrossinger/threnody/main/install.sh | bash
Or clone and install:
git clone https://github.com/timjensgrossinger/threnody.git
cd threnody
./install.sh
Requires: Python 3.10+, macOS or Linux, and at least one host AI CLI (gh, claude, codex, cursor-agent, junie, or opencode).
Restart your shell, then connect from Claude Code, Copilot CLI, Codex, Cursor, Junie, or OpenCode — Threnody registers as an MCP server automatically.
Provider terms: Threnody is not affiliated with or endorsed by any AI provider. Credentials stay in provider-native stores; you configure auth in each host CLI. See docs/LEGAL.md for operator responsibilities.
Docs: limitations · legal · architecture
What is Threnody?
Threnody is a local-first MCP meta-harness for developer workflows. Register it in Claude Code, Copilot CLI, Codex, Cursor, Junie, or OpenCode — Threnody plans and routes in MCP; the host shell executes via host_spawn / host_spawn_waves (Agent or Task subagents). When a handoff includes host_spawn_waves, spawn subagents — do not substitute direct edits on planned files.
execute_subtask is utility delegation only (opt-in): OpenCode, Aider, and local loopback endpoints — never to other host CLIs. Same-host work returns HostNativeRequired with an actionable spawn payload. Claude Code is a router-only host by default.
Search terms that describe the same project: MCP orchestrator, meta-harness, multi-agent coding, swarm coordination, self-learning agents, Copilot / Claude / Codex orchestration.
Claude Code and provider compliance (default posture)
With default configuration, Threnody matches Anthropic's intended MCP pattern for Claude Code:
| Control | Default behavior |
|---|---|
| Execution | Host runs work via Agent and direct edits — Threnody returns host_spawn / host_spawn_waves, not subprocess claude -p loops |
| Router-only | Claude Code is a coordination anchor; Threnody does not delegate to it as a subprocess backend |
Same-host execute_subtask |
Returns HostNativeRequired with an actionable spawn payload |
| Host→host delegation | HostDelegationBlocked — no subprocess to Copilot, Codex, Cursor, Junie, or Claude from MCP |
| Utility delegation | Off by default; opt-in targets OpenCode, Aider, and local loopback endpoints only |
| Routing policy | Advisory by default for all shells — route_task recommended, not mandatory; opt into routing_policy.mode: guarded for coordination gates (docs/HOOKS.md) |
Operator opt-in risk: Enabling providers.router_only_allow_execution for Claude Code can subprocess the CLI. With subscription OAuth, that pattern is documented as high policy risk in docs/LEGAL.md. API-key billing is lower operational risk but still bills outside Threnody's telemetry.
Threnody documents operator responsibilities; it does not provide legal certification. Verify your auth mode and provider terms before changing defaults.
Project skills
Six repo-local skills under skills/ guide MCP workflows from any connected host (Cursor, Copilot, Claude Code, Codex, etc.). Host shells discover them via repo symlinks (.cursor/skills, .claude/skills) and the skill index in copilot-instructions.md.
| Skill | Use when |
|---|---|
| threnody-plan | Plan-only or plan-then-execute; choose waves vs swarm |
| threnody-routing | route_task, routing guard, host-native vs utility delegation |
| threnody-task | plan_task, decompose_task, fleet_plan, host_spawn_waves |
| threnody-swarm | execute_swarm, topology, budget preview, resume |
| threnody-fullstack | Contract-first parallel frontend + backend + API |
| threnody-subtasks | Monitor opt-in utility execute_subtask runs |
Why Threnody?
| Plan in MCP, execute in host | route_task / plan_task return host_spawn / host_spawn_waves — spawn Agent/Task subagents in the host shell. |
| Learn over time | Pattern tracking, draft agents, and an approval queue before anything goes live. |
| Swarm when needed | execute_swarm defaults to host_native: wave plans hand off to the host; no subprocess fanout by default. |
| Utility delegation (opt-in) | execute_subtask(provider_id=…) to OpenCode, Aider, or local endpoints only; host→host delegation is blocked. |
| Planning mode | threnody-plan skill — plan-only vs plan-then-execute; routes to waves or swarms without extra coordinator rounds. |
| Spend discipline | Host-native execution uses existing CLI entitlements; contract-first alignment instead of multi-queen consensus. |
Who this is for
- Developers who want MCP coordination (swarms, memory, learning) inside their existing AI CLI host
- Teams standardizing on one MCP layer across Copilot, Claude Code, Codex, or Cursor
- Operators who want local-first state, explicit provider diagnostics, and approval-gated learned agents
- Anyone who wants credentials to stay in provider-native stores — Threnody does not manage API keys
Who this is not for
- A single chat assistant for casual coding questions — one CLI agent is enough; Threnody adds orchestration overhead
- A hosted SaaS with a support SLA — solo open-source project; GitHub issues are how support happens
- Compliance-certified agent orchestration — Threnody documents operator responsibilities; it does not ship audit-grade compliance bundles
- Non-coding LLM workflows (research, writing, data pipelines) — Threnody wraps CLI coding agents specifically
- Anyone who needs Threnody to guarantee provider ToS compliance — your deployment posture depends on which CLIs and routing patterns you enable
Agents that learn — with your approval
Threnody watches recurring work patterns, drafts reusable agents when evidence is strong, and waits for you to approve before anything goes live.
host execution → track patterns → draft agent → YOU approve → activate → auto-match future work
- No auto-promotion — drafts never become active without explicit approval
- Conservative gates — recurrence, quality score, and low rework must all agree before drafting
- Project vs shared vs cost lanes — project-specific patterns activate sooner; shared patterns need stronger evidence; cost_lane drafts prefer free/low-tier execution when recurrence and quality gates pass
- Inspect everything —
learning_agent_summary,learning_pattern_health, and redactedlearning_audit_logMCP tools
threnody inspect approvals --project .
threnody inspect approvals approve 12 --project . --operator you
How it works
Host shell (Claude / Copilot / Codex / Cursor / …)
→ route_task / plan_task tier + host_spawn / host_spawn_waves
→ host executes Agent or Task subagents, direct edits
→ utility delegation execute_subtask → OpenCode / Aider / local (opt-in)
→ swarm / learning execute_swarm (host_native default), memory_*, learning_*
- You give a task to your MCP host shell.
- Threnody scores complexity → low / medium / high tier (no extra LLM call on the hot path).
route_taskorplan_taskreturns spawn metadata —host_spawnfor single-agent work,host_spawn_wavesfor multi-step plans.- The host runs the work — Claude Code uses Agent; other shells use Task. Same-host
execute_subtaskreturnsHostNativeRequired. - Swarms or utility delegation —
execute_swarmreturns a host-native wave plan by default; optionalexecute_subtaskonly for utility backends when enabled in config.
What leaves your machine
By default, Threnody is local-first:
- Routing state, telemetry, and caches stay in local SQLite (
~/.local/lib/threnody/) - The MCP server talks to your host shell over stdio — no Threnody-hosted control plane
- Outbound traffic comes from the provider CLIs you invoke (Anthropic, OpenAI, GitHub, Google, etc.)
If you route to a network LLM endpoint, re-do the network review. See docs/ARCHITECTURE.md and docs/LEGAL.md.
Honest limitations
- Threnody orchestrates tools that can execute arbitrary code with your user permissions
- Provider risk is real — routing policy reduces it, but cannot change a provider's underlying trust model or terms
- Cost rank is a routing hint, not a bill estimate
- Realistic enforcement outcome is account suspension or rate limits, not necessarily litigation
Full list: docs/RELEASE_LIMITATIONS.md
Spend less across your CLIs
Threnody is built for operators who want token discipline across Copilot, Claude, Codex, Cursor, and other installed CLIs — not a second API bill from the coordination layer.
route_task / plan_task → host_spawn / host_spawn_waves → host Agent/Task
execute_subtask → utility targets only (opt-in; host→host blocked)
execute_swarm → host_native default; delegate opt-in
route_task/plan_taskclassify work and returnhost_spawnmetadata plusexecution_hint.economics.- Host-native first — Agent/Task subagents and direct edits use your existing CLI entitlements; no Threnody subprocess loop for same-host work.
- Utility delegation (opt-in) —
execute_subtask(provider_id=…)to OpenCode, Aider, or local loopback endpoints whendelegation_utilities_enabled: true. - Contract-first alignment — parallel waves + verify gates instead of multi-queen coordinator consensus (see docs/COMPETITIVE.md).
- Measure locally —
inspect_spend,threnody inspect spend, andthrenody gainaggregate delegated-subtask savings fromcost_telemetry. - Guarded coordination (opt-in) — set
routing_policy.mode: guardedto requireroute_taskbefore code edits; Claude Code can install a PreToolUse hook. See docs/HOOKS.md. Default is advisory for all shells.
Workflow guide: docs/COST_SAVINGS.md
Feature highlights
| Feature | What it does | |
|---|---|---|
| 🎯 | Tier routing | Heuristic complexity scoring + host_spawn / execution_hint for host-native work |
| 🧠 | Learning loop | Pattern tracking → draft agents → approval queue → plan-time context injection for matching work |
| 🐝 | Swarm orchestration | execute_swarm returns host_spawn_waves by default (awaiting_host_execution); heuristic intent fans out one agent per file; expand_host_plan for mid-run discovery |
| 💾 | Cross-session memory | memory_* MCP tools backed by local SQLite — shared across all MCP hosts via ~/.local/lib/threnody/cache.db |
| 🔌 | MCP-native | ~43 tools over stdio JSON-RPC; works with any MCP-compatible host shell |
| 🔀 | Utility delegation | Opt-in execute_subtask to OpenCode, Aider, local endpoints; host→host blocked |
| 📋 | Planning skills | Six repo skills under skills/ — start with threnody-plan for plan-only workflows |
| 📈 | Adaptive thresholds | EMA-based threshold learning from record_outcome (pass task_id from route_task; enable per-project learning) |
| 🛡️ | Write safety | Path validation, outside-workspace grant model + audit trail |
| 🔒 | Guarded routing | Optional coordination gate + Claude PreToolUse hooks (routing_policy.mode: guarded; advisory is default) |
Cross-CLI memory
All installed MCP hosts share one SQLite store at ~/.local/lib/threnody/cache.db (WAL). Use these conventions so Claude Code, Copilot, Cursor, and other shells read the same keys:
| Scope | When to use | Cross-CLI tip |
|---|---|---|
global |
Machine-wide coordination | Fully shared — no project_id |
project |
Repo-specific state | Pass a stable absolute path as project_id, not "." (each host resolves "." to its own active workspace) |
task |
One run or wave | Share explicit task_id strings across hosts |
shared/memory.canonical_project_id() resolves relative paths to absolute paths under the active workspace. Do not store secrets in memory — any connected host can read or overwrite keys.
Adaptive routing
route_taskreturnstask_idand persistscomplexity_scorein telemetry.- After work completes, call
record_outcome(task_id=..., outcome=accepted|revised|rejected|reworked). - Enable learning per project:
threnody tune set learning_enabled true --project . - Warm-path EMA updates apply adaptive thresholds at classify time once band and project sample gates are met (requires
cwdonroute_task).
Supported providers
| Provider | Binary | Role | Notes |
|---|---|---|---|
| Claude Code | claude |
Host (router-only) | MCP coordination anchor; executes via Agent / direct edits — no default subprocess delegation |
| GitHub Copilot | gh |
Host | Host executes via Task; coordinates in MCP |
| OpenAI Codex | codex |
Host | Host-native execution via Task tool |
| Cursor | cursor-agent |
Host | Host-native execution via Task tool |
| OpenCode | opencode |
Host + utility | Host-native when MCP host; utility delegation target when enabled |
| JetBrains Junie | junie |
Host / legacy paths | Host-native when MCP host; not a default execute_subtask target from other hosts |
| Aider | aider |
Utility | Secondary adapter; opt-in utility delegation |
| Amazon Q / Kiro | q / kiro |
Legacy / detect | Secondary adapter; not default host→host delegation |
| Mistral Vibe | vibe |
Legacy / detect | Secondary adapter |
| Blackbox AI | blackbox |
Legacy / detect | When CLI installed |
| Windsurf | windsurf |
detect only | Never selected for execution |
Run threnody inspect status --project . --details for your live provider matrix.
Full compatibility matrix: docs/PROVIDER_COMPATIBILITY.md
See it in action
1. Plan in Threnody
plan_task("add JWT auth with tests")
→ host_spawn_waves: [
{ "wave": 1, "agents": [{ "tool": "Agent", "tier": "medium", "prompt": "…", "target_files": ["src/auth.py"] }] },
{ "wave": 2, "agents": [{ "tool": "Agent", "tier": "low", "prompt": "…", "target_files": ["tests/test_auth.py"] }] }
]
2. Execute in the host (Claude Code example)
📋 Wave 1 — spawn Agent (sonnet) → src/auth.py
📋 Wave 2 — spawn Agent (haiku) → tests/test_auth.py
3. Optional utility delegation (opt-in)
# Enable providers.delegation_utilities_enabled in config.yaml first
execute_subtask(prompt="…", tier="low", provider_id="opencode")
→ delegates to OpenCode utility backend
execute_subtask(prompt="…", tier="medium") # same-host caller
→ HostNativeRequired + host_spawn payload
execute_subtask(provider_id="codex") # always blocked
→ HostDelegationBlocked
Shell commands
ghc agent "implement JWT auth for the user service" # multi-agent waves
ghcs "how to list files recursively in python" # quick routed call
threnody inspect status --project . --details # provider readiness
threnody-watch # live TUI monitor
Full reference: docs/CLI.md
Documentation
| Doc | Contents |
|---|---|
| MCP Tools | All 41 MCP tool surfaces |
| CLI Reference | Shell aliases and operator commands |
| Architecture | Trust boundaries and local-first design |
| Configuration | Safe starting config (copy to ~/.local/lib/threnody/config.yaml) |
| Model Discovery | Live catalogs, tier pins, cost ranks |
| Routing Quality | Eval methodology and accuracy |
| Routing accuracy report | Generated fixture stats (python3 -m shared.routing_report --write-docs) |
| Host routing hooks | Claude PreToolUse guard script |
| Release Limitations | Beta scope, privacy, roadmap |
| Legal and Provider Terms | Operator responsibilities and provider links |
| Cost savings workflows | Host-native vs utility delegation and operator commands |
| Competitive positioning | Threnody vs heavy swarm platforms; contract-first alignment |
| Troubleshooting | Common fixes |
Beta status
Public alpha v0.2.0-alpha.1 — MCP tool schemas may change between releases; pin a git tag for stability. See CHANGELOG.md.
- macOS and Linux;
zshandbash - Windows not supported by the installer
- Provider behavior depends on locally installed CLI versions and entitlements
Running tests
THRENODY_TEST_MODE=1 python3 -m pytest tests/ -q
THRENODY_TEST_MODE=1 python3 -m shared.routing_eval
python3 scripts/check_release_archive.py
Uninstall
~/.local/lib/threnody/uninstall.sh
~/.local/lib/threnody/uninstall.sh --purge-data
Legal and provider terms
Threnody is an independent open-source project. It is not affiliated with, endorsed by, or sponsored by Anthropic, OpenAI, GitHub, Google, Cursor, JetBrains, or any other provider named in this repository.
Threnody is provided "AS IS" under the Apache License 2.0 (no warranty; limitation of liability). You are solely responsible for determining whether your routing patterns comply with each provider's current terms.
Default execution model: Threnody plans in MCP; host shells execute via
host_spawn / host_spawn_waves (Agent or Task subagents). Same-host
execute_subtask returns HostNativeRequired. Optional utility delegation
(OpenCode, Aider, local endpoints) requires providers.delegation_utilities_enabled: true.
Host→host subprocess delegation is not supported.
Operator responsibilities and provider links: docs/LEGAL.md
License
Licensed under the Apache License, Version 2.0. Third-party attributions in NOTICE.
Built by @timjensgrossinger.
推荐服务器
Baidu Map
百度地图核心API现已全面兼容MCP协议,是国内首家兼容MCP协议的地图服务商。
Playwright MCP Server
一个模型上下文协议服务器,它使大型语言模型能够通过结构化的可访问性快照与网页进行交互,而无需视觉模型或屏幕截图。
Magic Component Platform (MCP)
一个由人工智能驱动的工具,可以从自然语言描述生成现代化的用户界面组件,并与流行的集成开发环境(IDE)集成,从而简化用户界面开发流程。
Audiense Insights MCP Server
通过模型上下文协议启用与 Audiense Insights 账户的交互,从而促进营销洞察和受众数据的提取和分析,包括人口统计信息、行为和影响者互动。
VeyraX
一个单一的 MCP 工具,连接你所有喜爱的工具:Gmail、日历以及其他 40 多个工具。
graphlit-mcp-server
模型上下文协议 (MCP) 服务器实现了 MCP 客户端与 Graphlit 服务之间的集成。 除了网络爬取之外,还可以将任何内容(从 Slack 到 Gmail 再到播客订阅源)导入到 Graphlit 项目中,然后从 MCP 客户端检索相关内容。
Kagi MCP Server
一个 MCP 服务器,集成了 Kagi 搜索功能和 Claude AI,使 Claude 能够在回答需要最新信息的问题时执行实时网络搜索。
e2b-mcp-server
使用 MCP 通过 e2b 运行代码。
Neon MCP Server
用于与 Neon 管理 API 和数据库交互的 MCP 服务器
Exa MCP Server
模型上下文协议(MCP)服务器允许像 Claude 这样的 AI 助手使用 Exa AI 搜索 API 进行网络搜索。这种设置允许 AI 模型以安全和受控的方式获取实时的网络信息。