memem
Persistent, self-evolving memory for Claude Code. Stop re-explaining your project every session.
For LLM/AI tool discovery, see llms.txt.
███╗ ███╗███████╗███╗ ███╗███████╗███╗ ███╗
████╗ ████║██╔════╝████╗ ████║██╔════╝████╗ ████║
██╔████╔██║█████╗ ██╔████╔██║█████╗ ██╔████╔██║
██║╚██╔╝██║██╔══╝ ██║╚██╔╝██║██╔══╝ ██║╚██╔╝██║
██║ ╚═╝ ██║███████╗██║ ╚═╝ ██║███████╗██║ ╚═╝ ██║
╚═╝ ╚═╝╚══════╝╚═╝ ╚═╝╚══════╝╚═╝ ╚═╝
persistent memory for Claude Code
What is memem?
memem is a Claude Code plugin that gives Claude persistent memory across sessions. A background miner extracts durable lessons (decisions, conventions, bug fixes, preferences) from your completed sessions, stores them as markdown in an Obsidian vault, and automatically surfaces relevant ones at the start of each new session via a query-tailored briefing.
It's local-first: no cloud services, no API keys required, no vendor lock-in. Everything lives in ~/obsidian-brain/memem/memories/ as human-readable markdown.
When should I use memem?
Use memem if:
- You use Claude Code daily and keep re-explaining your project to every new session
- You want durable memory you can browse and edit as markdown
- You like local-first tools with zero vendor lock-in
- You already use Obsidian (memem plugs straight into your vault)
How is memem different from CLAUDE.md?
CLAUDE.md is a single hand-edited file per project. memem gives you:
- Automatic extraction — no manual note-taking, the miner captures lessons from every completed session
- Query-aware context — only the memories relevant to your current question get injected, not a static dump
- Self-evolving — memories merge, update, and deprecate automatically as your project evolves
- Cross-project — works across every Claude Code project you use, not scoped to one repo
- Security scanning — every write is scanned for prompt injection and credential exfiltration
- Browsable — Obsidian vault with graph view and backlinks for free
Architecture — layered recall
memem uses a 3-tier recall workflow inspired by claude-mem and mem0. Instead of dumping full memory content on every recall, it lets Claude progressively drill into relevant memories:
Session start
┌─────────────────────────────┐
│ SessionStart hook injects: │
│ • L0 full content (~20) │
│ • Compact index of rest │
└──────────┬──────────────────┘
│
▼
During session
┌─────────────────────────────┐
│ 1. memory_search(query) │ → compact index (50 tok/result)
│ 2. memory_get(ids=[...]) │ → full content (500 tok/result)
│ 3. memory_timeline(id) │ → chronological thread
└─────────────────────────────┘
│
▼
┌─────────────────────────────┐
│ Topic-shift detection │ → auto-refreshes context
│ (UserPromptSubmit hook) │ when conversation drifts
└─────────────────────────────┘
Memory layers (auto-classified at mining time):
| Layer | Purpose | Injection |
|---|---|---|
| L0 | Project identity — tech stack, repo structure, core conventions | Full content at session start |
| L1 | Generic conventions — testing, style, commit patterns | Compact index at session start |
| L2 | Domain-specific — most memories (default) | Compact index at session start |
| L3 | Rare/archival — niche failure modes, one-off lessons | Compact index at session start |
Token efficiency: session start injects ~50 tokens per L1-L3 memory (ID + title + snippet) instead of full content. Claude drills into specific memories via memory_get(ids=[...]) only when it needs full detail.
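The split between full L0 content and a compact index can be sketched as follows. This is a minimal illustration, not memem's actual code: the `Memory` dataclass, `compact_line`, `build_session_context`, and the 80-character snippet length are all hypothetical names and values chosen for the example.

```python
from dataclasses import dataclass


@dataclass
class Memory:
    id: str
    title: str
    layer: int  # 0-3, matching the L0-L3 table above
    content: str


def compact_line(mem: Memory, snippet_chars: int = 80) -> str:
    """One index entry: ID + title + a short snippet (length is an assumption)."""
    snippet = " ".join(mem.content.split())[:snippet_chars]
    return f"[{mem.id}] {mem.title}: {snippet}"


def build_session_context(memories: list[Memory]) -> str:
    """Inject L0 memories in full and everything else as a compact index."""
    full = [f"## {m.title}\n{m.content}" for m in memories if m.layer == 0]
    index = [compact_line(m) for m in memories if m.layer > 0]
    return "\n\n".join(full) + "\n\nIndex:\n" + "\n".join(index)
```

Only the index lines grow with the size of the vault, which is why the per-memory cost at session start stays small.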
Opt-in features:
- `MEMEM_SHOW_BANNER=1`: show a one-line status banner at session start (off by default)
- `MEMEM_PRETOOL_GATING=1`: enrich Read tool calls with memories about the target file (off by default)
- `MEMEM_TOPIC_SHIFT_THRESHOLD=0.3`: keyword overlap threshold for topic-shift re-firing (default 0.3)
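To make the topic-shift threshold concrete, here is one way a keyword-overlap check could work. It is a sketch under assumptions: the README does not say how overlap is computed, so the Jaccard measure, the stopword list, and the rule "re-fire when overlap falls below the threshold" are illustrative guesses, and all function names are hypothetical.

```python
import re

# Small illustrative stopword list; the real one (if any) is unknown.
STOPWORDS = {"the", "a", "an", "and", "or", "to", "of", "in", "is", "it",
             "for", "on", "how", "do", "i"}


def keywords(text: str) -> set[str]:
    """Lowercased word tokens minus stopwords."""
    return {w for w in re.findall(r"[a-z0-9_]+", text.lower()) if w not in STOPWORDS}


def keyword_overlap(previous: str, current: str) -> float:
    """Jaccard overlap between the keyword sets of two prompts (assumed metric)."""
    a, b = keywords(previous), keywords(current)
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def topic_shifted(previous: str, current: str, threshold: float = 0.3) -> bool:
    """Assumption: context is refreshed when overlap drops below the threshold."""
    return keyword_overlap(previous, current) < threshold
```

Under this reading, raising the threshold makes context refreshes more frequent, and lowering it makes them rarer.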
How do I install memem?
```
/plugin marketplace add TT-Wang/memem
/plugin install memem
```
That's it. On first run, bootstrap.sh self-heals everything:
- Verifies Python ≥ 3.11, or installs it via `uv python install 3.11` if your system Python is too old
- Installs `uv` if missing (via the official Astral installer)
- Syncs deps into a plugin-local `.venv` (hash-cached against `uv.lock`)
- Creates and canary-tests `~/.memem/` and `~/obsidian-brain/`
- Writes `~/.memem/.capabilities` (used for degraded-mode decisions)
- Execs the real MCP server
First run: ~5 seconds. Every run after: ~100ms. No separate pip install step.
Nothing mines until you opt in. memem is strictly opt-in as of v0.9.0 — install does not start the miner or touch your sessions. Type /memem to see status and choose what to do next. You can start mining two ways:
- `/memem-mine`: mine new sessions only (from now on)
- `/memem-mine-history`: mine everything, including past history (uses Haiku API credits)
Or just tell Claude "start mining new sessions" / "start mining everything including history" — it knows what to do.
What happens on my first Claude Code session?
You type your first message. The UserPromptSubmit hook fires and sees zero memories (you just installed it), so it injects a welcome banner into Claude's context. Claude reads the banner, tells you memem is active, and — if you have pre-existing Claude Code sessions — offers to mine them via /memem-mine-history.
You work normally. Once you have opted in to mining, the miner daemon runs silently in the background. When your session ends and settles for 5 minutes, the miner extracts memories from the transcript using Claude Haiku and writes them to your vault.
Session 2 onwards: the hook sends your first message to context_assemble, which gives Haiku the relevant memories and asks it to synthesize a tailored briefing. You see a status banner like [memem] 12 memories · miner OK · assembly OK followed by the brief. Claude starts with full context — no re-explaining.
What does memem save?
It saves durable knowledge, not session logs:
- Architecture decisions with rationale ("we use RS256 JWTs because HS256 can't be verified by third parties without sharing the secret")
- Conventions ("tests go in `tests/` not `spec/`", "commit messages use imperative mood", "never import from `internal/` outside its package")
- Bug fixes you might forget ("`bcrypt.compare` is async, must `await`", "timezone math must use `dayjs.utc()` or DST shifts the result by an hour")
- User preferences ("prefer single commits, not stacked PRs", "terse responses, no trailing summaries", "ask before running migrations in prod")
- Known issues & workarounds ("`JWT_SECRET` defaults to `'secret'` if unset, tracked in #123", "pnpm install hangs on corporate VPN, use `--network-timeout=600000`")
- Environment & tooling facts ("project uses Poetry, not pip", "CI runs on Node 20 but local defaults to 22, pin with `nvm use`", "Redis must be running on :6380 not :6379")
- Project structure & invariants ("auth middleware requires Redis", "all DB writes go through `repo/` layer, never raw SQL in handlers")
- Failure patterns & post-mortems ("mocking the DB hid a broken migration last quarter, integration tests must hit a real DB", "don't ship on Fridays after the 2025-11 rollback incident")
- Third-party quirks ("Stripe webhooks retry for 3 days — idempotency key is mandatory", "OpenAI streaming drops the final token if client closes early")
- Domain knowledge ("a 'merchant' in our schema is what the legal team calls a 'counterparty'", "revenue is recognized at ship time, not order time")
It does NOT save:
- Raw session transcripts (those are searchable via `transcript_search`, not stored as memories)
- Trivial or obvious facts
- Session outcomes ("today I worked on X")
Where does memem store my memories?
| Store | Path | Purpose |
|---|---|---|
| Memories | `~/obsidian-brain/memem/memories/*.md` | Source of truth (human-readable markdown) |
| Playbooks | `~/obsidian-brain/memem/playbooks/*.md` | Per-project curated briefings |
| Search DB | `~/.memem/search.db` | SQLite FTS5 index (machine-fast lookup) |
| Telemetry | `~/.memem/telemetry.json` | Access tracking (atomic writes) |
| Event log | `~/.memem/events.jsonl` | Append-only audit trail |
| Capabilities | `~/.memem/.capabilities` | Degraded-mode flags written by bootstrap |
| Bootstrap log | `~/.memem/bootstrap.log` | First-run diagnostics |
You can point memem elsewhere via MEMEM_DIR and MEMEM_OBSIDIAN_VAULT env vars.
What are the MCP tools Claude can call?
| Tool | What it does |
|---|---|
| `memory_save(content, title, tags)` | Store a lesson. Security-scanned for prompt injection and credential exfil before writing. |
| `memory_recall(query, limit)` | Search memories. FTS5 + temporal decay + access reinforcement + importance weighting. |
| `memory_list(scope_id)` | List all memories with stats, grouped by project. |
| `memory_import(source_path)` | Bulk import from files, directories, or chat exports. |
| `transcript_search(query)` | Search raw Claude Code session JSONL logs (not the mined memories). |
| `context_assemble(query, project)` | On-demand query-tailored briefing from playbooks + memories + transcripts. |
What slash commands does memem add?
- `/memem`: welcome, status, help
- `/memem-status`: memory count, projects, search DB size, miner health
- `/memem-doctor`: preflight health check with fix instructions for any blocker
- `/memem-mine`: start the miner daemon manually (normally auto-starts)
- `/memem-mine-history`: opt-in: mine all your pre-install Claude Code sessions
What if the claude CLI isn't on my PATH?
memem enters degraded mode — it still works, just without Haiku-powered context assembly and smart recall. You get FTS-only keyword recall instead of query-tailored briefings. Every session shows [memem] N memories · miner OK · assembly degraded (claude CLI missing — FTS-only recall) at the top of the context, so you know why.
This is by design: missing optional dependencies should degrade, not fail.
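The degrade-instead-of-fail decision can be sketched as below. The function names and the capabilities dict are hypothetical (the real `.capabilities` file format is not documented here), and the banner is a shortened form of the one quoted above with `miner OK` hard-coded for brevity. The only library call relied on is the standard `shutil.which`.

```python
import shutil


def detect_capabilities(which=shutil.which) -> dict[str, bool]:
    """Probe optional dependencies; `which` is injectable so tests can fake PATH."""
    return {"claude_cli": which("claude") is not None}


def recall_strategy(caps: dict[str, bool]) -> str:
    """Use a Haiku briefing when the CLI is available, else keyword-only recall."""
    return "haiku_briefing" if caps.get("claude_cli") else "fts_only"


def status_banner(n_memories: int, caps: dict[str, bool]) -> str:
    """Shortened banner; miner state is hard-coded for this illustration."""
    assembly = "OK" if caps.get("claude_cli") else "degraded"
    return f"[memem] {n_memories} memories · miner OK · assembly {assembly}"
```

Passing a fake `which` makes both branches easy to exercise without touching the real PATH.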
How do I diagnose problems?
Run /memem-doctor. It runs the same preflight the bootstrap shim runs (Python version, mcp importable, claude CLI on PATH, directory writability, uv available), then prints a report labelled HEALTHY, DEGRADED, or FAILING with explicit fix instructions for each blocker.
For deeper debugging:
```bash
tail -f ~/.memem/bootstrap.log      # first-run shim log
tail -f ~/.memem/miner.log          # miner daemon log
cat ~/.memem/events.jsonl           # memory operation audit trail
python3 -m memem.server --status    # detailed status dump
```
How does the mining pipeline work?
Session ends → miner daemon sees the JSONL file in ~/.claude/projects/
→ Waits 5 minutes for the file to "settle" (no more writes)
→ Filters to human messages + assistant prose (strips tool calls, system reminders)
→ One Haiku call with the full context: "extract durable lessons"
→ Haiku returns JSON array of memory candidates
→ Each candidate runs: security scan → dedup check → contradiction detection → save
→ Index rebuilt, per-project playbooks grown and refined
→ Session marked COMPLETE in ~/.memem/.mined_sessions
How does the recall pipeline work?
First message in a new session → auto-recall.sh hook fires
→ Reads ~/.memem/.capabilities for status banner
→ If claude CLI is available → sends (message, memories) to Haiku
→ Haiku synthesizes a focused briefing (300-800 tokens usually)
→ Brief injected into Claude's context as "memem context briefing"
→ If claude CLI is missing → falls back to FTS-only keyword recall
→ Either way, Claude starts its reply with relevant context already loaded
Architecture
memem is split into small, focused modules:
- `models.py`: data types, path constants
- `security.py`: prompt injection + credential exfil scanning
- `telemetry.py`: access tracking, event log (atomic writes, fcntl-locked)
- `search_index.py`: SQLite FTS5 index
- `obsidian_store.py`: memory I/O, dedup scoring, contradiction detection
- `playbook.py`: per-project playbook grow + refine
- `assembly.py`: context assembly via Haiku
- `capabilities.py`: runtime feature detection for degraded mode
- `storage.py`: server-lifecycle helpers (PID management, miner auto-start)
- `server.py`: thin MCP entrypoint (FastMCP imported lazily)
- `cli.py`: command dispatcher for non-MCP entrypoints
- `mining.py`: session mining pipeline
Multi-signal recall scoring:
- 50% FTS relevance
- 15% recency (0.995^hours decay)
- 15% access history (usage reinforcement)
- 20% importance (1-5 scale from Haiku)
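As a worked example of how these weights combine, the sketch below computes a single score. It assumes every signal is first normalized to the range 0 to 1, including mapping the 1-5 importance scale linearly; the README states the weights and the decay base but not the normalization, so that part and the function name are illustrative.

```python
def recall_score(
    fts_relevance: float,       # assumed already normalized to [0, 1]
    hours_since_update: float,
    access_score: float,        # assumed already normalized to [0, 1]
    importance: int,            # 1-5 scale assigned by Haiku
) -> float:
    """Weighted sum of the four recall signals listed above."""
    recency = 0.995 ** hours_since_update
    importance_norm = (importance - 1) / 4  # assumption: linear map to [0, 1]
    return (
        0.50 * fts_relevance
        + 0.15 * recency
        + 0.15 * access_score
        + 0.20 * importance_norm
    )
```

With a decay base of 0.995 per hour, the recency term is about 0.89 after one day and halves roughly every 138 hours, a little under six days.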
Memory schema (markdown frontmatter):
```yaml
---
id: uuid
schema_version: 1
title: "descriptive title"
project: project-name
tags: [mined, project-name]
related: [id1, id2, id3]
created: 2026-04-13
updated: 2026-04-13
source_type: mined | user | import
source_session: abc12345
importance: 1-5
status: active | deprecated
valid_to:                # set when deprecated
contradicts: [id1]       # flagged conflicts
---
```
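Because memories are plain markdown with frontmatter, they are easy to read from your own scripts. The sketch below splits a memory file into its metadata and body using only the standard library. It is deliberately naive (values stay as raw strings, and lists are not parsed); memem itself may well use a real YAML parser, and `split_frontmatter` is a hypothetical helper, not part of its API.

```python
def split_frontmatter(text: str) -> tuple[dict[str, str], str]:
    """Return (metadata, body) for a markdown file with a leading --- block."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}, text
    try:
        end = lines.index("---", 1)  # closing delimiter
    except ValueError:
        return {}, text
    meta: dict[str, str] = {}
    for line in lines[1:end]:
        key, sep, value = line.partition(":")
        if sep:
            # Keep values as raw strings; only strip surrounding quotes.
            meta[key.strip()] = value.strip().strip('"')
    body = "\n".join(lines[end + 1:]).strip()
    return meta, body
```

For example, filtering a vault down to active memories is then a matter of checking `meta.get("status") == "active"` for each file.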
Configuration
| Env var | Default | Purpose |
|---|---|---|
| `MEMEM_DIR` | `~/.memem` | State directory (PID files, search DB, logs) |
| `MEMEM_OBSIDIAN_VAULT` | `~/obsidian-brain` | Vault location |
| `MEMEM_EXTRA_SESSION_DIRS` | (none) | Colon-separated extra session dirs to mine |
| `MEMEM_MINER_SETTLE_SECONDS` | `300` | Seconds to wait before mining a completed session |
| `MEMEM_SKIP_SYNC` | `0` | Bootstrap skips `uv sync` when set to `1` (dev only) |
Setup Obsidian (optional, recommended)
memem works without Obsidian — it just writes markdown. But Obsidian gives you graph view and backlinks for free:
- Download: https://obsidian.md (free)
- Open `~/obsidian-brain` as a vault
- Memories appear in `memem/memories/`, playbooks in `memem/playbooks/`
- Use Graph View to see how memories link via the `related` field
Requirements
- Claude Code
- Python ≥ 3.11
- `uv` (auto-installed by bootstrap.sh on first run)
- `claude` CLI on PATH (optional; required for Haiku-powered assembly, and degraded mode works without it)
Development
```bash
git clone https://github.com/TT-Wang/memem.git
cd memem
pip install -e ".[dev]"
pytest          # 54 tests
ruff check .    # lint
mypy memem      # type check (strict)
```
See CONTRIBUTING.md for the PR process and CHANGELOG.md for version history.
License
MIT