MCP 服务器

mcp-memory-server

A knowledge-graph-based persistent memory server for the Model Context Protocol, storing entities, observations, and relations in SQLite with semantic vector search and temporal versioning.

README

MCP Memory Server

A knowledge-graph-based persistent memory server for the Model Context Protocol (MCP). Stores entities, observations, and relations in SQLite with semantic vector search, temporal versioning, and cursor-based pagination.

Attribution

This project is derived from @modelcontextprotocol/server-memory by Anthropic, PBC, originally published under the MIT License as part of the MCP Servers monorepo. The original LICENSE file is preserved in this repository.

Features

SQLite storage with WAL mode, foreign key constraints, and database-level deduplication
Semantic vector search via sqlite-vec + all-MiniLM-L6-v2 ONNX embeddings (automatic, no config needed)
Temporal versioning — observations and relations track superseded_at timestamps; old data is preserved for history but hidden from active queries
Point-in-time queries — as_of parameter on read/search operations recovers the graph state at any past timestamp
Entity soft-delete — delete_entities marks entities as superseded (preserving history) rather than hard-deleting
Entity name normalization — surface variants like Dustin-Space, dustin/space, and DUSTIN SPACE all resolve to the same identity key
Observation metadata — importance (1-5 scale), context_layer (L0/L1/null), and memory_type classification on observations
Context layers — get_context_layers returns L0 (always-loaded rules) and L1 (session-start context) observations with token budgets
Session-start briefing — get_summary returns top observations by importance, recently updated entities, and aggregate stats
Entity timeline — view full change history (active + superseded + tombstoned observations and relations)
Similarity detection — add_observations warns when new observations are semantically similar to existing ones
Four-tier eviction — Active → Superseded → Tombstoned → Hard-deleted degradation chain with 6-month LRU shield and configurable size cap (default 1 GB)
Cursor-based pagination — sorted by most-recently-updated, stable under concurrent use
Project scoping — optional projectId parameter isolates entities by project
Auto-migration — upgrades from JSONL to SQLite automatically on first run

Installation

npm install
npm run build

Note: better-sqlite3 is a native C addon. You'll need C build tools (gcc, make) installed. On Fedora: sudo dnf install gcc make. On Ubuntu/Debian: sudo apt install build-essential.

Full harness install (recommended for Claude Code)

The server alone provides the storage layer. To replicate the full session-lifecycle integration — auto-loaded L0 context, freshness scanning, pre/post-compact agents, noise gating, /audit-memory and /checkpoint skills, custom QA agents — see harness/README.md:

cd harness
./install.sh

The installer is idempotent, backs up existing files, and prints the manual integration steps (settings.json merge, MCP registration, CLAUDE.md fragments) at the end.

Usage

With Claude Code

Add to your MCP server configuration:

{
  "mcpServers": {
    "memory": {
      "command": "node",
      "args": ["/path/to/mcp-memory-server/dist/index.js"],
      "env": {
        "MEMORY_FILE_PATH": "/path/to/your/memory.db"
      }
    }
  }
}

Environment Variables

Variable	Default	Description
`MEMORY_FILE_PATH`	`memory.db` (alongside script)	Path to database file. Extension determines backend: `.db`/`.sqlite` → SQLite, `.jsonl` → JSONL
`MEMORY_VECTOR_SEARCH`	`on`	Set to `off` to disable vector search entirely
`MEMORY_SIZE_CAP_BYTES`	`1000000000` (1 GB)	Database size cap for the eviction system. Eviction triggers at 90% of cap

Tools

Tool	Description	Key Parameters
`create_entities`	Create new entities in the knowledge graph	`entities[]`, `projectId?`
`create_relations`	Create directed relations between entities	`relations[]` (from, to, relationType)
`add_observations`	Add observations to existing entities. Returns `similarExisting` when new observations are semantically close to existing ones	`observations[]` (entityName, contents[], importances?[], contextLayers?[], memoryTypes?[])
`delete_entities`	Soft-delete entities (preserves history for `as_of` and `entity_timeline`)	`entityNames[]`
`delete_observations`	Delete specific observations from entities	`deletions[]` (entityName, contents[])
`delete_relations`	Delete relations between entities	`relations[]` (from, to, relationType)
`supersede_observations`	Atomically retire old observations and insert replacements. Preserves history	`supersessions[]` (entityName, oldContent, newContent)
`invalidate_relations`	Mark relations as no longer active. Preserves history. Idempotent	`relations[]` (from, to, relationType)
`set_observation_metadata`	Update importance, context layer, and/or memory type on existing observations in-place	`updates[]` (entityName, content, importance?, contextLayer?, memoryType?)
`read_graph`	Read the knowledge graph (paginated, sorted by most recently updated)	`projectId?`, `asOf?`, `cursor?`, `limit?`
`search_nodes`	Search entities by name, type, or observation content. LIKE + semantic vector search	`query`, `projectId?`, `asOf?`, `cursor?`, `limit?`
`open_nodes`	Retrieve specific entities by name with full relations	`names[]`, `projectId?`, `asOf?`
`list_projects`	List all project names in the knowledge graph	—
`entity_timeline`	Full change history for an entity (active + superseded + tombstoned items)	`entityName`, `projectId?`
`get_context_layers`	Returns L0/L1 observations sorted by importance with token budgets	`projectId?`, `layers?`
`get_summary`	Session-start briefing: top observations, recent entities, aggregate stats	`projectId?`, `excludeContextLayers?`, `limit?`

Data Model

Entities — named nodes with a type, a project scope, and timestamped observations. Entity names are normalized (case-insensitive, separator-insensitive) for identity matching while preserving the original display form.
Relations — directed edges between entities with temporal tracking (createdAt, supersededAt)
Observations — { content, createdAt, importance, contextLayer, memoryType } objects attached to an entity (the atomic unit of knowledge)

Temporal Versioning

Observations, relations, and entities support a four-tier lifecycle:

Active items appear in read_graph, search_nodes, and open_nodes
Superseded items are hidden from active queries but preserved for history and as_of recovery
Tombstoned items have their content stripped by the eviction system but preserve the existence skeleton
Hard-deleted items are permanently removed when under extreme size pressure
Use entity_timeline to see the full history of any entity (all tiers)
Use as_of parameter on read_graph/search_nodes/open_nodes to recover the graph at any past timestamp

Vector Search

Vector search is automatic when using the SQLite backend:

Uses sqlite-vec (brute-force KNN) with all-MiniLM-L6-v2 ONNX embeddings (384 dimensions)
Model downloads ~23MB on first startup (cached thereafter)
LIKE substring search always runs; vector search adds supplementary semantic matches
Set MEMORY_VECTOR_SEARCH=off to disable
If the model fails to load, the system degrades gracefully to LIKE-only search

Storage Backends

SQLite (default, recommended)

SQLite is the default and recommended backend. Features: WAL mode, FK constraints, database-level deduplication, vector search, temporal versioning, schema migrations.

JSONL (deprecated)

The JSONL backend is maintained for environments without C build tools but does not support vector search, as_of queries, entity soft-delete, entity_timeline, set_observation_metadata, invalidate_relations, or eviction. Set MEMORY_FILE_PATH to a .jsonl path to use it.

Migration from JSONL

Change MEMORY_FILE_PATH to a .db path (or remove it to use the default)
On first run, the server auto-migrates data from the .jsonl file
The original JSONL file is renamed to .jsonl.bak

Known Limitations

Entity names are globally unique across all projects (relations use names as FK)
Project filtering is advisory, not a security boundary
Paginated relation coverage is incomplete — use open_nodes for full relations on specific entities
Vector search is best-effort — degrades to LIKE-only if model/extension unavailable
vec0 is brute-force KNN — fine at current scale, consider ANN at ~50,000+ observations
Context layers and memory types are caller-managed — the server stores them but does not auto-classify

See CLAUDE.md for detailed architecture documentation and the full limitations list.

Development

npm test           # run tests (520 tests across 9 test files)
npm run test:watch # run tests in watch mode
npm run build      # compile TypeScript
npm run watch      # compile in watch mode

Set SKIP_VECTOR_INTEGRATION=1 to skip the vector integration tests (which download the embedding model) for faster CI runs.

License

See LICENSE for the full text and original copyright notices. The upstream project is transitioning from MIT to Apache-2.0; documentation is licensed under CC-BY-4.0.