widemem — Claude Code Memory Skill

Remembers things at scale and knows when it doesn't.

Claude's built-in memory works fine for 20 facts. When you have hundreds of memories across users and projects, you need semantic search, importance scoring, and confidence levels — not grep over a markdown file.

widemem gives Claude Code a real memory engine: vector embeddings, LLM-based fact extraction, temporal decay, and the honesty to say "I don't have that" instead of guessing.

Built on widemem-ai — the open-source memory layer for AI agents.

Built-in memory vs widemem

	Claude's MEMORY.md	widemem
Storage	Flat markdown	Vector embeddings (FAISS/Qdrant)
Search	Grep / full context load	Semantic similarity
Knows when it doesn't know	No	Yes — 4 confidence levels
Importance scoring	No	1-10 with temporal decay
Dedup / conflict detection	No	LLM-based
Quality gates	No	5-tier classification
Scales to 1000s of facts	Poorly (context window)	Yes (vector index)
Multi-user support	No	Built-in

With this skill

You: "What city do I live in?"
AI:  confidence: NONE — "I don't have that stored. Want to tell me?"
You: "Berlin."
AI:  [stores with importance 7, semantic embedding, never guesses again]

...500 memories later...

You: "What do you know about my work?"
AI:  confidence: HIGH — "You're a data scientist at Stripe.
     You prefer Python with type hints. You're on the platform team."

Install

1. Install widemem

pip install widemem-ai[mcp,sentence-transformers]

For OpenAI embeddings instead of local:

pip install widemem-ai[mcp]

2. Install the skill

Option A — Plugin (recommended):

# In Claude Code
/plugin install widemem

Option B — Manual:

# Copy skill files to your project
git clone https://github.com/remete618/widemem-skill.git
mkdir -p .claude/skills
cp -r widemem-skill/skills/widemem .claude/skills/

# Add MCP config (or merge into your existing .mcp.json)
cp widemem-skill/.mcp.json .mcp.json

3. Configure (optional)

Edit .mcp.json to change providers:

Env Variable	Default	Options
`WIDEMEM_LLM_PROVIDER`	`ollama`	`openai`, `anthropic`, `ollama`
`WIDEMEM_LLM_MODEL`	`llama3.2`	Any model your provider supports
`WIDEMEM_EMBEDDING_PROVIDER`	`sentence-transformers`	`openai`, `sentence-transformers`
`WIDEMEM_DATA_PATH`	`~/.widemem/data`	Any local path

For cloud providers, set OPENAI_API_KEY or ANTHROPIC_API_KEY in the env block.

Commands

Command	What it does
`/mem search <query>`	Semantic search across all memories
`/mem add <text>`	Store a fact (with quality gates — not everything gets in)
`/mem pin <text>`	Pin critical fact at importance 9.0 (allergies, API keys, things that matter)
`/mem stats`	Memory count and health check
`/mem prune`	Find duplicates and stale entries
`/mem export`	Export all memories as JSON
`/mem reflect`	Full memory audit — find contradictions, duplicates, staleness

Or just talk normally. The skill triggers on "remember this", "do you remember", "what do you know about me", and similar phrases.

The "Not Everything Gets Stored" Problem

Most memory tools store everything. You say "ok thanks" and now that's a memory forever. You sneeze and it stores your sneeze.

widemem has quality gates. Every incoming fact gets classified before storage:

Classification	Example	What happens
CRITICAL	"Remember: I'm allergic to penicillin"	Pinned at importance 9.0
HIGH	"Actually I moved to Berlin"	Deletes old location, stores new
MEDIUM	"I work at Stripe"	Dedup check, then store
LOW	"I'm on the dev branch right now"	Stored only if you insist
SKIP	"ok", "thanks", "run the tests"	Silently ignored

Your memory stays clean. Your vector store stays relevant. Your facts stay accurate.

Confidence Scoring

Every search tells you how sure it is:

Level	Similarity	What the AI says
HIGH	>= 0.65	Answers confidently
MODERATE	>= 0.45	"I think so, but not 100% sure"
LOW	>= 0.25	"I have a vague recollection..."
NONE	< 0.25	"I don't have that stored."

The skill never makes up memories. If confidence is NONE, it says so instead of guessing.

Frustration Detection

You: "I ALREADY told you my anniversary is March 15th"

The skill detects frustration, searches harder, and when you repeat the fact, pins it at high importance so it never misses it again. Turns out user annoyance is actually useful data.

Memory Reflection

/mem reflect runs a full audit:

Duplicates — "Lives in SF" and "User's city is San Francisco" → keep one
Contradictions — "Works at Google" vs "Works at Meta" → which is current?
Stale entries — "Debugging auth module" from 3 weeks ago → still relevant?
Consolidation — "Likes Python" + "Uses mypy" + "Prefers typed code" → merge

Reports everything before touching anything. Nothing gets deleted without your say-so.

How It Works

You: "I work at Stripe as a data scientist"
     |
     v
Quality Gate: MEDIUM (personal fact) --> dedup check --> no match --> store
     |
     v
widemem_add --> LLM extracts: "Works at Stripe as a data scientist" (importance: 7)
     |
     v
Stored in FAISS with embedding vector

...three weeks later...

You: "What do you know about my job?"
     |
     v
widemem_search --> confidence: HIGH, similarity: 0.89
     |
     v
"I recall you work at Stripe as a data scientist."

Architecture

+----------------------------------+
|  Claude Code + /mem skill        |
|  (SKILL.md + references/)        |
+-----------+----------------------+
            | MCP protocol (stdio)
+-----------v----------------------+
|  widemem MCP Server              |
|  7 tools: add, search, pin,     |
|  delete, count, export, health   |
+-----------+----------------------+
            |
+-----------v----------------------+
|  widemem-ai SDK                  |
|  Fact extraction, conflict       |
|  resolution, confidence scoring  |
+----------+-----------+-----------+
| FAISS/   | SQLite    | LLM       |
| Qdrant   | history   | (any)     |
+----------+-----------+-----------+

vs. Other Memory Solutions

Feature	widemem	claude-mem	Pure SKILL.md skills
Semantic search	FAISS/Qdrant	Chroma	grep
Confidence scoring	4 levels	No	No
Quality gates	5 tiers	No	Some
Frustration detection	Yes	No	No
YMYL protection	Yes	No	No
Conflict resolution	LLM-based	No	No
Memory pinning	Yes	No	No
Self-reflection	`/mem reflect`	No	`/memory prune`
Runs fully local	Ollama + FAISS	Requires Chroma + Bun	Filesystem
Install	pip + skill	npm + plugin	curl
Says "I don't know"	Yes (4 confidence levels)	No	No

Requirements

Python 3.10+
widemem-ai[mcp] (pip install widemem-ai[mcp])
One of: Ollama (local, free), OpenAI API key, or Anthropic API key
For local embeddings (no API key needed): pip install widemem-ai[sentence-transformers]

The Full SDK

This skill is a taste of widemem-ai. The Python SDK can do more:

Hierarchical memory (facts → summaries → themes)
Retrieval modes (fast/balanced/deep)
Topic weighting and custom extraction
YMYL protection for health/financial/legal data
Time-range queries and temporal decay functions
Batch conflict resolution
Export/import, history audit trails

If the skill makes you want more, the SDK is where it lives.

License

Apache 2.0 — same as widemem-ai.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.claude-plugin		.claude-plugin
skills/widemem		skills/widemem
.gitignore		.gitignore
.mcp.json		.mcp.json
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

widemem — Claude Code Memory Skill

Built-in memory vs widemem

With this skill

Install

1. Install widemem

2. Install the skill

3. Configure (optional)

Commands

The "Not Everything Gets Stored" Problem

Confidence Scoring

Frustration Detection

Memory Reflection

How It Works

Architecture

vs. Other Memory Solutions

Requirements

The Full SDK

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

widemem — Claude Code Memory Skill

Built-in memory vs widemem

With this skill

Install

1. Install widemem

2. Install the skill

3. Configure (optional)

Commands

The "Not Everything Gets Stored" Problem

Confidence Scoring

Frustration Detection

Memory Reflection

How It Works

Architecture

vs. Other Memory Solutions

Requirements

The Full SDK

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages