2026-03-20 · 11 min read

What Your AI Doesn't Know It Doesn't Know: Why Memory Needs Diagnostic Introspection

cognition · architecture · agentic-ai


There's a failure mode in AI agents that nobody talks about. It's not hallucination. It's not context window limits. It's not even forgetting.

It's the inability to know what has been forgotten.

Your agent loses context silently. Rules degrade without warning. Knowledge graphs develop blind spots that no query will ever surface — because the agent doesn't know to ask. This is the "unknown unknowns" problem, and it's the reason most AI memory systems fail in production even when they work in demos.

The Claude.md Problem

If you use Claude Code or Cursor, you've probably set up a `CLAUDE.md` file — a persistent instruction set that tells the AI how to behave in your codebase. Coding conventions, architecture decisions, tool preferences, deployment rules.

Here's what actually happens during a long session:

1. **Context window fills up.** Your conversation + tool results + file contents consume tokens.

2. **The system compacts.** Older context gets summarized or dropped to make room.

3. **CLAUDE.md instructions get compressed.** What was "always use bun, never npm" becomes a vague recollection.

4. **Rules degrade to suggestions.** Community testing shows enforcement drops to 60-70% as sessions lengthen.

5. **The agent doesn't notice.** It continues working, now violating constraints it was explicitly given.

This isn't a bug in Claude. It's a fundamental limitation of stateless context windows. The agent has no mechanism to detect that it has lost information. There's no "confidence meter" for instruction retention. No alarm when a rule it was following drops out of context.

The agent doesn't know what it doesn't know.

Beyond Forgetting: The Four Failure Modes

When you examine how AI agents lose knowledge, four distinct patterns emerge:

1. Blind Spots

Areas of the knowledge graph with no coverage. The agent was never told about a critical system dependency, or the information existed once but decayed completely. No query will surface it because there's nothing to find.

Example: Your agent knows about your API layer and your database schema, but has zero knowledge of the caching layer between them. Every architectural suggestion it makes ignores cache invalidation — and it will never spontaneously realize the gap.

2. Shallow Knowledge

Topics where the agent has surface-level awareness but insufficient depth. It knows *that* something exists but not *how* it works. This is dangerous because the agent will confidently use its shallow understanding, producing plausible but wrong outputs.

Example: The agent knows you use Kubernetes but doesn't understand your specific pod affinity rules. It suggests a deployment config that looks correct but violates constraints it was told about three sessions ago.

3. Orphaned Clusters

Islands of knowledge disconnected from the main graph. The information exists but can't be reached through normal association or spreading activation. It's there, but functionally invisible.

Example: You had a detailed conversation about error handling patterns last week. Those memories exist in storage. But they're not connected to the current context — your agent is refactoring error handling right now and has no idea it already has strong opinions about the approach.

4. Stale Zones

Knowledge that was accurate when stored but has since been superseded. The agent hasn't been told about the change and continues operating on outdated information.

Example: You migrated from REST to gRPC three weeks ago. The agent's memory still has strong REST-related patterns. It suggests REST endpoint patterns because that knowledge has higher activation strength — it was accessed more frequently over a longer period.

Why Vector Search Can't Fix This

Traditional AI memory systems — vector databases with semantic search — are structurally incapable of detecting these failures. Here's why:

Vector search answers the question: *"What do I know that's similar to X?"*

It cannot answer: *"What am I missing?"*

If a concept doesn't exist in the embedding space, no similarity query will find it. If knowledge is orphaned, no retrieval path reaches it. If information is stale, its embedding is still valid — the vector doesn't know it's wrong.

This is the fundamental limitation. Vector databases are retrieval systems. They find what's there. They cannot introspect on what's absent.

The Knowledge Graph Difference

A knowledge graph changes the equation. Instead of isolated vectors in embedding space, you have a connected structure where:

- **Entities have relationships.** You can traverse from one concept to related concepts, discovering gaps in the connection pattern.
- **Edges have strength.** Frequently co-accessed knowledge forms stronger bonds (Hebbian learning). Weak edges indicate shallow understanding.
- **Clusters are visible.** You can identify connected components and detect orphaned subgraphs.
- **Temporal metadata exists.** You know when knowledge was last accessed, last reinforced, and how it has decayed.
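
As a rough sketch, that structure can be modeled with plain dictionaries. Everything here — the field names, the entity names, the `neighbors` helper — is illustrative, not shodh-memory's actual schema:

```python
import time

# Minimal knowledge-graph sketch: entities, weighted edges, temporal metadata.
graph = {
    "entities": {"api_layer", "database", "cache_layer"},
    # (source, target) -> edge metadata
    "edges": {
        ("api_layer", "cache_layer"): {"strength": 0.7, "last_reinforced": time.time()},
        ("cache_layer", "database"): {"strength": 0.3, "last_reinforced": time.time() - 86400 * 14},
    },
}

def neighbors(graph, entity):
    """Entities reachable in one hop, with the connecting edge's strength."""
    out = {}
    for (src, dst), meta in graph["edges"].items():
        if src == entity:
            out[dst] = meta["strength"]
        elif dst == entity:
            out[src] = meta["strength"]
    return out

print(neighbors(graph, "cache_layer"))
# {'api_layer': 0.7, 'database': 0.3}
```

Edge metadata like `strength` and `last_reinforced` is what the diagnostics in the rest of this section operate on.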

With this structure, you can compute things that are impossible with vectors alone:

Blind Spot Detection

```
Given: entity A (API layer) and entity C (database)
Expected: connection through entity B (cache layer)
Found: no entity B exists in the graph
Alert: potential blind spot in system architecture knowledge
```

The graph knows that A connects to C through normal software architecture patterns. When the intermediary is missing, that absence is detectable.
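One way to make that detectable in code: keep a table of expected intermediaries for common architecture patterns, and flag any pattern whose endpoints exist in the graph but whose middle entity does not. A toy sketch — the pattern table and all names are hypothetical, not shodh-memory's actual heuristics:

```python
# Hypothetical table: (endpoint, endpoint) -> expected intermediary entity.
expected_intermediaries = {
    ("api_layer", "database"): "cache_layer",
    ("service", "browser"): "load_balancer",
}

def blind_spots(entities, expected):
    """Flag expected intermediary entities that are missing from the graph."""
    gaps = []
    for (a, c), b in expected.items():
        # Both endpoints are known, but the expected middle entity is absent.
        if a in entities and c in entities and b not in entities:
            gaps.append((a, b, c))
    return gaps

known = {"api_layer", "database", "service"}
print(blind_spots(known, expected_intermediaries))
# [('api_layer', 'cache_layer', 'database')]
```

A real system would derive the expected patterns from the graph itself or from a domain ontology rather than a hand-written table, but the check is the same shape: absence becomes a first-class, queryable signal.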

Shallow Knowledge Scoring

```
Entity: "Kubernetes"
- Connected edges: 3 (deploy, pods, services)
- Average edge strength: 0.2 (weak)
- Compared to: "PostgreSQL" with 47 edges, avg strength 0.8
Alert: Kubernetes knowledge is shallow relative to usage frequency
```

Edge count and strength are direct proxies for knowledge depth. An entity with few, weak connections is understood superficially.
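A minimal version of that scoring, assuming each entity carries a list of its edge strengths. The thresholds are arbitrary illustrations, not shodh-memory's tuned values:

```python
def shallow_entities(edge_strengths, min_edges=5, min_avg=0.4):
    """Flag entities whose connectivity suggests surface-level knowledge."""
    flagged = []
    for entity, strengths in edge_strengths.items():
        avg = sum(strengths) / len(strengths)
        if len(strengths) < min_edges or avg < min_avg:
            flagged.append((entity, len(strengths), round(avg, 2)))
    return flagged

# Edge strengths per entity, mirroring the example above.
edges = {
    "Kubernetes": [0.2, 0.25, 0.15],   # 3 weak edges
    "PostgreSQL": [0.8] * 47,          # dense, strong cluster
}
print(shallow_entities(edges))
# [('Kubernetes', 3, 0.2)]
```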

Orphan Detection

```
Cluster: error-handling-patterns (7 memories, 12 edges)
- Last accessed: 8 days ago
- Connected to main graph: no
- Relevance to current context: high (active refactoring)
Alert: disconnected knowledge cluster relevant to current work
```

Graph traversal reveals disconnected components. Cross-referencing with current context identifies which orphans matter.
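The traversal itself is a standard connected-components pass. A sketch with plain breadth-first search — entity names are illustrative:

```python
from collections import deque

def components(entities, edges):
    """Partition the graph into connected components via BFS."""
    adj = {e: set() for e in entities}
    for a, b in edges:
        adj[a].add(b)
        adj[b].add(a)
    seen, comps = set(), []
    for start in entities:
        if start in seen:
            continue
        comp, queue = set(), deque([start])
        while queue:
            node = queue.popleft()
            if node in comp:
                continue
            comp.add(node)
            queue.extend(adj[node] - comp)
        seen |= comp
        comps.append(comp)
    return comps

entities = ["api", "db", "cache", "error_handling", "retry_policy"]
edges = [("api", "db"), ("db", "cache"), ("error_handling", "retry_policy")]
comps = components(entities, edges)
main = max(comps, key=len)
orphans = [c for c in comps if c is not main]
print(sorted(orphans[0]))
# ['error_handling', 'retry_policy']
```

The second step — deciding which orphans matter — would intersect each orphaned component with the entities active in the current context.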

Staleness Detection

```
Entity: "REST API patterns"
- Last reinforced: 3 weeks ago
- Contradicted by: memory[uuid] "migrated to gRPC" (1 week ago)
- Activation strength: still high (historical frequency)
Alert: high-activation knowledge may be outdated
```

When newer memories contradict older, stronger memories, the graph can flag the conflict. The agent can then verify which is current rather than defaulting to the stronger (but stale) association.
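A sketch of that conflict check, assuming each memory records its activation, its last reinforcement time, and any memories that contradict it. The field names are hypothetical:

```python
import time

DAY = 86400
now = time.time()

# Per-entity memory records: strong historical activation, plus any
# newer memories that contradict them.
memories = {
    "REST API patterns": {
        "activation": 0.9,
        "last_reinforced": now - 21 * DAY,
        "contradicted_by": [{"text": "migrated to gRPC", "stored": now - 7 * DAY}],
    },
    "PostgreSQL schema": {
        "activation": 0.8,
        "last_reinforced": now - 2 * DAY,
        "contradicted_by": [],
    },
}

def stale(memories, min_activation=0.5):
    """Flag strong memories contradicted by something stored more recently."""
    flags = []
    for name, m in memories.items():
        newer = [c for c in m["contradicted_by"] if c["stored"] > m["last_reinforced"]]
        if newer and m["activation"] >= min_activation:
            flags.append(name)
    return flags

print(stale(memories))
# ['REST API patterns']
```

Note the filter on activation: a weak, contradicted memory will simply lose out in retrieval anyway; the dangerous case is the strong one.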

How Shodh Implements This

shodh-memory's knowledge graph isn't a retrieval optimization — it's a cognitive diagnostic tool. Here's what runs under the hood:

**Spreading Activation** propagates energy through the graph when a concept is accessed. This isn't just for finding related memories — it reveals which areas of the graph are unreachable from current context. Dead zones in the activation pattern are potential blind spots.
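A minimal spreading-activation pass might look like this: energy attenuates by a decay factor and the edge strength at each hop, and nodes that never receive activation are the dead zones. Parameters and names are illustrative, not shodh-memory's actual values:

```python
def spread(adj, source, energy=1.0, decay=0.5, threshold=0.05):
    """Propagate activation outward from a source, attenuating per hop."""
    activation = {source: energy}
    frontier = [source]
    while frontier:
        nxt = []
        for node in frontier:
            for nb, strength in adj.get(node, {}).items():
                passed = activation[node] * decay * strength
                # Only propagate if the signal is meaningful and an improvement.
                if passed > threshold and passed > activation.get(nb, 0.0):
                    activation[nb] = passed
                    nxt.append(nb)
        frontier = nxt
    return activation

adj = {
    "refactoring": {"error_handling": 0.8, "api": 0.6},
    "error_handling": {"refactoring": 0.8, "retry_policy": 0.9},
    "api": {"refactoring": 0.6},
    "retry_policy": {"error_handling": 0.9},
    "caching": {},  # no edges into it: unreachable from this context
}
act = spread(adj, "refactoring")
unreachable = [n for n in adj if n not in act]
print(unreachable)
# ['caching']
```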

**Hebbian Learning** continuously updates edge strengths based on co-access patterns. Edges that haven't been reinforced decay naturally. When edge strength drops below a threshold, the connection is flagged as potentially stale — the agent knew this relationship once but hasn't verified it recently.
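Those two dynamics — reinforcement on co-access and decay in its absence — can be sketched in a few lines as a bounded update plus exponential decay with a half-life. The rate, half-life, and threshold values are illustrative:

```python
def reinforce(strength, rate=0.1):
    """Co-access strengthens an edge, asymptoting toward 1.0."""
    return strength + rate * (1.0 - strength)

def decay(strength, days_idle, half_life=14.0):
    """Unreinforced edges lose strength exponentially over time."""
    return strength * 0.5 ** (days_idle / half_life)

s = 0.5
s = reinforce(s)            # co-accessed once -> ~0.55
s = decay(s, days_idle=28)  # two half-lives idle -> ~0.1375
stale_edge = s < 0.2        # below threshold: flag as potentially stale
print(round(s, 4), stale_edge)
# 0.1375 True
```

The bounded reinforcement keeps strengths in [0, 1], and the half-life form means an edge's flag threshold translates directly into "how long since this was last verified."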

**Entity Extraction** builds the graph automatically from every memory stored. NER identifies entities, relation detection finds connections, and the graph grows organically. Gaps in entity coverage correspond directly to gaps in knowledge.

**Memory Consolidation** runs periodic maintenance that identifies:

- Orphaned subgraphs (clusters with no edges to the main component)
- Weakening clusters (average edge strength declining over time)
- Entity coverage gaps (topics referenced but never deeply explored)
- Contradictory edges (newer information conflicting with established patterns)

The Practical Impact

Consider a coding agent that has been working with a developer for three months. Traditional memory gives it recall — "what do I know about this codebase?" Knowledge graph introspection gives it *metacognition* — "what should I know that I don't?"

When the agent starts a new session, instead of just loading relevant memories, it can report:

```
⚠ Knowledge gaps detected:
- Authentication flow: last accessed 23 days ago, 4 edges decayed
- CI/CD pipeline: shallow (2 entities, weak connections)
- Error handling: orphaned cluster from session on Mar 3
- API versioning: contradicted by recent migration notes
```

This changes the agent from a passive tool that answers questions to an active participant that surfaces what needs attention. It's the difference between a filing cabinet and a colleague who says "hey, I think we forgot something."

Why This Matters Now

As AI agents move from toy demos to production systems — managing infrastructure, writing code, operating robots — the cost of unknown unknowns scales dramatically. A chatbot that forgets your preference for dark mode is annoying. An autonomous agent that forgets a safety constraint is dangerous.

The industry is focused on making agents remember more. That's necessary but insufficient. The harder problem — the one that separates a memory system from a cognitive system — is knowing what has been lost.

Context windows are lossy. Rules degrade. Knowledge decays. These aren't bugs to fix — they're properties of any finite memory system, biological or artificial. The question isn't whether your agent will forget. It's whether your agent will know that it forgot.

That requires a knowledge graph. That requires introspection. That requires treating memory not as storage, but as a living structure you can examine, diagnose, and repair.

Getting Started

shodh-memory runs as a single binary with zero cloud dependencies. The knowledge graph builds itself from the memories you store — no manual graph construction required.

```bash
# Install via npm (MCP server for Claude Code / Cursor)
npx @shodh/memory-mcp@latest

# Or run the server directly
docker run -d -p 3030:3030 -v shodh-data:/data varunshodh/shodh-memory
```

Once running, every memory automatically feeds into the knowledge graph. Entity extraction, edge formation, Hebbian learning, and decay all happen without configuration. The graph grows as your agent works — and its diagnostic capabilities grow with it.

The agent that knows what it doesn't know is the agent you can actually trust in production.

References

1. Cowan, N. (2001). The Magical Number 4 in Short-Term Memory. *Behavioral and Brain Sciences*, 24(1), 87-114.

2. Hebb, D.O. (1949). *The Organization of Behavior*. Wiley.

3. Anderson, J.R. (1983). A Spreading Activation Theory of Memory. *Journal of Verbal Learning and Verbal Behavior*, 22(3), 261-295.

4. Wixted, J.T. (2004). The Psychology and Neuroscience of Forgetting. *Annual Review of Psychology*, 55, 235-269.

5. Metcalfe, J. & Shimamura, A.P. (1994). *Metacognition: Knowing About Knowing*. MIT Press.

6. Nelson, T.O. & Narens, L. (1990). Metamemory: A Theoretical Framework and New Findings. *Psychology of Learning and Motivation*, 26, 125-173.
