Persistent memory for AI agentic infrastructure

Give your agents memory that lasts.

A semantic memory database your agents can read, write, and reason over — across sessions, models, and teams. Model-agnostic. Enterprise-secure. Native MCP and REST API.

Start a 14-day trial · See how it works

Native MCP for Claude Code · OpenAI-compatible · Self-hostable · No credit card required

claude-code · ~/projects/checkout

› remember why we picked Stripe over Adyen
memory_search → 3 results · bank: payments-platform

[decision · importance 9]
Chose Stripe for global card coverage + Issuing API; Adyen contract
required $200k/yr minimum and lacked sub-account model we needed for
multi-tenant payouts. Reviewed 2026-02-11 with finance + legal.

[constraint]
Never log raw PAN or CVV — PCI scope must stay frozen.

› remember this: rate-limited Stripe webhook to 50/s, was 502'ing on bursts
memory_remember → saved · type: pattern · talked-like-ai ✓

AES-256-GCM encryption at rest

15 min JWT rotation

SOC 2 readiness path

MCP-native 1-line install

Features

Everything your agents need to actually remember.

A purpose-built memory layer for agentic systems — semantic search, multi-tenant security, and a credentials gateway so your agents never hold the keys.

SIGNATURE

Talks like AI by default.

Drop in raw notes, Slack threads, or messy bullet points. Memory System auto-rewrites them into an AI-optimized format: structured, retrievable, and tuned for LLM context windows. Your humans write like humans; your agents read like agents.

// you write

"ugh chose stripe again, adyen wanted 200k min and no sub-accounts"

// agent reads

[decision] Chose Stripe over Adyen. Adyen required a $200k/yr minimum and lacked the sub-account model required for multi-tenant payouts.

Model-agnostic

Claude, GPT, Gemini, Llama, Mistral, your local Ollama — same memory bank, every agent. Switch models without losing context.

Enterprise security

AES-256-GCM at rest, JWT + MFA, row-level multi-tenant isolation, full audit logging, rate limiting, and a SOC 2 readiness path.

REST API + native MCP

First-class Model Context Protocol server for Claude Code and a clean REST API for everything else. Drop in via one config block.

Service Gateway

Agents call Cloudflare, AWS, and other APIs through the gateway — they never see the secret. Fine-grained scopes, elevation workflows, full audit trail.

Departments & sharing

Hierarchical departments, cross-team bank sharing with granular CRUD permissions, and time-boxed access for contractors.
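
As a rough illustration of what "granular CRUD permissions" and "time-boxed access" could mean in practice, here is a minimal sketch of a sharing grant. The `BankGrant` class, its fields, and the grantee naming are hypothetical and are not taken from the Memory System API.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone
from typing import Optional

# Hypothetical model: none of these names come from the Memory System API.
CRUD = frozenset({"create", "read", "update", "delete"})


@dataclass(frozen=True)
class BankGrant:
    """One team's (or contractor's) access to a shared memory bank."""
    grantee: str
    bank: str
    permissions: frozenset                  # subset of CRUD
    expires_at: Optional[datetime] = None   # None means no time limit

    def allows(self, action: str, now: datetime) -> bool:
        if action not in CRUD:
            raise ValueError(f"unknown action: {action}")
        if self.expires_at is not None and now >= self.expires_at:
            return False  # the time-boxed grant has lapsed
        return action in self.permissions


now = datetime(2026, 3, 1, tzinfo=timezone.utc)
contractor = BankGrant(
    grantee="contractor:alex",
    bank="payments-platform",
    permissions=frozenset({"read"}),
    expires_at=now + timedelta(days=30),
)
```

Here the contractor can read the bank for 30 days, cannot update it at any point, and loses read access once the grant expires.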

Memory analytics

See which memories actually get retrieved, which drift stale, and where your agents are flying blind.

How it works

Three steps to give every agent shared memory.

1. Connect your agents

Add a single MCP block to Claude Code, or call the REST API from any agent in any language.

2. Remember & recall

Your agent calls memory_remember and memory_search. We auto-summarize, embed, and structure each memory into AI-readable form.

3. Scale across teams

Departments, bank sharing, fine-grained scopes, and elevation requests: Memory System grows with your org.

~/.claude/mcp.json

{
  "mcpServers": {
    "memory-system": {
      "command": "npx",
      "args": ["memory-system-mcp"],
      "env": {
        "MEMORY_API_KEY": "ms_xxxxx"
      }
    }
  }
}

That's it. Restart Claude Code — your agent now has persistent, semantic memory across every session.
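
For agents that do not speak MCP, the REST route might look something like the sketch below. The base URL, the `/v1/memories/search` path, the payload fields, and Bearer authentication are all assumptions made for illustration; check the API reference for the real endpoint and schema.

```python
import json
import urllib.request

# Assumptions for illustration only: the base URL, the /v1/memories/search
# path, the payload fields, and Bearer auth are all hypothetical.
BASE_URL = "https://memory.example.com"


def build_search_request(api_key: str, bank: str, query: str,
                         limit: int = 3) -> urllib.request.Request:
    """Build (but do not send) a JSON POST asking a bank for relevant memories."""
    payload = {"bank": bank, "query": query, "limit": limit}
    return urllib.request.Request(
        url=f"{BASE_URL}/v1/memories/search",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


req = build_search_request("ms_xxxxx", "payments-platform", "why Stripe over Adyen")
# Sending it would be: urllib.request.urlopen(req, timeout=10)
```

The request is only constructed here, not sent, so the sketch runs without network access.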

Security & compliance

Built for the enterprise systems your agents touch.

Memory is sensitive. We treat it like the production data store it is.

Encryption

AES-256-GCM at rest

Every memory body is encrypted with envelope keys. TLS 1.3 in transit.

Identity

JWT + TOTP MFA

Short-lived 15-min access tokens, refresh-token rotation, and optional MFA that can be enforced org-wide.
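
To make the 15-minute lifetime concrete, the sketch below issues and verifies a short-lived JWT using only the Python standard library. The HS256 algorithm, the claim names, and the function names are assumptions; the page does not say how Memory System signs its tokens, and production code should use a vetted JWT library instead.

```python
import base64
import hashlib
import hmac
import json
import time

ACCESS_TTL_SECONDS = 15 * 60  # the 15-minute lifetime quoted above


def _b64url(data: bytes) -> str:
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode("ascii")


def _b64url_decode(text: str) -> bytes:
    return base64.urlsafe_b64decode(text + "=" * (-len(text) % 4))


def issue_token(secret: bytes, subject: str, now: int) -> str:
    """Issue an HS256 JWT that expires ACCESS_TTL_SECONDS after `now`."""
    header = {"alg": "HS256", "typ": "JWT"}
    claims = {"sub": subject, "iat": now, "exp": now + ACCESS_TTL_SECONDS}
    signing_input = ".".join(
        _b64url(json.dumps(part, separators=(",", ":")).encode())
        for part in (header, claims)
    )
    signature = hmac.new(secret, signing_input.encode(), hashlib.sha256).digest()
    return f"{signing_input}.{_b64url(signature)}"


def verify_token(secret: bytes, token: str, now: int) -> dict:
    """Return the claims if the signature is valid and the token is unexpired."""
    signing_input, _, signature = token.rpartition(".")
    expected = hmac.new(secret, signing_input.encode(), hashlib.sha256).digest()
    if not hmac.compare_digest(expected, _b64url_decode(signature)):
        raise ValueError("bad signature")
    claims = json.loads(_b64url_decode(signing_input.split(".")[1]))
    if now >= claims["exp"]:
        raise ValueError("token expired")
    return claims


token = issue_token(b"demo-secret", "agent:checkout", now=int(time.time()))
```

A token issued at time `t` verifies until `t + 899` seconds and is rejected from `t + 900` onward, at which point the client would use its refresh token to get a new one.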

Isolation

Row-level multi-tenant

Postgres RLS, per-org keys, departments, scoped API keys.

Audit

Every read & write

Append-only audit log with export, plus elevation approval workflow.

Secrets

Vault + Service Gateway

Hierarchical namespaces and versioning. Agents act through the gateway and never hold keys.

Compliance

SOC 2 readiness

Controls mapped from day one. HIPAA/BAA on Enterprise.

Residency

US & EU regions

Pin your data where your customers expect it.

Self-host

On-prem option

Docker images for VPC, air-gapped, or fully on-prem deployments.

Pricing

Simple plans that grow with your agents.

All plans include MCP, REST API, encryption at rest, and "talks like AI" auto-formatting.

Starter

Solo builders

$29/mo

For one developer running agents day-to-day.

Start free trial

✓ 1 seat
✓ 5 GB encrypted storage
✓ 2M embedding tokens / mo
✓ Up to 5,000 memories
✓ MCP + REST API
✓ Talks like AI auto-formatting
✗ Departments, audit export, SSO (not included)

Pro

Teams shipping agents

$100/mo

Everything a team needs to run shared agentic memory.

Start free trial

✓ 5 seats included (+$15/seat)
✓ 50 GB encrypted storage
✓ 25M embedding tokens / mo
✓ 100,000 memories
✓ Departments & bank sharing
✓ Service Gateway (Cloudflare, AWS)
✓ Audit log export, webhooks
✓ SSO (Google / Microsoft)

Enterprise

Regulated & scale

Custom

Compliance, on-prem, dedicated infra.

Talk to sales

✓ Unlimited seats & banks
✓ Dedicated single-tenant infra
✓ SAML SSO + SCIM provisioning
✓ SOC 2 Type II report & HIPAA BAA
✓ US / EU data residency
✓ On-prem / air-gapped deploy
✓ 99.95% uptime SLA
✓ Dedicated support channel

Overage on Pro: $0.50 per additional GB · $1.50 per additional 1M embedding tokens. No charge for memory reads.
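
As a worked example of the Pro numbers above, the sketch below estimates a monthly bill. It assumes the extra-seat fee is monthly and that overage is billed linearly rather than rounded up to whole units, neither of which the page states; the function name is illustrative.

```python
PRO_BASE_USD = 100.00          # includes 5 seats, 50 GB, 25M embedding tokens
INCLUDED_SEATS = 5
INCLUDED_GB = 50
INCLUDED_TOKENS_M = 25
EXTRA_SEAT_USD = 15.00
OVERAGE_PER_GB_USD = 0.50
OVERAGE_PER_M_TOKENS_USD = 1.50


def pro_monthly_cost(seats: int, storage_gb: float, embedding_tokens_m: float) -> float:
    """Estimate a Pro bill; assumes overage is billed linearly, not rounded up."""
    extra_seats = max(0, seats - INCLUDED_SEATS)
    extra_gb = max(0.0, storage_gb - INCLUDED_GB)
    extra_tokens_m = max(0.0, embedding_tokens_m - INCLUDED_TOKENS_M)
    return (
        PRO_BASE_USD
        + extra_seats * EXTRA_SEAT_USD
        + extra_gb * OVERAGE_PER_GB_USD
        + extra_tokens_m * OVERAGE_PER_M_TOKENS_USD
    )


# 7 seats, 60 GB, 30M tokens: 100 + 2*15 + 10*0.50 + 5*1.50
print(pro_monthly_cost(7, 60, 30))  # 142.5
```

Usage at or below the included amounts stays at the $100 base price. The 100,000-memory cap is not modeled here.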

Frequently asked

What does "talks like AI by default" actually mean?

Whatever you save — a one-line note, a pasted Slack thread, a half-finished thought — gets auto-rewritten into AI-optimized form: typed (decision/pattern/constraint), summarized, tagged, and embedded. So when an agent retrieves it later, it lands in context windows ready to use, not as raw mush.
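
One way to picture the result of that rewrite is as a typed record. The sketch below is a hypothetical shape only: the `MemoryRecord` class, its field names, and the example tags are illustrative, and the rewriting itself (summarizing, tagging, embedding) happens inside the service and is not shown.

```python
from dataclasses import dataclass, field
from typing import List, Optional

MEMORY_TYPES = {"decision", "pattern", "constraint"}  # the types named above


@dataclass
class MemoryRecord:
    """Hypothetical shape of a stored memory; field names are illustrative."""
    type: str
    summary: str
    tags: List[str] = field(default_factory=list)
    importance: Optional[int] = None
    embedding: List[float] = field(default_factory=list)  # filled by an embedding model

    def __post_init__(self) -> None:
        if self.type not in MEMORY_TYPES:
            raise ValueError(f"unsupported memory type: {self.type}")

    def to_context(self) -> str:
        """Render the compact line an agent would see in its context window."""
        if self.importance is None:
            label = self.type
        else:
            label = f"{self.type} · importance {self.importance}"
        return f"[{label}] {self.summary}"


record = MemoryRecord(
    type="decision",
    summary="Chose Stripe over Adyen. Adyen required a $200k/yr minimum and "
            "lacked the sub-account model required for multi-tenant payouts.",
    tags=["payments", "vendor-selection"],  # example tags, not from the product
    importance=9,
)
print(record.to_context())
```

Keeping the type and importance in a short bracketed prefix mirrors the terminal demo at the top of the page and lets an agent skim retrieved memories cheaply.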

Which models and agent frameworks do you support?

Any model or framework that can speak MCP or make an HTTP call. There is native MCP for Claude Code, plus a clean REST API for OpenAI, Gemini, Llama, Cursor, Cline, LangChain, and custom agents.

How is this different from a vector DB?

A vector DB stores embeddings. Memory System is the full memory layer above it: typed memories, summarization, departments, audit, encryption, MCP tooling, a service gateway, and the AI-formatting pipeline. We use pgvector underneath — you don't have to think about it.
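
To show the difference in miniature, the sketch below ranks memories by cosine similarity, which is roughly what a bare vector lookup gives you, and then adds a filter on memory type as one example of what a layer above it can do. The three-dimensional vectors are toy values, and the `search` function is hypothetical rather than the actual `memory_search` implementation.

```python
import math
from typing import Dict, List, Optional


def cosine(a: List[float], b: List[float]) -> float:
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def search(query_vec: List[float], memories: List[Dict],
           type_filter: Optional[str] = None, top_k: int = 3) -> List[Dict]:
    """Rank memories by similarity to the query, optionally restricted to one type."""
    candidates = [m for m in memories if type_filter is None or m["type"] == type_filter]
    ranked = sorted(candidates, key=lambda m: cosine(query_vec, m["vec"]), reverse=True)
    return ranked[:top_k]


# Toy embeddings; a real system would get these from an embedding model.
memories = [
    {"type": "decision", "text": "Chose Stripe over Adyen", "vec": [0.9, 0.1, 0.0]},
    {"type": "constraint", "text": "Never log raw PAN or CVV", "vec": [0.1, 0.9, 0.0]},
    {"type": "pattern", "text": "Rate-limit Stripe webhook to 50/s", "vec": [0.7, 0.0, 0.7]},
]

top = search([1.0, 0.0, 0.0], memories, top_k=2)
constraints = search([1.0, 0.0, 0.0], memories, type_filter="constraint")
```

With these toy vectors, the unfiltered query returns the decision first and the pattern second, while the filtered query returns only the constraint even though it is the least similar item.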

Can my agent run actions, or just read/write memory?

Both. The Service Gateway lets agents execute scoped API actions (Cloudflare DNS, AWS, more coming) without ever holding the credentials themselves. Sensitive actions go through an elevation/approval flow.
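
The general pattern can be sketched as follows: the gateway holds the secret, checks the agent's scopes, routes sensitive actions to approval, and logs every attempt. Everything here is a toy under stated assumptions: the `Gateway` class, the `service:action` scope format, and the fake DNS handler are hypothetical and do not reflect the real Service Gateway interface.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Set


@dataclass
class Gateway:
    """Toy credentials gateway: holds secrets, checks scopes, records every call."""
    secrets: Dict[str, str]                            # service -> credential, never returned
    handlers: Dict[str, Callable[[str, dict], dict]]   # action -> fn(secret, params)
    sensitive: Set[str] = field(default_factory=set)   # actions that need approval
    audit_log: List[dict] = field(default_factory=list)

    def execute(self, agent: str, scopes: Set[str], action: str, params: dict,
                approved: bool = False) -> dict:
        service = action.split(":", 1)[0]
        if action not in scopes:
            outcome = "denied"
        elif action in self.sensitive and not approved:
            outcome = "needs_elevation"
        else:
            outcome = "ok"
        # Every attempt is logged, including denied ones.
        self.audit_log.append({"agent": agent, "action": action, "outcome": outcome})
        if outcome != "ok":
            return {"status": outcome}
        # The secret is used here, inside the gateway, and is not part of the result.
        result = self.handlers[action](self.secrets[service], params)
        return {"status": "ok", "result": result}


def fake_dns_update(secret: str, params: dict) -> dict:
    # Stand-in for a real provider call; a real handler would send `secret` upstream.
    return {"record": params["name"], "updated": True}


gateway = Gateway(
    secrets={"cloudflare": "cf-secret-token"},
    handlers={"cloudflare:dns.update": fake_dns_update},
    sensitive={"cloudflare:dns.update"},
)
```

Calling `gateway.execute(...)` without approval returns a `needs_elevation` status for the sensitive action. With `approved=True` it returns the handler's result, and the credential never appears in what the agent receives.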

Where is data hosted?

AWS us-east-1 by default. Pro and Enterprise customers can pin their data to the EU. Enterprise customers can also deploy on-prem or in their own VPC.

Do my memories train your models?

No. Never. Your memories are encrypted at rest with per-org keys. We don't train on customer data, and we don't share it with model providers beyond the embedding/summarization calls you've explicitly enabled.