What is Cloudflare Agent Memory?

Agent Memory is a managed service announced by Cloudflare on April 17, 2026 that extracts information from AI agent conversations and makes it available when needed, so agents can maintain persistent recall without filling the model's context window.

How can developers access Agent Memory?

It is currently in private beta. Developers can join via a waitlist and, once approved, access it either through a binding inside a Cloudflare Worker or through a REST API for use outside the Workers ecosystem.

Who owns the data stored in Agent Memory?

Cloudflare states that the data belongs to the customer. Every memory is exportable, and the company says users can take the knowledge their agents accumulate with them if their needs change.

Cloudflare Launches Agent Memory Private Beta to Give AI Agents Persistent Recall

Cloudflare has opened the private beta of Agent Memory, a managed service designed to give AI agents a durable form of recall that sits outside the model's context window. Announced on April 17, 2026 as part of Cloudflare's Agents Week, the release targets one of the most persistent bottlenecks in agentic AI: models that forget what they learned the moment a session ends or tokens run out.

What the service does

Agent Memory captures facts, preferences, and decisions from ongoing agent conversations, runs them through a multi-stage ingestion pipeline (extraction, verification, classification, and storage), and surfaces them back to the agent on demand. Cloudflare engineers Tyson Trautmann and Rob Sutter described the goal plainly: "It gives AI agents persistent memory, allowing them to recall what matters, forget what doesn't, and get smarter over time."

Retrieval uses five parallel channels, including full-text search, fact-key lookup, raw message search, direct vector search, and HyDE vector search. Under the hood, the service is built on Cloudflare primitives already familiar to Workers developers: Durable Objects for isolation and storage, Vectorize for embeddings, Workers AI for inference, and SQLite-backed storage.

Why it matters

Even with frontier models now offering large context windows, system prompts, tools, and recent history can consume a significant share of the available tokens, leaving less room for the conversation itself. Offloading durable knowledge into a separate memory layer allows long-running agents (coding assistants, background workers, autonomous research pipelines) to stay focused on the task in front of them without constantly relitigating earlier decisions.

The launch explicitly targets coding agents such as Claude Code and OpenCode, as well as custom agent harnesses and teams that want multiple agents to share a common memory store. Developers can call Agent Memory through a binding inside a Cloudflare Worker or through a REST API for workloads that live elsewhere.

Data portability and rollout

Cloudflare is pitching Agent Memory against both open-source memory libraries and the built-in memory features shipped by some model providers. A key selling point is portability: the company says every memory is exportable, framing the service as "managed, but your data is yours."

The private beta opened on April 17, and Cloudflare says it plans to make the service publicly available soon. Pricing has not yet been published.

Implications

Agent Memory is a small piece of infrastructure that hints at a larger shift: as agents take on longer-horizon work, the interesting engineering problem is no longer squeezing more tokens into a prompt, but deciding what an agent should remember, what it should forget, and where that knowledge should live. By turning memory into a networked service with its own retrieval pipeline, Cloudflare is betting that the next generation of AI agents will look less like chatbots with long context windows and more like stateful applications running on a new kind of cloud.

Cloudflare Launches Agent Memory Private Beta to Give AI Agents Persistent Recall

What the service does

Why it matters

Data portability and rollout

Implications

More in Tools

OpenAI Turns Codex Into a Super App With Computer Use, Atlas Browser, and Image Generation

Anthropic Launches Claude Design, Turning Text Prompts Into Slides, Prototypes and One-Pagers

Google Brings AI Mode Side-by-Side With Web Pages in Chrome