Skip to content
All guidesAI systems

AI memory systems for businesses: what you actually need

Vector databases, RAG, knowledge graphs: the jargon makes AI memory sound like something every business must buy. Here is what each one is, and how to match it to your size instead of the hype.

7 min read

AI tools have a strange limitation: on their own, they remember nothing. Ask a model a question today and it has no memory of what you told it yesterday. Everything it knows is either baked in from training or sitting in front of it right now. AI memory is the catch all term for the ways you give a system lasting knowledge and recall. The trap is assuming you need the most advanced version. Most businesses do not, and this is how to tell what you actually need.

A useful way to picture it

Andrej Karpathy, one of the clearest voices in the field, describes a large language model as a kind of computer. The model is the processor, and its context window, the text you give it in a single go, is the working memory, the RAM. RAM is fast but small, and it forgets the moment you close the program. Anything you want the system to remember for longer has to live somewhere else and be loaded back in when it is needed, the way a computer reads from its hard drive.

That picture cuts through most of the confusion. AI memory is really just the question of where your business knowledge lives when it is not in the model's working memory, and how the right piece gets pulled back in at the right moment.

The levels, from simplest to heaviest

Level 0: no memory

Every conversation starts from scratch. You paste in what the model needs and it answers. For one off tasks, drafting, summarising, brainstorming, this is completely fine and costs nothing extra. A lot of AI use in a small business sits here and should.

Level 1: built in memory

The mainstream tools now remember a little for you. ChatGPT and Claude have project spaces, custom instructions and saved context. You tell them about your business once and they carry it across chats. For a small team, this plus a shared document of key facts covers more than people expect.

Level 2: a simple knowledge store

You keep your important information in an organised place, a set of documents, a Notion space, a few well structured files, and load the relevant parts into the model when you need them. No special database, just good housekeeping. This is the sweet spot for most growing businesses, and it is where we keep our own systems.

Level 3: retrieval with a vector database

When your knowledge is too big to paste in, and search needs to understand meaning rather than exact words, you reach for retrieval, often called RAG. Your documents are turned into lists of numbers that capture their meaning, called embeddings, and stored in a vector database. When a question comes in, the system finds the passages whose meaning is closest and feeds only those to the model. This is what lets a chatbot answer accurately from a thousand page manual.

Level 4: hybrid and graph memory

The heaviest setups combine several methods: meaning based search, keyword search, structured records, and sometimes a knowledge graph that maps how things relate. They add per user memory, monitoring and governance. This is real engineering, justified only when AI is central to a large operation.

So what is a vector database, in plain terms?

A normal database finds things by exact match: show me invoice 4021. A vector database finds things by similarity of meaning: show me the passages about late payments, even if they never use those exact words. It does that by storing the embeddings mentioned above and comparing them mathematically. Names you will hear include Pinecone, Qdrant and Weaviate, and pgvector, which bolts onto an ordinary Postgres database you may already run.

Matching the system to your business

Three things decide what you need: how much knowledge the AI has to draw on, how often it changes, and how many people lean on it. Roughly:

  • Solo or a handful of people, light AI use: stay at Level 0 to 2. Built in memory, saved prompts and a shared context document. No database, close to zero cost.
  • Around five to twenty people, AI in daily use, a growing knowledge base: a well organised knowledge store, and only if plain search starts failing, a light retrieval layer using pgvector on a database you already have. Climb one level, not three.
  • Twenty to a hundred people, AI wired into real workflows, a large or fast changing body of knowledge: a proper retrieval pipeline with a vector database, document ingestion and re ranking. Now the investment pays back.
  • AI central to a large operation: hybrid retrieval, possibly a knowledge graph, per user memory, monitoring, and someone who owns it. A real system with a real budget.

The mistake to avoid

The common error is buying the Level 3 or 4 setup because it sounds serious, when the business is a Level 1 problem. A vector database does not make a small company more capable, it makes it more expensive and harder to maintain. Start at the simplest level that works, and climb only when you hit a wall you can actually feel: search returning rubbish, knowledge too big to handle, real answers going wrong.

If you are not sure which level you are at, that is exactly the kind of thing worth an hour of straight advice before you spend on infrastructure. Getting this right early saves a lot of money later.