Schrodinger's Memory: Large Language Models

Abstract

Memory is the foundation of all human activities; without memory, it would benearly impossible for people to perform any task in daily life. With thedevelopment of Large Language Models (LLMs), their language capabilities arebecoming increasingly comparable to those of humans. But do LLMs have memory?Based on current performance, LLMs do appear to exhibit memory. So, what is theunderlying mechanism of this memory? Previous research has lacked a deepexploration of LLMs' memory capabilities and the underlying theory. In thispaper, we use Universal Approximation Theorem (UAT) to explain the memorymechanism in LLMs. We also conduct experiments to verify the memorycapabilities of various LLMs, proposing a new method to assess their abilitiesbased on these memory ability. We argue that LLM memory operates likeSchr\"odinger's memory, meaning that it only becomes observable when a specificmemory is queried. We can only determine if the model retains a memory based onits output in response to the query; otherwise, it remains indeterminate.Finally, we expand on this concept by comparing the memory capabilities of thehuman brain and LLMs, highlighting the similarities and differences in theiroperational mechanisms.

Quick Read (beta)

loading the full paper ...