Essential insights from Hacker News discussions

Knowledge and memory

Here's a summary of the main themes discussed in the Hacker News thread, illustrated with direct quotes from the commenters:

The Nature and Storage of Human Memory

A significant portion of the discussion revolves around whether the brain "stores" memories in a way analogous to computers, and if so, how. Some users question the necessity of direct storage, suggesting that intelligence might arise from on-the-fly reconstruction of information.

  • ClaraForm argues, "I'm not convinced the brain stores memories, or that memory storage is required for human intelligence. And we 'hallucinate' all the time." They further elaborate, "It's possible the brain is similar, on a much more refined scale. My brain certainly doesn't store 35,000 instances of my mum's image to help me identify her, just an averaged image to help me know when I'm looking at my mum."
  • n4r9 counters this by stating, "What do you mean, you're not convinced that the brain stores memories? What is happening in the brain when you have an experience and later recall that same experience? It might not be perfect recall but something is being stored."
  • ckemere, identifying as a memory neuroscientist, touches on established distinctions: "A distinction between semantic (facts/concepts) and episodic (specific experiences) declarative memories is fairly well established since at least the 1970s. That the latter is required to construct the former is also long posited, with reasonable evidence."
  • Regarding the colloquial use of "memory," mallowdram states, "The terms are arbitrary and don't relate to human memory. Using the term 'I know this' is a post-hoc retrofit to a process exclusively accessed wordlessly. Just as 'I remember this' does not access that process, but rather comments on the aftermath pretty unscientifically, like a sportscaster trying to read a pitcher's mind."

LLMs vs. Human Cognition and Hallucinations

A central theme is the comparison of Large Language Models (LLMs) with human cognitive processes, particularly in understanding why LLMs "hallucinate" (generate factually incorrect or nonsensical information). Many users emphasize the fundamental differences between the two.

  • HarHarVeryFunny highlights a key distinction: "Humans can generally differentiate between when they know something or not, and I'd agree with the article that this is because we tend to remember how we know things, and also have different levels of confidence according to source." They add, "To the LLM all text is just statistics, and it has no personal experience to lean on to self-check and say 'hmm, I can't recall ever learning that - I'm drawing blanks'." (A minimal code sketch of this "all text is just statistics" point follows the list below.)
  • The same user also pushes back against direct comparisons: "Frankly it's silly to compare LLMs (Transformers) and brains. An LLM was only ever meant to be a linguistics model, not a brain or cognitive architecture."
  • Muromec's initial experience illustrates the problem: "I guess it merged two tokens while learning the text." They also faced a situation where a corrected LLM still provided a hallucinated quote, noting, "Amazingly it also knows about the difference between two constants, but refers to the wrong one in both calculations and in hallucinating the quote."
  • Gobdovan suggests that analogies can be misleading: "The article you shared raises an interesting point by comparing human memory with LLMs, but I think the analogy can only go so far. They're too distinct to explain hallucinations simply as a lack of meta-cognition or meta-memory. These systems are more like alien minds, and allegories risk introducing imprecision when we're trying to debug and understand their behavior."
  • Burnte offers a pragmatic view on LLM output: "I agree with the folks who call these screwups rather than hallucinations because the point of LLMs isn't to be right, it's to be statistically highly likely. If making something up fits that model, then that's what it will do. That's literally how it works."
  • Juancroldan provides a technical perspective: "The fundamental limitation of LLMs is that they represent knowledge as parametric curves, and their generalization is merely interpolation of those curves. This can only ever produce results that correlate with facts (training data), not ones that are causally derived from them, which makes hallucinations inevitable. Same as with human memory."
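
To make the "all text is just statistics" point concrete, here is a minimal sketch, not from the thread; it assumes the Hugging Face transformers and torch packages and uses the small gpt2 checkpoint purely as an illustration. A causal language model exposes only a probability distribution over next tokens; nothing in that interface separates "I know this" from "this merely sounds likely", which is the gap HarHarVeryFunny and Burnte describe.

```python
# A minimal sketch, assuming the `transformers` and `torch` packages are installed.
# The "gpt2" checkpoint is only an illustrative choice; any causal LM would do.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

prompt = "The first person to walk on Mars was"
inputs = tokenizer(prompt, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits[0, -1]   # scores for the next token only
probs = torch.softmax(logits, dim=-1)

# The model always returns a full distribution over its vocabulary. A fluent,
# confident-looking continuation can still be wrong: "statistically likely"
# is the only signal available, with no separate "do I actually know this?" check.
top = torch.topk(probs, k=5)
for p, idx in zip(top.values, top.indices):
    print(f"{tokenizer.decode(idx.item())!r}: {p.item():.3f}")
```

Sampling from that distribution is all that generation is, so a confident-sounding but wrong continuation is not a malfunction; it is the expected behavior Burnte calls a "screwup" rather than a hallucination.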

The Origins and Design of Transformer Models

Several participants delve into the history and technical motivations behind the development of Transformer architectures, pushing back against the idea that they were designed with AGI or brain-like emulation in mind.

  • HarHarVeryFunny clarifies the historical context: "The motivation for the Transformer was to build a more efficient and scalable language model by using parallel processing, not sequential (RNN/LSTM), to take advantage of GPU/TPU acceleration." They cite Jakob Uszkoreit's conceptual role and focus on linguistic principles: "There were two key ideas, originating from the way linguists use syntax trees to represent the hierarchical/grammatical structure of a sentence." (A minimal sketch of this parallel, non-sequential computation follows the list below.)
  • They also note the unexpected scaling effects: "Even at this stage, nobody was saying 'if we scale this up it'll be AGI-like'. It took multiple steps of scaling... for there to be a growing realization of how good a next-word predictor, with corresponding capabilities, this architecture was when scaled up."
  • DavidSJ links to a 1999 paper by Mahoney, suggesting an earlier conceptual link between compression, prediction, and intelligence. HarHarVeryFunny, however, questions the relevance of this point to the Transformer's specific design motivations.
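
As a companion to the parallelism point above, here is a minimal, illustrative sketch of scaled dot-product self-attention, the core Transformer operation, written in plain NumPy with arbitrary shapes and random weights (none of this is from the thread). The detail relevant to the scaling story is visible in the code: every position attends to every other position through one batched matrix product, so the whole sequence is processed at once rather than token by token as in an RNN/LSTM.

```python
# A minimal sketch of scaled dot-product self-attention in plain NumPy.
# Shapes and weights are arbitrary; this is an illustration, not a full Transformer.
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """X: (seq_len, d_model); Wq, Wk, Wv: (d_model, d_head). Returns (seq_len, d_head)."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    scores = Q @ K.T / np.sqrt(K.shape[-1])          # (seq_len, seq_len): all pairs at once
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)   # softmax over the sequence dimension
    return weights @ V                               # every output position computed together

rng = np.random.default_rng(0)
seq_len, d_model, d_head = 6, 16, 8
X = rng.normal(size=(seq_len, d_model))
Wq, Wk, Wv = (rng.normal(size=(d_model, d_head)) for _ in range(3))
out = self_attention(X, Wq, Wk, Wv)
print(out.shape)   # (6, 8): one pass over the whole sequence, no recurrence
```

This also shows why the architecture suits GPU/TPU hardware: the work is a handful of large matrix multiplications rather than a long chain of sequential state updates.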

The Role of Written Knowledge and Data

The discussion touches on the importance of externalizing knowledge through writing and documentation as a means to improve AI systems and facilitate knowledge transfer.

  • Devstein emphasizes this point: "I believe this is why the importance of written (human) knowledge is only increasing, especially internally at companies. Written knowledge (i.e. documentation) has always served as a knowledge cache and a way to transfer knowledge between people." They conclude, "Without fundamental changes to the LLMs or the way we think about agentic systems, high quality, comprehensive written knowledge is the best path to helping agents 'learn' over time." (A rough sketch of this "knowledge cache" idea follows below.)
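
To illustrate what "documentation as a knowledge cache" can mean mechanically, here is a minimal sketch, not from the thread: the document snippets and the naive word-overlap scoring are made-up stand-ins for the embedding-based retrieval most agent systems actually use. The shape of the idea is simply "look the answer up in the written knowledge, then hand it to the model."

```python
# A minimal, self-contained sketch of "documentation as a knowledge cache".
# The docs and the word-overlap scoring are illustrative stand-ins; real agent
# systems typically use embedding similarity and a vector store instead.

DOCS = {
    "deploy.md": "Deploys go through CI. Run the release pipeline; never push to prod by hand.",
    "oncall.md": "The on-call engineer rotates weekly and owns incident triage.",
    "style.md": "All services log in JSON and use UTC timestamps.",
}

def retrieve(question: str) -> tuple[str, str]:
    """Return the (name, text) of the doc sharing the most words with the question."""
    q_words = set(question.lower().split())
    def overlap(item):
        _, text = item
        return len(q_words & set(text.lower().split()))
    return max(DOCS.items(), key=overlap)

def build_prompt(question: str) -> str:
    """Inject the retrieved documentation into the prompt an agent would send to an LLM."""
    name, text = retrieve(question)
    return f"Context from {name}:\n{text}\n\nQuestion: {question}\nAnswer using only the context above."

print(build_prompt("How do we push a change to prod"))
```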

Unresolved Mysteries of Biological Memory

Commenters acknowledge that the physical substrate of memory in the brain is still not fully understood, and draw a parallel between this open question and the emergent complexity of LLMs.

  • A quote attributed to "the author" (presumably of the article being discussed) states: "I’ll remind you that biologists do not, in the year 2025, know memory’s physical substrate in the brain! Plenty of hypotheses — no agreement. Is there any more central mystery in human biology, maybe even human existence?"
  • Mallowdram strongly disagrees with this characterization: "This is patently false. 'A hypothesis is very distinct from theoretical knowledge. A hypothesis lacks empirical evidence. A theory uses empirical information.'" They direct users to specific books for a more accurate understanding of current neuroscience research.
  • Norseboar defends the original statement's intent: "I don't think it shows that the author lacks the ability to discern state of the art research or is making wildly unsupported statements, I think they were using plain-English terms to describe a state where there's a lot of uncertainty about the physical mechanism."

General Reflections on AI and Human Nature

Some users offer broader philosophical reflections on the implications of AI development and human understanding.

  • Tolerance expresses gratitude for the ongoing scientific inquiry: "You know, whatever memory is or where it’s at and however the mind works, I’m grateful I’ve got mine intact right now and I appreciate science’s inability to zero in on these things." They also note a growing appreciation for such introspection: "It’s nice to know that this sort of appreciation is becoming more common. Somewhere between tech accelerationism and protestant resistance are those willing to re-interrogate human nature in anticipation of what lies ahead."
  • Alessandru expresses skepticism about the quality of AI commentary: "let's stop taking opinions on ai from randoms. please. they haven't a fkin clue."