Notable theory

User as Engram: Internalizing Per-User Memory as Local Parametric Edits

Bojie Li

Published
Jun 17, 2026 — 15:15 UTC

Problem
The paper addresses the limitations of current personalization methods in language models, which typically store user-specific facts externally, leading to inefficiencies and potential contamination of unrelated text. Existing approaches, such as per-user LoRA adapters, merge content and reasoning skills into a single global weight delta, which can degrade performance. This work proposes a new framework, “User as Engram,” to internalize user memory as localized edits, thereby maintaining the integrity of the model’s reasoning capabilities. The authors note that this is a preprint and has not undergone peer review.

Method
The proposed architecture, User as Engram, separates user-specific content from shared reasoning skills. It employs a hash-keyed memory table to store user facts as surgical edits, allowing for precise retrieval without altering the underlying model weights. The reasoning skills are maintained in a shared adapter, ensuring that the model’s performance remains consistent across users. The memory footprint is significantly reduced, achieving a 33,000x smaller size compared to traditional methods. The model’s design allows for disjoint hash slots for different users, enabling additive and lossless composition of edits. The retrieval mechanism is optimized to prevent growth in search space, outperforming traditional retrieval pipelines when the number of facts exceeds approximately 100.

Results
The User as Engram model demonstrates a 5.6x improvement in indirect reasoning accuracy compared to per-user LoRA adapters, while ensuring that no user experiences a decline in reasoning performance relative to the untouched base model. The efficiency of the memory retrieval process is highlighted, with the Engram table surpassing a retrieval pipeline in a 2.5x larger model after a threshold of 100 facts. These results indicate a significant advancement in both memory efficiency and reasoning capabilities in personalized language models.

Limitations
The authors acknowledge that the approach is still in the early stages of development and may require further validation through extensive empirical testing. Additionally, the reliance on a hash-keyed memory table may introduce challenges in scalability and collision management as the user base grows. The paper does not address potential issues related to the complexity of managing the surgical edits or the implications of user privacy in storing personalized data.

Why it matters
This work has significant implications for the future of personalized AI systems, particularly in enhancing user interaction and experience by allowing models to retain and utilize personal information more effectively. The separation of content and reasoning skills could lead to more robust and adaptable language models, paving the way for applications in personalized education, customer service, and beyond. The findings are relevant for researchers and engineers looking to improve personalization in AI, as discussed in detail in the paper available on arXiv.

Turing Wire

By Callan Zhang · Jun 17, 2026 · Editorial standards →

Summarised from the primary source with AI assistance under human editorial oversight. Turing Wire is not a primary source — read the original for the authoritative account.

Source: arXiv cs.AI