Notable training methods

Categorical Prior Lock-in: Why In-Context Learning Fails for Structured Data

Antonio Pelusi, Stefano Braghin, Alberto Trombetta

Published
Jun 10, 2026 — 11:41 UTC

Problem
This work addresses the limitations of in-context learning (ICL) in large language models (LLMs) when applied to structured data, particularly under conditions of distribution mismatch. The authors highlight a specific failure mode termed “categorical prior lock-in,” where the model’s prior over token distributions, inherited from pre-training, cannot be effectively updated. This issue is particularly pronounced in high-cardinality tabular data, which serves as a controlled test case. The paper is a preprint and has not undergone peer review.

Method
The authors conduct experiments using two 7B-parameter open-weight models to evaluate the performance of ICL in generating structured data. They analyze the impact of providing additional examples on numerical fidelity and categorical distribution reproduction. The study also explores parameter-efficient fine-tuning techniques, specifically Low-Rank Adaptation (LoRA), to mitigate the limitations of ICL. The training setup includes a focus on high-cardinality categorical data, with the models evaluated on their ability to adapt to new distributions without parameter updates.

Results
The findings reveal that while ICL can enhance numerical fidelity with more examples, it reaches a sharp ceiling in reproducing categorical distributions, particularly failing to generate rare classes entirely. In contrast, LoRA fine-tuning demonstrates improved adaptability but introduces risks of memorization and potential destabilization of structured output generation. The authors provide quantitative results, indicating that ICL’s performance plateaus significantly below that of LoRA, although specific numerical metrics and baseline comparisons are not detailed in the abstract.

Limitations
The authors acknowledge that while LoRA addresses some limitations of ICL, it also poses risks related to memorization and output stability. They do not explore the implications of these risks in depth, nor do they provide extensive analysis on the trade-offs between adaptability and privacy. Additionally, the study is limited to high-cardinality tabular data, which may not generalize to other structured data types or lower-cardinality scenarios.

Why it matters
This research has significant implications for the deployment of LLMs in applications requiring structured data generation, such as database management and automated reporting. The identification of categorical prior lock-in underscores the need for improved methodologies in adapting LLMs to new data distributions, particularly in high-cardinality contexts. The trade-offs highlighted between adaptability and privacy in fine-tuning approaches like LoRA suggest a critical area for future research, as practitioners must balance performance with the risks of data leakage. This work contributes to the ongoing discourse on the limitations of ICL in structured data contexts, as published in arXiv cs.AI.

Turing Wire

By Turing Wire editorial staff · Jun 10, 2026 · Editorial standards →

Source: arXiv cs.AI