Major efficiency inference

In-Context Graphical Inference

Zehua Cheng, Wei Dai, Jiahao Sun

Published: Jun 3, 2026 — 16:04 UTC

Problem
The paper addresses the limitations of existing marginal inference methods in discrete graphical models, particularly the trade-off between exactness and scalability. Exact algorithms become intractable for high-treewidth graphs, while iterative approximations like Belief Propagation and variational methods often lack convergence guarantees on frustrated topologies. This work is a preprint and has not undergone peer review, highlighting the need for improved inductive biases in inference methods.

Method
The authors propose In-Context Graphical Inference (ICG-I), an autoregressive Graph Transformer designed to restore the sequential elimination structure inherent in exact inference. ICG-I employs learned, Tensor-Train (TT) compressed intermediate factors to mimic Variable Elimination, which enhances the efficiency of the inference process. The architecture includes a Dirichlet output layer that facilitates calibrated, distribution-free coverage guarantees under topological shifts. The authors provide theoretical proofs demonstrating that TT compression errors propagate linearly through the autoregressive chain, that the Dirichlet-Multinomial loss functions as a proper scoring rule, and that Weighted Conformal Prediction (WCP) maintains coverage with quantifiable degradation under estimated density ratios. The training process and computational requirements are not explicitly detailed, but the focus on autoregressive modeling suggests a significant computational overhead typical of transformer architectures.

Results
ICG-I achieves state-of-the-art performance across multiple benchmarks. Specifically, it reduces Mean Absolute Error (MAE) from 0.041 (the best baseline) to 0.020 on standard instances. In the context of N=500 frustrated spin glasses, where traditional methods like Belief Propagation diverge, ICG-I achieves an MAE of 0.048, demonstrating its robustness in challenging scenarios. These results indicate a substantial improvement over existing methods, particularly in high-treewidth and frustrated topologies.

Limitations
The authors acknowledge that while ICG-I significantly improves inference accuracy, the reliance on TT compression may introduce limitations in scenarios with extreme density ratios, potentially affecting the robustness of the coverage guarantees. Additionally, the computational complexity associated with the autoregressive nature of the model may limit its scalability in very large graphs, although specific scalability metrics are not provided. The lack of peer review also raises questions about the reproducibility and generalizability of the results.

Why it matters
The introduction of ICG-I has significant implications for the field of graphical models and inference methods, particularly in applications requiring high accuracy in complex topologies. By addressing the inductive bias mismatch in existing methods, ICG-I paves the way for more reliable inference in high-dimensional settings. This work could influence future research directions in graphical models and machine learning, particularly in developing more efficient algorithms for marginal inference. For further details, see the full paper available on arXiv.

By Callan Zhang · Jun 3, 2026 · Editorial standards →

Summarised from the primary source with AI assistance under human editorial oversight. Turing Wire is not a primary source — read the original for the authoritative account.

Source: arXiv cs.LG