Provenance Tracking in AI Compilers through the Lens of Coalgebra
Zilu Tian, Liying Liu
- Published
- Jun 9, 2026 — 14:46 UTC
Problem
The paper addresses the challenge of tracking provenance in AI compilers, which often perform aggressive rewrites of computation graphs through normalization, lowering, and optimization. Existing methods for provenance tracking are either invasive or ad hoc, particularly under non-injective graph transformations. This gap in capability is critical, as reliable provenance is necessary for postprocessing, debugging, and validating compiler transformations. The work is presented as a preprint and has not undergone peer review.
Method
The authors propose a generative approach to provenance tracking that leverages observational semantics rather than traditional identifier propagation. Their method formalizes provenance tracking using a coalgebraic model and bisimulation, which allows for the preservation of provenance information even when intermediate nodes in the computation graph are eliminated. The implementation of this approach is realized in a prototype AI compiler named COVAN. The authors detail the architecture of COVAN, emphasizing its lightweight nature and minimal engineering overhead, which facilitates stable provenance tracking across various compilation pipelines.
Results
The prototype COVAN demonstrates effective provenance tracking capabilities, maintaining stability across compilation processes. While specific quantitative results are not provided in the abstract, the authors claim that their method outperforms existing solutions in terms of engineering overhead and reliability. The paper suggests that COVAN can handle complex graph transformations without losing provenance information, although exact metrics and comparisons to named baselines are not explicitly detailed in the provided text.
Limitations
The authors acknowledge that their approach may not be universally applicable to all types of graph transformations, particularly those that are highly complex or non-standard. They also note that while their method reduces engineering overhead, it may still require careful integration into existing compiler frameworks. Additionally, the lack of empirical performance metrics against established baselines limits the ability to fully assess the effectiveness of their approach.
Why it matters
This work has significant implications for the development of AI compilers, particularly in enhancing the reliability of provenance tracking, which is crucial for debugging and validating compiler optimizations. By providing a formalized, lightweight method for tracking provenance, the authors contribute to the broader field of AI compiler design, potentially influencing future research and development in this area. The approach could facilitate more robust AI systems by ensuring that transformations are traceable and verifiable, as discussed in related literature on compiler optimizations and provenance tracking, available on arXiv.
By Turing Wire editorial staff · Jun 9, 2026 · Editorial standards →
Source: arXiv cs.AI