From Tokens to Policy: Causal and Interpretable Heterogeneous Treatment Effects Identification
Riccardo Cadei, Frank Otchere, Nyasha Tirivayi, Gustavo Angeles Tagliaferro, Falco J. Bargagli-Stoffi, Francesco Locatello
- Published
- Jun 15, 2026 — 17:44 UTC
Problem
The paper addresses the gap in the identification of heterogeneous treatment effects (HTE) in controlled experiments, particularly in the context of policy optimization. Existing methodologies often compromise between expressivity and interpretability, leading to potential mischaracterizations of HTE when unmeasured heterogeneity drivers are present. This work is particularly relevant as it is a preprint and has not undergone peer review, indicating that the findings should be interpreted with caution.
Method
The authors propose a new framework for HTE identification that leverages extensive pre-treatment measurements, including multi-modal and multi-view data. They introduce Neural EXposure Interaction Search (NEXIS), which reformulates HTE identification as a Markov-blanket discovery problem. NEXIS employs an iterative procedure that ensures consistent selection of latent interactors, which are critical for causal interpretation. The method is designed to operate with minimal human supervision, enhancing scalability. The authors validate NEXIS empirically and provide theoretical guarantees regarding its performance.
Results
NEXIS was applied to two anti-poverty programs in Africa, where it incorporated satellite imagery to capture previously unmeasured environmental effect modifiers. The results demonstrated that NEXIS could yield novel and interpretable guidelines for optimizing the programs’ subsequent iterations. While specific numerical results and effect sizes are not detailed in the abstract, the authors claim that the method significantly improves upon existing baselines in terms of causal interpretability and policy relevance.
Limitations
The authors acknowledge that the reliance on extensive pre-treatment measurements may introduce challenges related to data collection and integration. Additionally, while NEXIS shows promise, the generalizability of the findings to other contexts or interventions remains to be established. The paper does not address potential biases that may arise from the use of satellite imagery or other multi-modal data sources, nor does it discuss the computational complexity of the NEXIS algorithm in large-scale applications.
Why it matters
This work has significant implications for the field of causal inference and policy optimization, particularly in contexts where understanding the impact of interventions is critical. By providing a method that enhances the interpretability of HTE identification, the authors contribute to the development of more effective and data-driven policy frameworks. The integration of multi-modal data sources, such as satellite imagery, opens new avenues for capturing environmental factors that influence treatment effects, which is crucial for designing targeted interventions. This research is foundational for future studies aiming to refine causal inference methodologies, as published in arXiv.
By Turing Wire editorial staff · Jun 15, 2026 · Editorial standards →
Source: arXiv cs.LG