Human Adults and LLMs as Scientists: Who Benefits from Active Exploration?
Mandana Samiei, Eunice Yiu, Anthony GX-Chen, Dongyan Lin, Jocelyn Shen, Blake A. Richards
- Published
- Jun 4, 2026 — 17:53 UTC
Problem
This study addresses the “conjunctive handicap” in causal learning, where adults struggle to identify conjunctive causal rules compared to disjunctive ones. Previous research primarily utilized passive observation paradigms, limiting the understanding of how agency through active exploration might influence causal reasoning. The authors aim to fill this gap by examining whether allowing adults to actively explore can mitigate this bias. This work is presented as a preprint, indicating it has not yet undergone peer review.
Method
The authors employed a modified “blicket detector” task, where adult participants were allowed to actively intervene to identify causal objects under both conjunctive and disjunctive rule structures. The experimental design involved measuring the number of tests participants conducted to infer causal relationships, thereby assessing the efficiency of their exploration strategies. The study also included a comparative analysis of human performance against various state-of-the-art large language models (LLMs) in the same task context. The models’ performance was evaluated based on hypothesis inference accuracy and exploration efficiency.
Results
The findings reveal that active exploration significantly enhances adults’ ability to reason about conjunctive causal rules, although they still require more tests to reach conclusions compared to disjunctive rules. Specifically, adults demonstrated improved performance in identifying conjunctive rules when allowed to explore actively, but the performance gap between conjunctive and disjunctive reasoning persisted. In comparison, while some LLMs achieved near-human levels of accuracy in hypothesis inference, they exhibited less efficient exploration strategies and maintained similar performance gaps between conjunctive and disjunctive tasks. The paper provides quantitative results, although specific numerical values are not detailed in the summary.
Limitations
The authors acknowledge that while active exploration improves conjunctive reasoning, it does not completely eliminate the inherent challenges associated with these types of causal rules. Additionally, the study’s reliance on a specific task (the blicket detector) may limit the generalizability of the findings to other contexts or more complex causal reasoning scenarios. The comparison with LLMs also raises questions about the models’ adaptability to different exploration strategies, which may not be fully captured in the experimental design.
Why it matters
This research has significant implications for understanding causal reasoning in both humans and artificial intelligence systems. By demonstrating that active exploration can enhance adults’ ability to identify conjunctive causal rules, the study suggests potential avenues for improving educational strategies and AI training methodologies. Furthermore, the comparative analysis with LLMs highlights the need for developing more efficient exploration strategies in AI systems, which could lead to advancements in their reasoning capabilities. This work contributes to the broader discourse on causal learning and exploration in AI, as published in arXiv cs.CL.
By Turing Wire editorial staff · Jun 4, 2026 · Editorial standards →
Source: arXiv cs.CL