Major agents robotics

MLEvolve: A Self-Evolving Framework for Automated Machine Learning Algorithm Discovery

Shangheng Du, Xiangchao Yan, Jinxin Shi, Zongsheng Cao, Shiyang Feng, Zichen Liang

Published
Jun 4, 2026 — 17:55 UTC

Problem
This paper addresses the limitations of existing machine learning engineering (MLE) agents, particularly their inability to effectively manage inter-branch information isolation, memoryless search processes, and lack of hierarchical control. These issues hinder the agents’ capabilities in long-horizon optimization tasks. The authors propose MLEvolve, a self-evolving framework designed to enhance the discovery of machine learning algorithms. This work is presented as a preprint and has not yet undergone peer review.

Method
MLEvolve employs a multi-agent architecture based on large language models (LLMs) and introduces several novel components to enhance algorithm discovery. The framework extends traditional tree search methods to a Progressive Multi-Contextual Graph Search (MCGS), which facilitates cross-branch information flow through graph-based reference edges. This allows the framework to transition from broad exploration to focused exploitation using an entropy-inspired progressive schedule. Additionally, MLEvolve incorporates a Retrospective Memory system that combines a cold-start domain knowledge base with a dynamic global memory, enabling the retrieval and reuse of task-specific experiences. The framework also decouples strategic planning from code generation through adaptive coding modes, which stabilizes long-horizon iterations. The training compute details are not disclosed, but the evaluation is conducted under a 12-hour budget, which is half the standard runtime for such tasks.

Results
MLEvolve demonstrates state-of-the-art performance on the MLE-Bench benchmark, achieving significant improvements in average medal rate and valid submission rate compared to existing methods. Specifically, it outperforms specialized algorithm discovery methods, including AlphaEvolve, in mathematical algorithm optimization tasks. The results indicate strong cross-domain generalization capabilities, although specific numerical performance metrics are not provided in the abstract.

Limitations
The authors acknowledge that while MLEvolve improves upon existing MLE agents, it may still face challenges related to the scalability of the memory systems and the efficiency of the search process in highly complex environments. Additionally, the reliance on LLMs may introduce biases inherent to the training data, which could affect the generalizability of the discovered algorithms. The paper does not address potential computational overhead associated with the dynamic memory systems or the implications of the entropy-inspired scheduling on convergence rates.

Why it matters
MLEvolve represents a significant advancement in the field of automated machine learning, particularly in addressing the limitations of existing MLE agents. By enabling sustained self-evolution and effective information sharing, this framework has the potential to enhance the efficiency and effectiveness of algorithm discovery processes. The implications of this work extend to various domains where automated machine learning is applicable, paving the way for more robust and adaptable systems. This research is crucial for future developments in MLE and is available on arXiv.

Turing Wire

By Turing Wire editorial staff · Jun 4, 2026 · Editorial standards →

Source: arXiv cs.CL