From Data to Action: Accelerating Refinery Optimization with AI
Dániel Pfeifer, Ábrahám Papp, Tibor Bernáth, Tamás Zoltán Varga, Márk Czifra, Botond Szilágyi
- Published
- May 14, 2026 — 17:07 UTC
Problem
This preprint addresses the gap in refinery optimization where traditional Linear Programming (LP) methods, despite their mathematical correctness, often yield results that are difficult to interpret and apply due to model simplifications and potential data supply errors. The authors highlight the need for enhanced decision-making tools that can provide insights beyond the LP solutions, particularly by leveraging historical data to improve trust in the optimization results.
Method
The core technical contribution involves the integration of machine learning techniques with LP outputs to enhance decision-making in refinery operations. The authors propose a modified version of the ECOD (Enhanced Change Detection) methodology, which is tailored for high-dimensional data analysis. The approach focuses on selecting the most informative pairs of features from the data, which are then analyzed using two 2D Anomaly Detection algorithms. These algorithms are designed to identify discrepancies and opportunities within the refinery scheduling and planning architecture, thereby providing actionable insights that complement the LP solutions.
Results
The proposed methodology was evaluated on real-world data from the MOL refinery. The authors report significant findings, including the identification of multiple business opportunities and data supply errors that were previously undetected by the LP model alone. While specific numerical results and effect sizes are not disclosed in the abstract, the authors emphasize that the integration of anomaly detection with LP outputs led to a marked improvement in operational insights, suggesting a substantial enhancement over traditional LP-only approaches.
Limitations
The authors acknowledge several limitations, including the reliance on the quality of historical data for training the anomaly detection models. They also note that the proposed methods may require further tuning and validation across different refinery contexts to ensure generalizability. Additionally, the computational overhead introduced by the machine learning components may pose challenges in real-time applications. An obvious limitation not explicitly mentioned is the potential for overfitting in high-dimensional spaces, which could lead to misleading anomaly detection results.
Why it matters
This work has significant implications for the field of refinery optimization and operational decision-making. By effectively combining LP with machine learning techniques, the authors provide a framework that not only enhances the interpretability of optimization results but also facilitates proactive management of refinery operations. The insights gained from anomaly detection can lead to improved efficiency, reduced operational risks, and better resource allocation. This approach could serve as a model for similar applications in other industrial domains where complex optimization problems are prevalent.
Authors: Dániel Pfeifer, Ábrahám Papp, Tibor Bernáth, Tamás Zoltán Varga, Márk Czifra, Botond Szilágyi, Edith Alice Kovács
Source: arXiv:2605.15085
URL: https://arxiv.org/abs/2605.15085v1
By Turing Wire editorial staff · May 14, 2026 · Editorial standards →
Source: arXiv cs.LG