基于Dueling DQN算法的原油调度优化应用研究

石油炼制与化工 ›› 2026, Vol. 57 ›› Issue (6): 131-139.

基于Dueling DQN算法的原油调度优化应用研究

王永豪,周智菊,赵毅,房韡

中石化石油化工科学研究院有限公司

收稿日期:2026-01-04 修回日期:2026-02-03 出版日期:2026-06-12 发布日期:2026-05-22
通讯作者: 房韡 E-mail:fangwei.ripp@sinopec.com

APPLICATION RESEARCH OF CRUDE OIL SCHEDULING OPTIMIZATION BASED ON THE Dueling DQN ALGORITHM

Received:2026-01-04 Revised:2026-02-03 Online:2026-06-12 Published:2026-05-22

摘要/Abstract

摘要： 聚焦于深度强化学习在原油调度中的应用，将调度过程建模为马尔可夫决策过程，并采用 Dueling DQN 算法对原油调度优化问题进行求解。针对调度场景中状态尺度不一、奖励分布不稳定等问题，设计了状态归一化与奖励标准化机制以提升训练稳定性与收敛效率；通过对卸油决策维度的合理简化，以降低动作空间复杂度。试验结果表明，所提出方法能够在满足多类工艺与操作约束的前提下生成稳定、可行且经济性良好的调度方案，验证了深度强化学习在复杂炼油厂原油调度优化任务中的有效性与应用潜力。

关键词: 原油调度, 深度强化学习, Dueling DQN, 调度优化

Abstract: Focusing on the application of deep reinforcement learning in crude oil scheduling, the scheduling process is modeled as a Markov decision process, and the Dueling DQN algorithm is adopted to solve the crude oil scheduling optimization problem. To address challenges such as disparate state scales and fluctuating reward distributions, state normalization and reward standardization mechanisms are designed to enhance training stability and convergence efficiency. Furthermore, the complexity of the action space is effectively reduced through the strategic simplification of unloading decision dimensions. Experimental results demonstrate that the proposed method can generate stable, feasible, and cost-effective scheduling plans while satisfying various process and operational constraints. These outcomes verify the effectiveness and application potential of deep reinforcement learning in optimizing complex crude oil scheduling tasks in refinery operations.

Key words: crude oil scheduling, deep reinforcement learning, Dueling DQN, scheduling optimization

王永豪周智菊赵毅房韡. 基于Dueling DQN算法的原油调度优化应用研究[J]. 石油炼制与化工, 2026, 57(6): 131-139.

参考文献

[1]郑万鹏，高小永，朱桂瑶，等.原油作业过程优化的研究进展[J].化工学报, 2021, 72(11):5481-5501 [2]王川，杜文莉，朱佳雯，等.数智赋能流程工业调度决策优化：综述与展望[J].中国科学：信息科学, 2025, 55(07):1571-1598 [3]Shah N.Mathematical programming techniques for crude oil scheduling[J].Computers&Chemical Engineering, 1996, 20(Supplement 2):1227-1232 [4]Lee H, Pinto J M, Grossmann I E, et al.Mixed-integer linearprogramming model for refinery short-term scheduling of crudeoil unloading with inventory management[J].Industrial &Engineering Chemistry Research, 1996, 35(5):1630-1641 [5]Xu J, Qu H, Wang S, et al.A new proactive scheduling methodology for front-end crude oil and refinery operations under uncertainty of shipping delay[J].Industrial & Engineering Chemistry Research, 2017, 56(28):8041-8053 [6]Neiro, Sergio Mauro da Silva, et al.quot;Dealing with multiple tank outflows and in-line blending in continuous-time crude oil scheduling problems[J].quot; Industrial&Engineering Chemistry Research, 2019, 58(11):4495-4510 [7]侯艳，牛聪，滕少华，等.基于改进 -Ⅲ 的原油短期调度能耗优化[J].工业工程, 2024, 27(06):38-50 [8]Yang Y, He R, Yu G, et al.Efficient rolling horizon approach to a crude oil scheduling problem for marine-access refineries[J].Computers & Chemical Engineering, 2023, 170(February 2023):108121-108121 [9]Shakya A K, Pillai G, Chakrabarty S.Reinforcement learning algorithms: A brief survey[J].Expert Systems with Applications, 2023, 231(30 November):120495-120495 [10]李宝帅，叶春明.深度强化学习算法求解作业车间调度问题[J].计算机工程与应用, 2021, 57(23):248-254 [11]肖鹏飞，张超勇，孟磊磊，等.基于深度强化学习的非置换流水车间调度问题[J].计算机集成制造系统, 2021, 27(01):192-205 [12]杨挺，赵黎媛，刘亚闯，等.基于深度强化学习的综合能源系统动态经济调度[J].电力系统自动化, 2021, 45(05):39-47 [13]张君，林琳，郭芮，等.基于改进深度强化学习的电网电力智能调度分析模型研究[J].自动化技术与应用, 2025, 44(07):139-142 [14]Zhang M, Pan C.Hierarchical optimization scheduling algorithm for logistics transport vehicles based on multi-agent reinforcement learning[J].IEEE Transactions on Intelligent Transportation Systems, 2023, 25(3):3108-3117 [15]马楠，李洪奇，刘华林，等.基于的炼厂原油储运调度方法[J].化工进展, 2024, 43(3):1167-1177 [16]钟文涛.基于强化学习的原油处理短期调度策略研究与系统实现[D]. 广东工业大学，2024. [17]Van Hasselt H P, Guez A, Hessel M, et al.Learning values across many orders of magnitude[J].Advances in neural information processing systems, 2016, 29(05 December):4294-4302 [18]Mnih V, Kavukcuoglu K, Silver D, et al. Playing atari with deep reinforcement learning[J]. arXiv prep.[J].rXiv:2013, 1312:5602., rint, :- [19]Wang Z, Schaul T, Hessel M, et al.Dueling network architectures for deep reinforcement learning[C]//International conference on machine learning. PMLR, 2016: 1995-2003.