石油炼制与化工 ›› 2026, Vol. 57 ›› Issue (6): 131-139.

• 控制与优化 • 上一篇    下一篇

基于Dueling DQN算法的原油调度优化应用研究

王永豪,周智菊,赵毅,房韡   

  1. 中石化石油化工科学研究院有限公司
  • 收稿日期:2026-01-04 修回日期:2026-02-03 出版日期:2026-06-12 发布日期:2026-05-22
  • 通讯作者: 房韡 E-mail:fangwei.ripp@sinopec.com

APPLICATION RESEARCH OF CRUDE OIL SCHEDULING OPTIMIZATION BASED ON THE Dueling DQN ALGORITHM


  • Received:2026-01-04 Revised:2026-02-03 Online:2026-06-12 Published:2026-05-22

摘要: 聚焦于深度强化学习在原油调度中的应用,将调度过程建模为马尔可夫决策过程,并采用 Dueling DQN 算法对原油调度优化问题进行求解。针对调度场景中状态尺度不一、奖励分布不稳定等问题,设计了状态归一化与奖励标准化机制以提升训练稳定性与收敛效率;通过对卸油决策维度的合理简化,以降低动作空间复杂度。试验结果表明,所提出方法能够在满足多类工艺与操作约束的前提下生成稳定、可行且经济性良好的调度方案,验证了深度强化学习在复杂炼油厂原油调度优化任务中的有效性与应用潜力。

关键词: 原油调度, 深度强化学习, Dueling DQN, 调度优化

Abstract: Focusing on the application of deep reinforcement learning in crude oil scheduling, the scheduling process is modeled as a Markov decision process, and the Dueling DQN algorithm is adopted to solve the crude oil scheduling optimization problem. To address challenges such as disparate state scales and fluctuating reward distributions, state normalization and reward standardization mechanisms are designed to enhance training stability and convergence efficiency. Furthermore, the complexity of the action space is effectively reduced through the strategic simplification of unloading decision dimensions. Experimental results demonstrate that the proposed method can generate stable, feasible, and cost-effective scheduling plans while satisfying various process and operational constraints. These outcomes verify the effectiveness and application potential of deep reinforcement learning in optimizing complex crude oil scheduling tasks in refinery operations.

Key words: crude oil scheduling, deep reinforcement learning, Dueling DQN, scheduling optimization