TY - JOUR
T1 - Effective reinforcement learning-based dynamic flexible job shop scheduling using two-stage dispatching
AU - Ding, Jiepin
AU - Xia, Jun
AU - Ye, Yutong
AU - Chen, Mingsong
N1 - Publisher Copyright:
© 2025 Elsevier B.V.
PY - 2026/3
Y1 - 2026/3
N2 - Deep Reinforcement Learning (DRL) has been recognized as a promising means for solving the Dynamic Flexible Job Shop Scheduling Problem (DFJSP), where involved jobs have both distinct start time and due dates. However, due to the improper DRL modeling of scheduling components, it is hard to guarantee the quality (e.g., makespan, resource utilization) of job-to-machine dispatching solutions for DFJSP. This is mainly because (i) most existing DRL-based methods design actions as composite rules by combining both the processes of operation sequencing and machine assignment together, which will inevitably limit their adaptability to ever-changing scheduling scenarios; and (ii) without considering knowledge sharing among DRL network nodes, the learned policy networks with bounded sizes cannot be applied to complex large-scale scheduling problems. To address this problem, this paper introduces a novel DRL-based two-stage dispatching method that can effectively solve the DFJSP to achieve scheduling solutions of better quality. In our approach, the first stage utilizes a graph neural network-based policy network to facilitate optimal operation selection at each dispatching point. Since the policy network is size-agnostic and can share knowledge among DRL network nodes through graph embedding, it can handle DFJSP instances of varying scales. For the second stage, by decoupling the dependencies between operations and machines, we propose an effective machine selection heuristic that can derive more dispatching rules to improve the adaptability of DRL to various complex dynamic scheduling scenarios. Comprehensive experimental results demonstrate the superiority of our approach over state-of-the-art methods from both the perspective of scheduling solution quality and the adaptability of learned DRL models.
AB - Deep Reinforcement Learning (DRL) has been recognized as a promising means for solving the Dynamic Flexible Job Shop Scheduling Problem (DFJSP), where involved jobs have both distinct start time and due dates. However, due to the improper DRL modeling of scheduling components, it is hard to guarantee the quality (e.g., makespan, resource utilization) of job-to-machine dispatching solutions for DFJSP. This is mainly because (i) most existing DRL-based methods design actions as composite rules by combining both the processes of operation sequencing and machine assignment together, which will inevitably limit their adaptability to ever-changing scheduling scenarios; and (ii) without considering knowledge sharing among DRL network nodes, the learned policy networks with bounded sizes cannot be applied to complex large-scale scheduling problems. To address this problem, this paper introduces a novel DRL-based two-stage dispatching method that can effectively solve the DFJSP to achieve scheduling solutions of better quality. In our approach, the first stage utilizes a graph neural network-based policy network to facilitate optimal operation selection at each dispatching point. Since the policy network is size-agnostic and can share knowledge among DRL network nodes through graph embedding, it can handle DFJSP instances of varying scales. For the second stage, by decoupling the dependencies between operations and machines, we propose an effective machine selection heuristic that can derive more dispatching rules to improve the adaptability of DRL to various complex dynamic scheduling scenarios. Comprehensive experimental results demonstrate the superiority of our approach over state-of-the-art methods from both the perspective of scheduling solution quality and the adaptability of learned DRL models.
KW - Deep reinforcement learning
KW - Dynamic scheduling
KW - Flexible job shop scheduling
KW - Job insertion
UR - https://www.scopus.com/pages/publications/105025203592
U2 - 10.1016/j.sysarc.2025.103664
DO - 10.1016/j.sysarc.2025.103664
M3 - 文章
AN - SCOPUS:105025203592
SN - 1383-7621
VL - 172
JO - Journal of Systems Architecture
JF - Journal of Systems Architecture
M1 - 103664
ER -