TY - JOUR
T1 - A Supply–Demand Balance Guided Hierarchical Reinforcement Learning Approach for Truck-Cargo Matching
AU - Liao, Jiajun
AU - Dong, Yitao
AU - Huang, Xiaopeng
AU - Mao, Jiali
AU - Zhou, Aoying
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025
Y1 - 2025
N2 - Truck-cargo matching is one of the core tasks of online freight platforms, where the primary objective is to optimally assign each cargo task to the most suitable truck. The existing matching strategies seek to maximize total transported weight of cargoes by increasing the number of truck-cargo pairings. However, these strategies fail to ensure global matching pair maximization across all regions due to the heterogeneous spatial distribution of truck supply and cargo transporting demand. This limitation necessitates the incorporation of regional supply–demand gap prediction into the matching process. Two critical challenges emerge in achieving optimal matching: (1) the prediction accuracy of supply–demand gaps is influenced by multiple complex factors, and (2) the multi-objective optimization conflicts within current matching decisions may adversely impact subsequent matching performance. To address these challenges, we propose a hierarchical reinforcement learning framework for multi-objective truck-cargo matching, comprising two key components: a supply–demand gap prediction module and a multi-objective optimization matching module.For accurate supply–demand gap prediction, we develop a hypergraph attention network model incorporating an adaptive confidence interval optimization mechanism to capture complex relationships among various predictive factors. Furthermore, to mitigate negative effects of multi-objective conflicts on long-term matching performance, we design a hierarchical deep Q network model that dynamically adjusts objective weights based on predicted long-term benefits. Extensive experiments conducted on two real-world logistics datasets demonstrate that our proposed method achieves a 10.7% higher competitive ratio compared to state-of-the-art approaches, validating the effectiveness of our supply–demand balance guided matching strategy in practical operational scenarios.
AB - Truck-cargo matching is one of the core tasks of online freight platforms, where the primary objective is to optimally assign each cargo task to the most suitable truck. The existing matching strategies seek to maximize total transported weight of cargoes by increasing the number of truck-cargo pairings. However, these strategies fail to ensure global matching pair maximization across all regions due to the heterogeneous spatial distribution of truck supply and cargo transporting demand. This limitation necessitates the incorporation of regional supply–demand gap prediction into the matching process. Two critical challenges emerge in achieving optimal matching: (1) the prediction accuracy of supply–demand gaps is influenced by multiple complex factors, and (2) the multi-objective optimization conflicts within current matching decisions may adversely impact subsequent matching performance. To address these challenges, we propose a hierarchical reinforcement learning framework for multi-objective truck-cargo matching, comprising two key components: a supply–demand gap prediction module and a multi-objective optimization matching module.For accurate supply–demand gap prediction, we develop a hypergraph attention network model incorporating an adaptive confidence interval optimization mechanism to capture complex relationships among various predictive factors. Furthermore, to mitigate negative effects of multi-objective conflicts on long-term matching performance, we design a hierarchical deep Q network model that dynamically adjusts objective weights based on predicted long-term benefits. Extensive experiments conducted on two real-world logistics datasets demonstrate that our proposed method achieves a 10.7% higher competitive ratio compared to state-of-the-art approaches, validating the effectiveness of our supply–demand balance guided matching strategy in practical operational scenarios.
KW - Hierarchical reinforcement learning
KW - Multi-objective optimization
KW - Supply-demand balance
KW - Truck-cargo matching
UR - https://www.scopus.com/pages/publications/105009038588
U2 - 10.1007/s41019-025-00299-6
DO - 10.1007/s41019-025-00299-6
M3 - 文章
AN - SCOPUS:105009038588
SN - 2364-1185
JO - Data Science and Engineering
JF - Data Science and Engineering
ER -