TY - GEN
T1 - Parallel stream processing against workload skewness and variance
AU - Fang, Junhua
AU - Zhang, Rong
AU - Fu, Tom Z.J.
AU - Zhang, Zhenjie
AU - Zhou, Aoying
AU - Zhu, Junhua
N1 - Publisher Copyright:
© 2017 Association for Computing Machinery.
PY - 2017/6/26
Y1 - 2017/6/26
N2 - Key-based workload partitioning is a common strategy used in parallel stream processing engines, enabling effective key-value tuple distribution over worker threads in a logical operator. It is likely to generate poor balancing performance when workload variance occurs on the incoming data stream. This paper presents a new key-based workload partitioning framework, with practical algorithms to support dynamic workload assignment for stateful operators. The framework combines hash-based and explicit keybased routing strategies for workload distribution, which specifies the destination worker threads for a handful of keys and assigns the other keys with the hash function. When short-term distribution fluctuations occur to the incoming data stream, the system adaptively updates the routing table containing the chosen keys, in order to rebalance the workload with minimal migration overhead within the stateful operator. We formulate the rebalance operation as an optimization problem, with multiple objectives on minimizing state migration costs, controlling the size of the routing table and breaking workload imbalance among worker threads. Despite of the NP-hardness nature behind the optimization formulation, we carefully investigate and justify the heuristics behind key (re)routing and state migration, to facilitate fast response to workload variance with ignorable cost to the normal processing in the distributed system. Empirical studies on synthetic data and real-world stream applications validate the usefulness of our proposals.
AB - Key-based workload partitioning is a common strategy used in parallel stream processing engines, enabling effective key-value tuple distribution over worker threads in a logical operator. It is likely to generate poor balancing performance when workload variance occurs on the incoming data stream. This paper presents a new key-based workload partitioning framework, with practical algorithms to support dynamic workload assignment for stateful operators. The framework combines hash-based and explicit keybased routing strategies for workload distribution, which specifies the destination worker threads for a handful of keys and assigns the other keys with the hash function. When short-term distribution fluctuations occur to the incoming data stream, the system adaptively updates the routing table containing the chosen keys, in order to rebalance the workload with minimal migration overhead within the stateful operator. We formulate the rebalance operation as an optimization problem, with multiple objectives on minimizing state migration costs, controlling the size of the routing table and breaking workload imbalance among worker threads. Despite of the NP-hardness nature behind the optimization formulation, we carefully investigate and justify the heuristics behind key (re)routing and state migration, to facilitate fast response to workload variance with ignorable cost to the normal processing in the distributed system. Empirical studies on synthetic data and real-world stream applications validate the usefulness of our proposals.
KW - Adjustment
KW - Distributed stream processing
KW - Load balance
KW - Stateful operation
KW - Workload variance
UR - https://www.scopus.com/pages/publications/85025815502
U2 - 10.1145/3078597.3078613
DO - 10.1145/3078597.3078613
M3 - 会议稿件
AN - SCOPUS:85025815502
T3 - HPDC 2017 - Proceedings of the 26th International Symposium on High-Performance Parallel and Distributed Computing
SP - 15
EP - 26
BT - HPDC 2017 - Proceedings of the 26th International Symposium on High-Performance Parallel and Distributed Computing
PB - Association for Computing Machinery, Inc
T2 - 26th ACM International Symposium on High-Performance Parallel and Distributed Computing, HPDC 2017
Y2 - 26 June 2017 through 30 June 2017
ER -