摘要
SIMD is an instruction set in mainstream processors, which provides the data level parallelism to accelerate the performance of applications. However, its advantages diminish when applications suffer from heavy cache misses. To eliminate cache misses in SIMD vectorization, we present interleaved multi-vectorizing (IMV) in this paper. It interleaves multiple execution instances of vectorized code to hide memory access latency with more computation. We also propose residual vectorized states to solve the control flow divergence in vectorization. IMV can make full use of the data parallelism in SIMD and the memory level parallelism through prefetching. It reduces cache misses, branch misses and computation overhead to significantly speed up the performance of pointerchasing applications, and it can be applied to executing entire query pipelines. As experimental results show, IMV achieves up to 4.23X and 3.17X better performance compared with the pure scalar implementation and the pure SIMD vectorization, respectively.
| 源语言 | 英语 |
|---|---|
| 页(从-至) | 226-238 |
| 页数 | 13 |
| 期刊 | Proceedings of the VLDB Endowment |
| 卷 | 13 |
| 期 | 3 |
| DOI | |
| 出版状态 | 已出版 - 2020 |
| 活动 | 46th International Conference on Very Large Data Bases, VLDB 2020 - Virtual, 日本 期限: 31 8月 2020 → 4 9月 2020 |
指纹
探究 'Interleaved multi-vectorizing' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver