StreamRec: A Recommendation Inference System with CUDA Stream Acceleration

  • Yuean Niu
  • Zhizhen Xu
  • Yushu Sun
  • Chen Xu*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-reviewed

Abstract

Deep-learning-based recommendation models are widely used in various applications. The input of a recommendation model often contains dozens of groups of sparse features, and each group computes its embedding layer independently and applies a separate feature interaction. However, current deep learning frameworks schedule all operators sequentially onto a single CUDA stream, so these independent computations cannot overlap. Therefore, we propose StreamRec, a stream-based parallel inference system. It assigns the processing of individual features to different CUDA streams for parallel execution. In addition, StreamRec can visualize execution performance and operator assignment results on the web.
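The scheduling idea in the abstract can be illustrated with a small CPU-side model. This is only a sketch under stated assumptions, not StreamRec's implementation: the round-robin policy, the function names `assign_round_robin` and `makespan`, the feature-group names, and the per-group costs are all hypothetical, and the model assumes work on different streams overlaps perfectly, which a real GPU does only when resources allow.

```python
from collections import defaultdict


def assign_round_robin(feature_groups, num_streams):
    """Map each feature-group name to a stream index in round-robin order.

    Round-robin is an assumed policy for illustration only; the abstract
    does not say how StreamRec chooses a stream for each feature.
    """
    return {name: i % num_streams for i, name in enumerate(feature_groups)}


def makespan(costs, assignment):
    """Idealized completion time for a given assignment.

    Work placed on the same stream runs back to back; different streams are
    assumed to overlap perfectly, so the slowest stream sets the total time.
    """
    per_stream = defaultdict(float)
    for name, stream in assignment.items():
        per_stream[stream] += costs[name]
    return max(per_stream.values(), default=0.0)


# Hypothetical cost per feature group (embedding lookup plus feature
# interaction), in milliseconds. These numbers are made up.
costs = {
    "user_id": 3.0,
    "item_id": 4.0,
    "category": 1.0,
    "city": 2.0,
    "device": 2.0,
    "hour": 1.0,
}

# One stream models the sequential scheduling the abstract attributes to
# current frameworks; three streams model per-feature parallel execution.
single = makespan(costs, assign_round_robin(costs, 1))
multi = makespan(costs, assign_round_robin(costs, 3))
print(f"single stream: {single} ms, three streams: {multi} ms")
```

In this toy model the single-stream time is the sum of all group costs, while the multi-stream time is bounded by the most heavily loaded stream, which is why balancing the assignment matters as much as the number of streams.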

Original language: English
Title of host publication: Database Systems for Advanced Applications - 29th International Conference, DASFAA 2024, Proceedings
Editors: Makoto Onizuka, Jae-Gil Lee, Yongxin Tong, Chuan Xiao, Yoshiharu Ishikawa, Kejing Lu, Sihem Amer-Yahia, H.V. Jagadish
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 480-483
Number of pages: 4
ISBN (Print): 9789819755745
DOI
Publication status: Published - 2024
Event: 29th International Conference on Database Systems for Advanced Applications, DASFAA 2024 - Gifu, Japan
Duration: 2 Jul 2024 to 5 Jul 2024

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 14856 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 29th International Conference on Database Systems for Advanced Applications, DASFAA 2024
Country/Territory: Japan
City: Gifu
Period: 2/07/24 to 5/07/24
