跳到主要导航 跳到搜索 跳到主要内容

PrefixFPM: A parallel framework for general-purpose frequent pattern mining

  • University of Alabama at Birmingham
  • East China Normal University
  • Tongji University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Frequent pattern mining (FPM) has been a focused theme in data mining research for decades, but there lacks a general programming framework that can be easily customized to mine different kinds of frequent patterns, and existing solutions to FPM over big transaction databases are IO-bound rendering CPU cores underutilized even though FPM is NP-hard.This paper presents, PrefixFPM, a general-purpose framework for FPM that is able to fully utilize the CPU cores in a multicore machine. PrefixFPM follows the idea of prefix projection to partition the workloads of PFM into independent tasks by divide and conquer. PrefixFPM exposes a unified programming interface to users who can customize it to mine their desired patterns, and the parallel execution engine is transparent to end-users and can be reused for mining all kinds of patterns. We have adapted the state-of-the-art serial algorithms for mining frequent patterns including subsequences, subtrees, and subgraphs on top of PrefixFPM, and extensive experiments demonstrate an excellent speedup ratio of PrefixFPM with the number of cores.A demo is available at https://youtu.be/PfioC0GDpsw; the code is available at https://github.com/yanlab19870714/PrefixFPM.

源语言英语
主期刊名Proceedings - 2020 IEEE 36th International Conference on Data Engineering, ICDE 2020
出版商IEEE Computer Society
1938-1941
页数4
ISBN(电子版)9781728129037
DOI
出版状态已出版 - 4月 2020
活动36th IEEE International Conference on Data Engineering, ICDE 2020 - Dallas, 美国
期限: 20 4月 202024 4月 2020

出版系列

姓名Proceedings - International Conference on Data Engineering
2020-April
ISSN(印刷版)1084-4627

会议

会议36th IEEE International Conference on Data Engineering, ICDE 2020
国家/地区美国
Dallas
时期20/04/2024/04/20

指纹

探究 'PrefixFPM: A parallel framework for general-purpose frequent pattern mining' 的科研主题。它们共同构成独一无二的指纹。

引用此