跳到主要导航 跳到搜索 跳到主要内容

Coresets for minimum enclosing balls over sliding windows

  • National University of Singapore
  • Singapore Management University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Coresets are important tools to generate concise summaries of massive datasets for approximate analysis. A coreset is a small subset of points extracted from the original point set such that certain geometric properties are preserved with provable guarantees. This paper investigates the problem of maintaining a coreset to preserve the minimum enclosing ball (MEB) for a sliding window of points that are continuously updated in a data stream. Although the problem has been extensively studied in batch and append-only streaming settings, no efficient sliding-window solution is available yet. In this work, we first introduce an algorithm, called AOMEB, to build a coreset for MEB in an append-only stream. AOMEB improves the practical performance of the state-of-the-art algorithm while having the same approximation ratio. Furthermore, using AOMEB as a building block, we propose two novel algorithms, namely SWMEB and SWMEB+, to maintain coresets for MEB over the sliding window with constant approximation ratios. The proposed algorithms also support coresets for MEB in a reproducing kernel Hilbert space (RKHS). Finally, extensive experiments on real-world and synthetic datasets demonstrate that SWMEB and SWMEB+ achieve speedups of up to four orders of magnitude over the state-of-the-art batch algorithm while providing coresets for MEB with rather small errors compared to the optimal ones.

源语言英语
主期刊名KDD 2019 - Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
出版商Association for Computing Machinery
314-323
页数10
ISBN(电子版)9781450362016
DOI
出版状态已出版 - 25 7月 2019
已对外发布
活动25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2019 - Anchorage, 美国
期限: 4 8月 20198 8月 2019

出版系列

姓名Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

会议

会议25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2019
国家/地区美国
Anchorage
时期4/08/198/08/19

指纹

探究 'Coresets for minimum enclosing balls over sliding windows' 的科研主题。它们共同构成独一无二的指纹。

引用此