Abstract
This chapter addresses the issue of mining frequent patterns from transactional data streams—the problem of frequent data stream pattern mining (FDPM). While most existing algorithms in mining frequent items for data streams using false-positive oriented approaches to control the error on the estimated frequency of mined patterns and memory requirement, a new paradigm in FDPM is explored—the false-negative oriented approach. That is, the data mining process is controlled by limiting the probability of a frequent pattern that is missed in the result, but all mined patterns are frequent. The chapter introduces both frequent item and item-set mining algorithms using the Chernoff bound. The bound enables pruning infrequent patterns from the continuously arriving transactions with the guarantee of the required recall rate of frequent patterns.
| Original language | English |
|---|---|
| Title of host publication | Proceedings 2004 VLDB Conference |
| Subtitle of host publication | The 30th International Conference on Very Large Databases (VLDB) |
| Publisher | Elsevier |
| Pages | 204-215 |
| Number of pages | 12 |
| ISBN (Electronic) | 9780120884698 |
| DOIs | |
| State | Published - 1 Jan 2004 |
| Externally published | Yes |