The document describes an algorithm called StreamGen for efficiently mining frequent generator itemsets over data streams using a sliding window model. It introduces the concepts of generator itemsets and why they are important. StreamGen uses a novel enumeration tree structure and optimization techniques. It is the first algorithm that can mine generator itemsets from data streams. Evaluation results show it outperforms other algorithms for related tasks and achieves high classification accuracy when extended to mine classification rules.
Related topics: