This document presents the Dynamic Itemset Counting (DIC) algorithm for mining frequent itemsets from transactional data. DIC allows itemsets to be counted as soon as they are suspected of being frequent rather than waiting until the end of each pass over the data. The algorithm marks itemsets with solid or dashed boxes/circles to indicate their confirmed or suspected size. It iterates over the transaction data, incrementing counters for suspected itemsets and updating their markings if counts pass the minimum support threshold. The document discusses how DIC can be parallelized and extended to handle incremental updates to the transaction data.
Related topics: