pith. sign in

arxiv: 1603.01682 · v1 · pith:OH6BPMMNnew · submitted 2016-03-05 · 💻 cs.DB

Frequent-Itemset Mining using Locality-Sensitive Hashing

classification 💻 cs.DB
keywords apriorialgorithmminingnumberaddingasymmetricbottleneckcandidates
0
0 comments X
read the original abstract

The Apriori algorithm is a classical algorithm for the frequent itemset mining problem. A significant bottleneck in Apriori is the number of I/O operation involved, and the number of candidates it generates. We investigate the role of LSH techniques to overcome these problems, without adding much computational overhead. We propose randomized variations of Apriori that are based on asymmetric LSH defined over Hamming distance and Jaccard similarity.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.