
arxiv: 1901.07773 · v1 · pith:4FLHZJAVnew · submitted 2019-01-23 · 💻 cs.IR · cs.DB

Boosting Frequent Itemset Mining via Early Stopping Intersections

classification 💻 cs.IR cs.DB
keywords mining, technique, frequent, declat, early, eclat, general, itemset

Mining frequent itemsets from a transaction database is a fundamental problem in data mining and has established itself as a building block for many pattern mining tasks. In this paper, we present a general technique to reduce support-checking time in existing depth-first search generate-and-test schemes such as Eclat/dEclat and PrePost+. Our technique allows infrequent candidate itemsets to be detected early. It is based on an early-stopping criterion and is general enough to be applicable to many frequent itemset mining algorithms. We have applied the technique to two TID-list based schemes (Eclat/dEclat) and one N-list based scheme (PrePost+). We have tested the technique on a variety of datasets, and the results confirm its effectiveness in reducing runtime.
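The abstract does not spell out the stopping criterion, so the following is only a minimal sketch of one common way to stop a TID-list intersection early, not the paper's actual method. It assumes both TID-lists are sorted in ascending order, and the function name `intersect_with_early_stop` and its parameters are hypothetical. The idea: at any point in the merge, the final support cannot exceed the matches found so far plus the shorter of the two unread tails, so if that bound falls below the minimum support the candidate is infrequent and the scan can be abandoned.

```python
from typing import List, Optional


def intersect_with_early_stop(
    a: List[int], b: List[int], min_support: int
) -> Optional[List[int]]:
    """Intersect two sorted TID-lists, giving up once the candidate
    can no longer reach min_support.

    Returns the intersection if its length is >= min_support,
    otherwise None. (Illustrative sketch; the bound used here is an
    assumption, not taken from the paper.)
    """
    i = j = 0
    out: List[int] = []
    while i < len(a) and j < len(b):
        # Upper bound on the final support: matches so far plus the
        # shorter of the two unread tails.
        if len(out) + min(len(a) - i, len(b) - j) < min_support:
            return None  # candidate is provably infrequent
        if a[i] == b[j]:
            out.append(a[i])
            i += 1
            j += 1
        elif a[i] < b[j]:
            i += 1
        else:
            j += 1
    return out if len(out) >= min_support else None


if __name__ == "__main__":
    # Frequent candidate: the full intersection is returned.
    print(intersect_with_early_stop([1, 2, 3, 5, 8], [2, 3, 5, 9], 3))
    # Infrequent candidate: the scan stops before reaching the end.
    print(intersect_with_early_stop([1, 2, 3, 4], [5, 6, 7, 8], 3))
```

In an Eclat-style depth-first search, a check of this kind would replace the plain intersection at each candidate extension, so that infrequent candidates cost less than a full pass over both lists.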

