TY - GEN
T1 - Incremental frequent itemsets mining with MapReduce
AU - Kandalov, Kirill
AU - Gudes, Ehud
N1 - Publisher Copyright:
© 2017, Springer International Publishing AG.
PY - 2017/1/1
Y1 - 2017/1/1
N2 - Frequent itemsets mining is a common task in data mining. Since sizes of today’s databases go far beyond capabilities of a single machine, recent studies show how to adopt classical algorithms for frequent itemsets mining for parallel frameworks such as MapReduce. Even then, in case of a slight database update a re-run of the MapReduce mining algorithm from the beginning on the whole data set is required and could be far from optimal. Thus, a variation of these algorithms for incremental database update is desired. The current paper presents a general algorithm for incremental frequent itemsets mining and shows how to adapt it to the parallel paradigm. It also provides optimizations that are unique to a constrained model of MapReduce for an effective algorithm.
AB - Frequent itemsets mining is a common task in data mining. Since sizes of today’s databases go far beyond capabilities of a single machine, recent studies show how to adopt classical algorithms for frequent itemsets mining for parallel frameworks such as MapReduce. Even then, in case of a slight database update a re-run of the MapReduce mining algorithm from the beginning on the whole data set is required and could be far from optimal. Thus, a variation of these algorithms for incremental database update is desired. The current paper presents a general algorithm for incremental frequent itemsets mining and shows how to adapt it to the parallel paradigm. It also provides optimizations that are unique to a constrained model of MapReduce for an effective algorithm.
UR - https://www.scopus.com/pages/publications/85030153390
U2 - 10.1007/978-3-319-66917-5_17
DO - 10.1007/978-3-319-66917-5_17
M3 - Conference contribution
AN - SCOPUS:85030153390
SN - 9783319669168
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 247
EP - 261
BT - Advances in Databases and Information Systems - 21st European Conference, ADBIS 2017, Proceedings
A2 - Kirikova, Marite
A2 - Norvag, Kjetil
A2 - Papadopoulos, George A.
PB - Springer Verlag
T2 - 21st European Conference on Advances in Databases and Information Systems, ADBIS 2017
Y2 - 24 September 2017 through 27 September 2017
ER -