Exploring Decomposition for Solving Pattern Mining Problems

Youcef Djenouri; Jerry Chun-Wei Lin; Kjetil Nørvåg; Heri Ramampiaro; Philip S. Yu

Exploring Decomposition for Solving Pattern Mining Problems

Philip Yu

ACM Transactions on Management Information Systems

Sign up for access to the world's latest research

checkGet notified about relevant papers

checkSave papers to use in your research

checkJoin the discussion with peers

checkTrack your impact

Abstract

This article introduces a highly efficient pattern mining technique called Clustering-based Pattern Mining (CBPM). This technique discovers relevant patterns by studying the correlation between transactions in the transaction database based on clustering techniques. The set of transactions is first clustered, such that highly correlated transactions are grouped together. Next, we derive the relevant patterns by applying a pattern mining algorithm to each cluster. We present two different pattern mining algorithms, one applying an approximation-based strategy and another based on an exact strategy. The approximation-based strategy takes into account only the clusters, whereas the exact strategy takes into account both clusters and shared items between clusters. To boost the performance of the CBPM, a GPU-based implementation is investigated. To evaluate the CBPM framework, we perform extensive experiments on several pattern mining problems. The results from the experimental evaluatio...

Shahid Kamal

In data mining studies, mining of frequent patterns in transaction databases has been a popular area of research. Many approaches are being used to solve the problem of discovering association rules among items in large databases. We also consider the same problem. We present a new approach for solving this problem that is fundamentally different from the known techniques. In this study, we propose a transactional patternbase where transactions with same pattern are added as their frequency is increased. Thus subsequent scanning requires only scanning this compact dataset which increases efficiency of the respective methods. We have implemented this technique by using two-dimensional matrix instead of using FP-Growth method, as used by most of the algorithms. Empirical evaluation shows that this technique outperforms the database approach, implemented with FP-Growth, in many situations and performs exceptionally well when the repetition of transaction patterns is higher. We have implemented it using Visual Basic which has substantially reduced coding and computational cost. Success of this method will open new directions.

Log In

Exploring Decomposition for Solving Pattern Mining Problems

Sign up for access to the world's latest research

Abstract

Related papers

Related papers