MATEC Web of Conferences 189, 10012 (2018) https://doi.org/10.1051/matecconf/201818910012
MEAMT 2018
An improvement of FP-Growth association rule
mining algorithm based on adjacency table
Ming Yin1, Wenjie Wang1,*, Yang Liu1, and Dan Jiang2
1 School of Software and Microelectronics, Northwestern Polytechnical University, Xi'an 710072, P.R. China
2 Chinese Helicopter Design Institute, Tianjin, P.R. China
Abstract. FP-Growth is an association rule mining algorithm based on the frequent pattern tree (FP-Tree) which does not need to generate a large number of candidate sets. However, constructing the FP-Tree requires two scans of the original transaction database, and generating frequent itemsets requires recursive mining of the FP-Tree. In addition, the algorithm cannot work effectively when the dataset is dense. To address the large memory usage and low mining efficiency of this algorithm, this paper proposes an improved algorithm based on an adjacency table, which is stored in a hash table to considerably reduce lookup time. The experimental results show that the improved algorithm performs well, especially for mining frequent itemsets in dense datasets.
1 Introduction
Data mining is a process of obtaining potentially useful knowledge from data [1]. As an important part of data mining, association rule mining reflects the intrinsic relationships between complex itemsets [2]. Agrawal et al. [3, 4] proposed the Boolean association rule problem and the corresponding Apriori algorithm. Considering the disadvantages of the Apriori algorithm, J. Han et al. proposed the FP-Growth algorithm, which uses the FP-Tree to generate frequent itemsets [5]. It compresses the transaction itemsets into an FP-Tree that stores their association information and then generates frequent itemsets from the FP-Tree [6]. Although the algorithm requires only two database scans and does not need to generate candidate sets [7], it must build an FP-Tree that contains all the itemsets, which requires a large amount of memory. If there are too many frequent itemsets in the database and the memory cannot hold the mapping information of all the items in the FP-Tree, the algorithm becomes ineffective [8]. Besides, scanning the transaction database twice also degrades the algorithm's performance.
This paper proposes an improved FP-Growth algorithm based on an adjacency table, which draws on the idea of graphs. After scanning the itemsets in the transaction database, we adopt a storage method that combines the adjacency table with a hash table, which removes itemsets below the minimum support as early as possible and avoids generating all nonempty subsets of the longest frequent itemsets. The algorithm makes full use of the established adjacency table and only needs to scan the original transaction database once. It has the advantages of fast running speed, small memory consumption and low complexity.
* Corresponding author: [email protected]
© The Authors, published by EDP Sciences. This is an open access article distributed under the terms of the Creative Commons
Attribution License 4.0 (http://creativecommons.org/licenses/by/4.0/).
The rest of this paper is organized as follows. Section 2 discusses related work. Section 3 proposes the improvement of the FP-Growth algorithm based on the adjacency table and the mining process of frequent itemsets. Section 4 analyses the time performance of the FP-Growth algorithm and the improved one. Section 5 presents experiments comparing the performance of the FP-Growth algorithm with the improved one on various itemsets. The last section presents our conclusions and future work.
2 Related works
Association rule mining is an important data analysis method and data mining technology [9]. Although Agrawal et al. proposed the Apriori algorithm, it iterates over subsets of the data and uses the candidate itemsets produced earlier to generate later frequent itemsets, which results in low efficiency and makes it difficult to apply to the mining of massive data [10, 11]. In response to the disadvantages of the Apriori algorithm, J. Han proposed the FP-Growth algorithm to generate frequent itemsets [5]. It compresses the transaction itemsets into an FP-Tree that stores their association information and then generates frequent itemsets from the FP-Tree [12].
The algorithm introduces a data structure consisting of three parts. The first part is the header table, which records the frequency of occurrence of all items and sorts them in descending order of frequency. The second is the FP-Tree, which maps the original itemsets into memory and maintains the association information between itemsets. The third is the list of node links: every frequent item in the header table is the head of a node list that points to the positions of that item in the FP-Tree [13]. Although the algorithm requires only two database scans and does not need to generate candidate itemsets [14], it must build an FP-Tree that contains all the itemsets. If there are too many frequent itemsets in the database and the memory cannot hold the mapping information of all the itemsets in the FP-Tree, the algorithm becomes ineffective [15]. Besides, scanning the database twice also makes the algorithm inefficient. The DMFIA algorithm is an improvement of FP-Growth that reduces the number of database scans, but it still adopts the FP-Tree storage structure and traversal method, which has to search many layers and generates many candidate itemsets at each layer, leading to low efficiency [16].
This paper proposes an improved FP-Growth algorithm. After scanning the itemsets in the data, we adopt a storage method that combines an adjacency table with a hash table, which can quickly remove itemsets below the minimum support. The algorithm makes full use of the established adjacency table and only needs to scan the original database once. It has the advantages of fast running speed, small memory consumption and low complexity.
3 Improvement of the FP-Growth algorithm based on the adjacency table
The FP-Growth algorithm scans the database shown in Table 1 twice; Figure 1 shows how the transaction database is converted into the FP-Tree. However, for large-scale datasets, the algorithm suffers from memory and computational shortcomings, making it inefficient [17].
Table 1. Transaction database.
TID   Items
T100  I2, I3, I5
T200  I6, I2
T300  I3, I1, I4
T400  I4, I2, I3, I1, I5
T500  I3, I5, I4
T600  I5, I6
Fig.1. The transaction database is converted into the FP-Tree.
3.1 Generation of adjacency table
Taking the database in Table 1 as an example, the items in each itemset can be considered related to each other and thus form a complete graph. Each time the same two items co-occur, the weight of the edge between them is incremented by one, so the final weight of an edge is the co-occurrence frequency of its two items. For example, I3 and I5 co-occur in T100, T400 and T500, so the edge (I3, I5) finally has weight 3. The association relationship graph formed after the first scan of the database is shown in Figure 2.
Fig. 2. The formed association relationship diagram and the generated adjacency table.
3.2 The mining of frequent itemsets
Unrelated items are removed using the minimum support by examining the adjacency table's vertices and their adjacency points in turn. For the example above, after removing the item pairs whose support count is one, we obtain the following single-dimensional frequent itemsets:
{(I1, I3 : 2); (I1, I4 : 2); (I2, I5 : 2); (I4, I3 : 3); (I4, I5 : 2); (I5, I3 : 3)}.
Some of these single-dimensional frequent itemsets are subsets of three-item frequent itemsets. Continuing to mine the adjacency table from these subsets, we obtain the three-item frequent itemsets {(I1, I3, I4 : 2), (I4, I3, I5 : 2), (I2, I3, I5 : 2)}. Similarly, three-item frequent itemsets are subsets of four-item frequent itemsets, and so on. The algorithm does not end until all the frequent itemsets have been mined. The steps of the improved FP-Growth algorithm are as follows.
Scan the transaction database to generate the adjacency table:
(1) HashMap<String, HashMap<String, Integer>> GraphMap; // define the adjacency table
(2) HashMap<String, Integer> frequent;
(3) while (reading the next transaction itemset I of the transaction database != null) {
(4)   HashMap<String, Integer> list; // stores the adjacent points of a vertex and their weights
(5)   take each item of the transaction array String arr[] in turn as a vertex top of the adjacency table;
(6)   if (GraphMap.containsKey(top))
(7)     take the vertex's existing set of adjacency points, i.e. its association relationship set: list = GraphMap.get(top);
(8)   else // the vertex is new, so initialize its set of adjacency points
(9)     list = new HashMap<String, Integer>();
(10)  loop over the items in arr[] in turn and store every non-top item into "list":
(11)    list.put(arr[j], (list.containsKey(arr[j]) ? 1 + list.get(arr[j]) : 1));
(12)  GraphMap.put(top, list); }
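As a reference, the steps above can be written out as the following runnable Java sketch. It is a minimal interpretation of steps (1)-(12), assuming the database is already in memory as a list of string-array transactions; the class name AdjacencyTableBuilder, the method build and the use of Java 8 map operations are our own choices and are not taken from the paper.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;

public class AdjacencyTableBuilder {

    // GraphMap: vertex item -> (adjacent item -> co-occurrence weight), as in step (1)
    public static HashMap<String, HashMap<String, Integer>> build(List<String[]> database) {
        HashMap<String, HashMap<String, Integer>> graphMap = new HashMap<>();
        for (String[] arr : database) {                       // step (3): read each transaction
            for (String top : arr) {                          // step (5): each item becomes a vertex
                // steps (6)-(9): fetch the vertex's adjacency list or initialise it
                HashMap<String, Integer> list = graphMap.computeIfAbsent(top, k -> new HashMap<>());
                for (String item : arr) {                     // steps (10)-(11): count the other items
                    if (!item.equals(top)) {
                        list.put(item, list.containsKey(item) ? list.get(item) + 1 : 1);
                    }
                }
                graphMap.put(top, list);                      // step (12)
            }
        }
        return graphMap;
    }

    public static void main(String[] args) {
        // The six transactions of Table 1
        List<String[]> db = Arrays.asList(
            new String[]{"I2", "I3", "I5"}, new String[]{"I6", "I2"},
            new String[]{"I3", "I1", "I4"}, new String[]{"I4", "I2", "I3", "I1", "I5"},
            new String[]{"I3", "I5", "I4"}, new String[]{"I5", "I6"});
        // The edge (I3, I5) should receive weight 3, matching Figure 2
        System.out.println(build(db).get("I3").get("I5"));
    }
}
```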
Mine the adjacency table to generate the collection of frequent itemsets:
(1) Loop over the vertices of GraphMap in turn; for each vertex, get its adjacent items and remove those whose weight is less than min_sup;
(2) each obtained vertex forms a single-dimensional frequent itemset with each of its remaining adjacent points;
(3) frequent.put(item, weight_count); // if the itemset is not yet in the frequent itemsets
(4) // frequent itemsets (A, B) and (B, A) are considered to be the same frequent itemset;
(5) traverse the adjacent points in GraphMap and declare ArrayList<Integer> Weight;
(6) // the Weight array stores the weights of the adjacent points, i.e. the association frequencies;
(7) if the set of adjacency points of a vertex contains all items of an itemset in "frequent", take the union vertex ∪ itemset and take the minimum value min of the array Weight_sort[] as its frequency;
(8) if (!frequent.containsKey(vertex ∪ itemset)) then frequent.put(vertex ∪ itemset, min);
(9) repeat steps (6)-(9) until step (8) adds no new itemset;
(10) return frequent; // the mining of frequent itemsets is completed
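Again as a reference only, one possible reading of steps (1)-(10) is the Java sketch below. The join strategy (extending an already frequent itemset by a vertex that is adjacent to all of its items) and the string key used to identify itemsets are our own assumptions, since the steps above leave these details open; taking the minimum edge weight as the itemset frequency follows step (7).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class AdjacencyTableMiner {

    // Itemsets are keyed by a sorted, comma-joined string so that (A,B) and (B,A)
    // are treated as the same itemset, as required by step (4).
    private static String key(List<String> items) {
        List<String> sorted = new ArrayList<>(items);
        Collections.sort(sorted);
        return String.join(",", sorted);
    }

    public static HashMap<String, Integer> mine(
            HashMap<String, HashMap<String, Integer>> graphMap, int minSup) {
        HashMap<String, Integer> frequent = new HashMap<>();

        // Steps (1)-(4): a vertex and an adjacent item whose edge weight reaches min_sup
        // form a single-dimensional frequent itemset.
        for (Map.Entry<String, HashMap<String, Integer>> v : graphMap.entrySet()) {
            for (Map.Entry<String, Integer> adj : v.getValue().entrySet()) {
                if (adj.getValue() >= minSup) {
                    frequent.putIfAbsent(key(Arrays.asList(v.getKey(), adj.getKey())), adj.getValue());
                }
            }
        }

        // Steps (5)-(9): repeatedly extend an already frequent itemset by a vertex that is
        // adjacent (with weight >= min_sup) to every item of the itemset; the new itemset's
        // frequency is the minimum of the edge weights involved, as in step (7).
        boolean grew = true;
        while (grew) {
            grew = false;
            HashMap<String, Integer> discovered = new HashMap<>();
            for (Map.Entry<String, Integer> fs : frequent.entrySet()) {
                List<String> items = Arrays.asList(fs.getKey().split(","));
                for (String vertex : graphMap.keySet()) {
                    if (items.contains(vertex)) {
                        continue;
                    }
                    HashMap<String, Integer> adj = graphMap.get(vertex);
                    int min = fs.getValue();
                    boolean connectedToAll = true;
                    for (String item : items) {
                        Integer w = adj.get(item);
                        if (w == null || w < minSup) {
                            connectedToAll = false;
                            break;
                        }
                        min = Math.min(min, w);
                    }
                    if (connectedToAll) {
                        List<String> extended = new ArrayList<>(items);
                        extended.add(vertex);
                        String k = key(extended);
                        if (!frequent.containsKey(k)) {
                            discovered.putIfAbsent(k, min);
                        }
                    }
                }
            }
            if (!discovered.isEmpty()) {
                frequent.putAll(discovered);
                grew = true;
            }
        }
        return frequent; // step (10)
    }
}
```

With the adjacency table built from Table 1 and min_sup = 2, this sketch reproduces the three-item frequent itemsets listed in Section 3.2.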
4 Time complexity analysis
This paper compares the time performance of the FP-Growth algorithm with the improved algorithm based on the adjacency table. The following symbols are used in the analysis: $n$: the number of transactions in the entire database; $n_1$: the number of itemsets in the FP-Tree built from the original database, i.e. the number of leaf nodes of the FP-Tree; $n_2$: the average number of items per transaction itemset $I$; $t_{ri}$: the time to read transaction $i$ from the original database; $t_{li}$: the time FP-Growth spends counting the frequency of each item in the database; $g_{ri}$: the time the improved FP-Growth spends counting the frequency of each item in the database; $t_{si}$: the time to sort each itemset in header-table order; $t_{ini}$: the time to insert each item into the FP-Tree; $g_{ini}$: the time to insert each item into the adjacency table; $t_{fi}$: the time to get frequent itemsets from the FP-Tree; $g_{fi}$: the time to get frequent itemsets from the adjacency table; $T_{FP}$: the time to find all frequent itemsets from the original database using the FP-Growth algorithm; $T_g$: the time to find all frequent itemsets from the original database using the improved FP-Growth algorithm.
$T_{FP} = \sum_{i=1}^{n}\left(t_{ri} + t_{li} + t_{si} + t_{ini} + t_{fi}\right)$   (1)
When the FP-Tree is close to a binary tree, the time complexity of FP-Growth is lowest, and Formula (1) above is approximately equal to Formula (2).
$T_{FP} = \sum_{i=1}^{n}\left(t_{ri} + t_{li} + t_{si}\right) + \sum_{i=1}^{n}\log_2 i + n_1\log_2 n_1$   (2)
$T_g = \sum_{i=1}^{n}\left(t_{ri} + g_{ri} + g_{ini} + g_{fi}\right)$   (3)
Since this paper uses hash tables to implement the adjacency table, the maximum time cost of the improved FP-Growth algorithm is approximately given by Formula (4) below.
$T_g = \sum_{i=1}^{n}\left(t_{ri} + g_{ri}\right) + O(n) + n_2\,O(n_2)$   (4)
When constructing an FP-Tree, each itemset must be sorted in the order of the header table, and counting the frequency of each item requires traversing the header table. In contrast, when constructing the adjacency table, the itemsets do not have to be sorted and the hash table is simply traversed, so Formulas (5) and (6) can be written as follows:
$\sum_{i=1}^{n}\left(t_{ri} + t_{li} + t_{si}\right) > \sum_{i=1}^{n}\left(t_{ri} + g_{ri}\right)$   (5)
$\sum_{i=1}^{n}\log_2 i = \log_2 n!$   (6)
Formula (7) can be deduced by Stirling's approximation:
$\ln n! > \left(n + \frac{1}{2}\right)\ln n - n$   (7)
Since the average number of items $n_2$ in each transaction itemset $I$ is less than the number of leaf nodes $n_1$ in the FP-Tree, Formulas (4), (5), (6) and (7) give $O(n) + n_2\,O(n_2) < \sum_{i=1}^{n}\log_2 i + n_1\log_2 n_1$, and therefore $T_{FP} > T_g$.
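Putting the pieces together, this argument can be restated compactly as follows (our own consolidation of Formulas (2) and (4)-(7), under the stated assumption that $n_2 < n_1$):

```latex
\begin{align*}
T_{FP} - T_g
  &\approx \underbrace{\sum_{i=1}^{n}\bigl(t_{ri}+t_{li}+t_{si}\bigr)
            - \sum_{i=1}^{n}\bigl(t_{ri}+g_{ri}\bigr)}_{>\,0\ \text{by (5)}}
   + \underbrace{\sum_{i=1}^{n}\log_2 i + n_1\log_2 n_1
            - O(n) - n_2\,O(n_2)}_{>\,0\ \text{by (6), (7) and } n_2 < n_1}\\
  &> 0, \qquad\text{hence } T_{FP} > T_g .
\end{align*}
```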
5 Experimental results
To study the performance of the algorithm, this paper compares the FP-Growth algorithm with the improved one on a sparse dataset and a dense dataset under the same experimental environment. The sparse dataset averages 10 items per transaction; in Figure 3 the minimum support count of each transaction database is 100, and in Figure 4 the number of transactions is 500,000. The dense dataset averages 23 items per transaction; in Figure 5 the minimum support count of each transaction database is 500, and in Figure 6 the number of transactions is 30,000.
The experimental results show that the efficiency of mining frequent itemsets improves obviously after the FP-Growth algorithm is improved, and the effect is more prominent when dealing with massive numbers of frequent itemsets. The main reason is that the improved algorithm only needs to scan the transaction database once and does not have to sort many itemsets by support frequency. Furthermore, the fast lookup of the hash table also saves time, even in the case of massive numbers of frequent itemsets.
Fig.3. Comparison of effects on different numbers of sparse transaction items.
Fig.4. Comparison of different support counting for sparse transaction items.
Fig.5. Comparison of effects on different numbers of dense transaction items.
Fig.6. Comparison of different support counting for dense transaction items.
6 Conclusions and future scope
After studying the association rule mining process of the FP-Growth algorithm, this paper proposes an improved FP-Growth algorithm based on the adjacency table, which noticeably improves the performance of the algorithm. First of all, the improved algorithm only scans the transaction database once, which greatly reduces I/O operations. Secondly, it does not require the establishment of a header table or a large number of sort operations. Finally, when mining frequent itemsets, the improved algorithm adopts a hash table for fast lookup and does not need recursive mining. These changes
considerably reduce the algorithm's time and memory consumption. Especially when dealing with dense transaction items, the improved algorithm shows high performance and is expected to have great application value. Future work will refine the improved FP-Growth algorithm in combination with applications and study its parallelization.
References
1. Wu X, Kumar V, Quinlan J R, et al. Top 10 algorithms in data mining[J]. Knowledge
& Information Systems, 2007, 14(1):1-37.
2. Sharma S, Bhatia S. Analysis of Association rule in Data Mining[C]. International Conference
on Information and Communication Technology for Competitive Strategies. ACM, 2016:1-4.
3. Agrawal R, Srikant R. Fast Algorithms for Mining Association Rules in Large Databases[C]. Proceedings of the Very Large Data Bases Conference. 1994.
4. Agrawal R, Imieliński T, Swami A. Mining Association Rules between Sets of Items in Large Databases[J]. Proc of SIGMOD, 1993, 22(2):207-216.
5. Han J, Pei J, Yin Y. Mining frequent patterns without candidate generation[C]. ACM
SIGMOD International Conference on Management of Data. ACM, 2000:1-12.
6. Heaton J. Comparing dataset characteristics that favor the Apriori, Eclat or FP-Growth
frequent itemset mining algorithms[C]. Southeastcon. IEEE, 2017:1-7.
7. Difallah D E, Benton R G, Raghavan V, et al. FAARM: Frequent Association Action
Rules Mining Using FP-Tree[C]. IEEE, International Conference on Data Mining
Workshops. IEEE Computer Society, 2011:398-404.
8. Hao J, He M. A Parallel FP-Growth Algorithm Based on GPU[C]. IEEE, International
Conference on E-Business Engineering. IEEE Computer Society, 2017:97-102.
9. Chang H Y, Lin J C, Cheng M L, et al. A Novel Incremental Data Mining Algorithm
Based on FP-growth for Big Data[C]. International Conference on NETWORKING
and Network Applications. IEEE, 2016:375-378.
10. Tsai C F, Lin Y C, Chen C P. A new fast algorithms for mining association rules in large
databases[C]. International Conference on Systems, Man and Cybernetics. IEEE, 2002:6 pp.
11. Sun D, Teng S, Zhang W, et al. An Algorithm to Improve the Effectiveness of
Apriori[C].International Conference on Cognitive Informatics. IEEE, 2007:385-390.
12. Heaton J. Comparing dataset characteristics that favor the Apriori, Eclat or FP-Growth
frequent itemset mining algorithms[C]. Southeastcon. IEEE, 2017:1-7.
13. Chen M, Gao X D, Li H F. An efficient parallel FP-Growth algorithm[C]. International
Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery.
IEEE, 2009:283-286.
14. Difallah D E, Benton R G, Raghavan V, et al. FAARM: Frequent Association Action
Rules Mining Using FP-Tree[C]. International Conference on Data Mining
Workshops.IEEE, 2011:398-404.
15. Hao J, He M. A Parallel FP-Growth Algorithm Based on GPU[C]. IEEE, International
Conference on E-Business Engineering. IEEE Computer Society, 2017:97-102.
16. Subbulakshmi B, Dharini B, Deisy C. Recent weighted maximal frequent itemsets
mining[C]. International Conference on I-Smac. IEEE, 2017:391-397.
17. Willhalm T, et al. FPTree: A Hybrid SCM-DRAM Persistent and Concurrent B-Tree for
Storage Class Memory[C]. International Conference on Management of Data. ACM,
2016:371-386.