0% found this document useful (0 votes)

247 views9 pages

Answer To Assignment 3

This document provides the details of an assignment involving market basket analysis of transaction data. The key steps are: 1) The document identifies all frequent itemsets and sequential patterns in the transaction data using the Apriori algorithm, finding itemsets and sequences that meet minimum support thresholds. 2) It derives association rules from the frequent itemsets that meet minimum confidence thresholds. 3) Based on the analysis, a recommendation is made to store management to place certain items near each other to encourage customers to purchase them together.

Uploaded by

lastindor

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

247 views9 pages

Answer To Assignment 3

Uploaded by

lastindor

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

ACCTG 6910, Spring 2003

DESB, University of Utah

Assignment 3 (3/27 – 4/8)

Question 1(50 points): Given the following transactions and minimum support - 50%
and minimum confidence - 80% large item sets, sequential patterns, rules, lifts,
recommend some management decisions

TID Brand_Item_bought
100 King’s-Crab, Sunset-Milk, Dairyland-Cheese, Best-Bread
200 Best-Cheese, Dairyland-Milk, Goldenfarm-Apple, Tasty-Pie, Wonder-Bread
300 Westcoast-Apple, Dairyland-Milk, Wonder-Bread, Tasty-Pie
400 Wonder-Bread, Sunset-Milk, Dairyland-Cheese

a) At the granularity of item without brand (e.g., “milk” and “bread”), please identify all
large itemsets using the Apriori algorithm. Be sure to include all steps in Apriori, i.e.,
Large (k-1)-itemset  Candidate k-itemset (Join, Prune)  Large k-itemset.

Step 1: Identify all large 1-itemsets

{Apple} 2/4 = 50%
{Bread} 4/4 = 100%
{Cheese} 3/4 = 75%
{Milk} 4/4 = 100%
{Pie} 2/4 = 50%

Step 2: Generate Candidate 2-itemsets by join

{Apple, Bread} {Apple, Cheese} {Apple, Milk} {Apple, Pie}
{Bread, Cheese} {Bread, Milk} {Bread, Pie}
{Cheese, Milk} {Cheese, Pie}
{Milk, Pie}

Step 3: Identify large 2-itemsets

{Apple, Bread} 2/4 = 50%
{Apple, Milk} 2/4 = 50%
{Apple, Pie} 2/4 = 50%
{Bread, Cheese} 3/4 = 75%
{Bread, Milk} 4/4 = 100%
{Bread, Pie} 2/4 = 50%
{Cheese, Milk} 3/4 = 75%
{Milk, Pie} 2/4 = 50%
Step 4: Generate candidate 3-itemsets by join
{Apple, Bread, Milk} {Apple, Bread, Pie} {Apple, Milk, Pie}
{Bread, Cheese, Milk} {Bread, Cheese, Pie} {Bread, Milk, Pie}

Step 5: Prune candidate 3-itemsets

{Apple, Bread, Milk} {Apple, Bread, Pie} {Apple, Milk, Pie}
{Bread, Cheese, Milk} {Bread, Milk, Pie}

{Bread, Cheese, Pie} is pruned because its subset {Cheese, Pie} is not large 2-
itemset.

Step 6: Identify Large 3-itemsets

{Apple, Bread, Milk} 2/4 = 50%
{Apple, Bread, Pie} 2/4 = 50%
{Apple, Milk, Pie} 2/4 = 50%
{Bread, Cheese, Milk} 3/4 = 75%
{Bread, Milk, Pie} 2/4 = 50%

Step 7: Generate candidate 4-itemsets by join

{Apple, Bread, Milk, Pie}

Step 8: prune candidate 4-itemsets

{Apple, Bread, Milk, Pie}

Step 9: Identify Large 4-itemsets

{Apple, Bread, Milk, Pie} 2/4 = 50%

b) At the granularity of brand-item (e.g., “Sunset-Milk” and “Wonder-Bread”), please

identify all large itemsets using the Apriori algorithm. Be sure to include all steps in
Apriori, i.e., Large (k-1)-itemset  Candidate k-itemset (Join, Prune)  Large k-
itemset.

Step 1: Identify all large 1-itemsets

{Dairyland-Cheese} 2/4 = 50%
{Dairyland-Milk} 2/4 = 50%
{Sunset-Milk} 2/4 = 50%
{Tasty-Pie} 2/4 = 50%
{Wonder-Bread} 3/4 = 75%

Step 2: Generate candidate 2-itemsets by join

{Dairyland-Cheese, Dairyland-Milk} {Dairyland-Cheese, Sunset-Milk}
{Dairyland-Cheese, Tasty-Pie} {Dairyland-Cheese, Wonder-Bread}
{Dairyland-Milk, Sunset-Milk} {Dairyland-Milk, Tasty-Pie}
{ Dairyland-Milk, Wonder-Bread} {Sunset-Milk, Tasty-Pie}
{Sunset-Milk, Wonder-Bread} {Tasty-Pie, Wonder-Bread }

Step 3: Identify large 2-itemsets

{Dairyland-Cheese, Sunset-Milk} 2/4 = 50%
{Dairyland-Milk, Tasty-Pie} 2/4 = 50%
{Dairyland-Milk, Wonder-Bread} 2/4 = 50%
{Tasty-Pie, Wonder-Bread} 2/4 = 50%

Step 4: Generate candidate 3-itemsets by join

{Dairyland-Milk, Tasty-Pie, Wonder-Bread}

Step 5: Prune candidate 3-itemsets

{Dairyland-Milk, Tasty-Pie, Wonder-Bread}

Step 6: Identify Large 3-itemsets

{Dairyland-Milk, Tasty-Pie, Wonder-Bread} 2/4 = 50%

c) Please list all association rules (i.e., association rules that meet minimum support and
minimum confidence requirements) derived from the itemsets you derived in b) and
their supports, confidences and lifts.

Dairyland-Cheese => Sunset-Milk

support = 50% confidence = 50%/50% = 100% lift = 100%/50% = 2

Sunset-Milk => Dairyland-Cheese

support = 50% confidence = 50%/50% = 100% lift = 100%/50% = 2

Dairyland-Milk => Tasty-Pie

support = 50% confidence = 50%/50% = 100% lift = 100%/50% = 2

Tasty-Pie => Dairyland-Milk

support = 50% confidence = 50%/50% = 100% lift = 100%/50% = 2

Dairyland-Milk => Wonder-Bread

support = 50% confidence = 50%/50% = 100% lift = 100%/75% = 1.33

Tasty-Pie => Wonder-Bread

support = 50% confidence = 50%/50% = 100% lift = 100%/75% = 1.33

Dairyland-Milk ∧ Tasty-Pie => Wonder-Bread

support = 50% confidence = 50%/50% = 100% lift = 100%/75% = 1.33

Dairyland-Milk ∧Wonder-Bread => Tasty-Pie

support = 50% confidence = 50%/50% = 100% lift = 100%/50% = 2

Tasty-Pie ∧Wonder-Bread => Dairyland-Milk

support = 50% confidence = 50%/50% = 100% lift = 100%/50% = 2

Dairyland-Milk => Tasty-Pie ∧Wonder-Bread

support = 50% confidence = 50%/50% = 100% lift = 100%/50% = 2

Tasty-Pie => Dairyland-Milk ∧Wonder-Bread

support = 50% confidence = 50%/50% = 100% lift = 100%/50% = 2

d) Please give one recommendation (e.g., store layout or promotion) to store

management based on the association rules and large item sets you discovered.

The store can put the Tasty-Pie and Wonder-Bread near the Dairyland-Milk to further
encourage the customer to buy them together.

Question 2 (25 points): Let the minimum support be 60% when you derive large
sequences from the following transaction database.

Customer ID Transaction ID Items

A 100 1,2
A 200 3,4
A 300 5,6
A 400 1,2
B 500 1
B 600 3
B 700 5
B 800 1
C 900 2
C 1000 4
C 1100 6
C 1200 2

a) Please identify all large sequencies using the Apriori algorithm. Be sure to include all
steps in Apriori, i.e., Large (k-1)-sequences  Candidate k-sequencies (Join, Prune) 
Large k-sequences.

Version 1 (no repetitive itemsets in sequences)

Step 1: Identify large 1-sequencies
<{1}> 2/3 = 66.67%
<{2}> 2/3 = 66.67%
<{3}> 2/3 = 66.67%
<{4}> 2/3 = 66.67%
<{5}> 2/3 = 66.67%
<{6}> 2/3 = 66.67%

Step 2: Generate candidate 2-sequencies by join

<{1}, {2}> <{2}, {1}> <{1}, {3}> <{3}, {1}>
<{1}, {4}> <{4}, {1}> <{1}, {5}> <{5}, {1}> <{1}, {6}> <{6}, {1}>
<{2}, {3}> <{3}, {2}> <{2}, {4}> <{4}, {2}>
<{2}, {5}> <{5}, {2}> <{2}, {6}> <{6}, {2}>
<{3}, {4}> <{4}, {3}> <{3}, {5}> <{5}, {3}>
<{3}, {6}> <{6}, {3}>
<{4}, {5}> <{5}, {4}> <{4}, {6}> <{6}, {4}>
<{5}, {6}> <{6}, {5}>

Step 3: Identify large 2-sequencies

<{1}, {3}> 2/3 = 66.67%
<{1}, {5}> 2/3 = 66.67%
<{2}, {4}> 2/3 = 66.67%
<{2}, {6}> 2/3 = 66.67%
<{3}, {1}> 2/3 = 66.67%
<{3}, {5}> 2/3 = 66.67%
<{4}, {2}> 2/3 = 66.67%
<{4}, {6}> 2/3 = 66.67%
<{5}, {1}> 2/3 = 66.67%
<{6}, {2}> 2/3 = 66.67%

Step 4: Generate candidate 3-sequencies by join

<{1}, {3}, {5}> <{1}, {5}, {3}>
<{2}, {4}, {6}> <{2}, {6}, {4}>
<{3}, {1}, {5}> <{3}, {5}, {1}>
<{4}, {2}, {6}> <{4}, {6}, {2}>

Step 4: Prune candidate 3-sequencies

<{1}, {3}, {5}>
<{2}, {4}, {6}>
<{3}, {1}, {5}> <{3}, {5}, {1}>
<{4}, {2}, {6}> <{4}, {6}, {2}>

Step 5: Identify large 3-sequencies

<{1}, {3}, {5}> 2/3 = 66.67%
<{2}, {4}, {6}> 2/3 = 66.67%
<{3}, {5}, {1}> 2/3 = 66.67%
<{4}, {6}, {2}> 2/3 = 66.67%

Step 6: Generate candidate 4-sequencies by join

no 4-sequence can be generated.
Version 2 (repetitive itemsets included in sequences)
Step 1: Identify large 1-sequencies
<{1}> 2/3 = 66.67%
<{2}> 2/3 = 66.67%
<{3}> 2/3 = 66.67%
<{4}> 2/3 = 66.67%
<{5}> 2/3 = 66.67%
<{6}> 2/3 = 66.67%

Step 2: Generate candidate 2-sequencies by join

<{1}, {1}> <{1}, {2}> <{2}, {1}> <{1}, {3}> <{3}, {1}>
<{1}, {4}> <{4}, {1}> <{1}, {5}> <{5}, {1}> <{1}, {6}> <{6}, {1}>
<{2}, {2}> <{2}, {3}> <{3}, {2}> <{2}, {4}> <{4}, {2}>
<{2}, {5}> <{5}, {2}> <{2}, {6}> <{6}, {2}>
<{3}, {3}> <{3}, {4}> <{4}, {3}> <{3}, {5}> <{5}, {3}>
<{3}, {6}> <{6}, {3}>
<{4}, {4}> <{4}, {5}> <{5}, {4}> <{4}, {6}> <{6}, {4}>
<{5}, {5}> <{5}, {6}> <{6}, {5}>
<{6}, {6}>

Step 3: Identify large 2-sequencies

<{1}, {1}> 2/3 = 66.67%
<{1}, {3}> 2/3 = 66.67%
<{1}, {5}> 2/3 = 66.67%
<{2}, {2}> 2/3 = 66.67%
<{2}, {4}> 2/3 = 66.67%
<{2}, {6}> 2/3 = 66.67%
<{3}, {1}> 2/3 = 66.67%
<{3}, {5}> 2/3 = 66.67%
<{4}, {2}> 2/3 = 66.67%
<{4}, {6}> 2/3 = 66.67%
<{5}, {1}> 2/3 = 66.67%
<{6}, {2}> 2/3 = 66.67%

Step 4: Generate candidate 3-sequencies by join

<{1}, {1}, {1}> <{1}, {1}, {3}> <{1}, {3}, {1}> <{1}, {3}, {3}>
<{1}, {1}, {5}> <{1}, {5}, {1}> <{1}, {5}, {5}>
<{1}, {3}, {5}> <{1}, {5}, {3}>
<{2}, {2}, {2}> <{2}, {2}, {4}> <{2}, {4}, {2}> <{2}, {4}, {4}>
<{2}, {2}, {6}> <{2}, {6}, {2}> <{2}, {6}, {6}>
<{2}, {4}, {6}> <{2}, {6}, {4}>
<{3}, {1}, {1}> <{3}, {1}, {5}> <{3}, {5}, {1}> <{3}, {5}, {5}>
<{4}, {2}, {2}> <{4}, {2}, {6}> <{4}, {6}, {2}> <{4}, {6}, {6}>
<{5}, {1}, {1}> <{6}, {2}, {2}>
Step 4: Prune candidate 3-sequencies
<{1}, {1}, {1}> <{1}, {1}, {3}> <{1}, {3}, {1}>
<{1}, {1}, {5}> <{1}, {5}, {1}>
<{1}, {3}, {5}>
<{2}, {2}, {4}> <{2}, {4}, {2}> <{2}, {2}, {6}> <{2}, {6}, {2}>
<{2}, {4}, {6}>
<{3}, {1}, {1}> <{3}, {1}, {5}> <{3}, {5}, {1}>
<{4}, {2}, {2}> <{4}, {2}, {6}> <{4}, {6}, {2}>
<{5}, {1}, {1}> <{6}, {2}, {2}>

Step 5: Identify large 3-sequencies

<{1}, {3}, {1}> 2/3 = 66.67%
<{1}, {3}, {5}> 2/3 = 66.67%
<{1}, {5}, {1}> 2/3 = 66.67%
<{2}, {4}, {2}> 2/3 = 66.67%
<{2}, {4}, {6}> 2/3 = 66.67%
<{2}, {6}, {2}> 2/3 = 66.67%
<{3}, {5}, {1}> 2/3 = 66.67%
<{4}, {6}, {2}> 2/3 = 66.67%

Step 6: Generate candidate 4-sequencies by join

<{1}, {3}, {1}, {1}> <{1}, {3}, {1}, {5}> <{1}, {3}, {5}, {1}> <{1}, {3}, {5}, {5}>
<{1}, {5}, {1}, {1}>
<{2}, {4}, {2}, {2}> <{2}, {4}, {2}, {6}> <{2}, {4}, {6}, {2}> <{2}, {4}, {6}, {6}>
<{2}, {6}, {2}, {2}>
<{3}, {5}, {1}, {1}>
<{4}, {6}, {2}, {2}>

Step 7: Prune candidate 4-sequencies

<{1}, {3}, {5}, {1}>
<{2}, {4}, {6}, {2}>

Step 8: Identify large 4-sequencies

<{1}, {3}, {5}, {1}> 2/3 = 66.67%
<{2}, {4}, {6}, {2}> 2/3 = 66.67%

Step 9: Generate candidate 5-sequencies

no 5-sequencies since the largest number of transactions of one customer is 4 in term of
the given dataset.

Question3 (25 points): Go to an ecommerce web site such as [Link] or [Link].

Discover and describe one application of the use association rules or sequential patterns.
Please comment on whether it is effective or needs improvement.

In [Link], when you are looking at description of a book, it also provides

you the information about the books that the customers who bought this book also
bought, the title that the customers are interested in may also be interested in, and the
customers who bought this book may also buy the books by other authors. This correlated
information about the book you are going to buy is provided by association rules, which
are mined from the past sales transactions. It is effective if [Link] wants to
recommend relevant books to the customer who is going to buy a book of certain topic.
However, we do not know if the [Link] sort the associated books according to the
support, confidence or lift, which may be helpful for the customer to locate the books
they really need efficiently.

DWM Exp8
No ratings yet
DWM Exp8
8 pages
Chota Bheem
No ratings yet
Chota Bheem
6 pages
Algorithm
No ratings yet
Algorithm
8 pages
Solutions To All Problem (1) - Compressed
No ratings yet
Solutions To All Problem (1) - Compressed
25 pages
Ex 1
No ratings yet
Ex 1
8 pages
Fa22-Bcs-025 MOAZ Assignment 1
No ratings yet
Fa22-Bcs-025 MOAZ Assignment 1
9 pages
Weantuday: T Deuhh Anytha
No ratings yet
Weantuday: T Deuhh Anytha
23 pages
Mining Frequent Itemsets and Rules
No ratings yet
Mining Frequent Itemsets and Rules
27 pages
Ex 9 TH
No ratings yet
Ex 9 TH
7 pages
1 Lab Program 3 2 Vinay Sirohi 3 2139472: December 1, 2021
No ratings yet
1 Lab Program 3 2 Vinay Sirohi 3 2139472: December 1, 2021
6 pages
Apriori Algorithm Explained
No ratings yet
Apriori Algorithm Explained
4 pages
Market Basket Analysis & Apriori Algorithm
No ratings yet
Market Basket Analysis & Apriori Algorithm
10 pages
R - Practical
No ratings yet
R - Practical
50 pages
Exp 9
No ratings yet
Exp 9
9 pages
Data Mining 2, 3 Material
No ratings yet
Data Mining 2, 3 Material
173 pages
Apriori Algorithm Case Study
No ratings yet
Apriori Algorithm Case Study
3 pages
ML Algorithm
No ratings yet
ML Algorithm
12 pages
De Exp 3
No ratings yet
De Exp 3
6 pages
Apriori Algorithm Examples
No ratings yet
Apriori Algorithm Examples
45 pages
Datamining Lect2 Frequent
No ratings yet
Datamining Lect2 Frequent
59 pages
Association Rules
No ratings yet
Association Rules
58 pages
DataMining Chapter2
No ratings yet
DataMining Chapter2
8 pages
Apriori Algorithm
No ratings yet
Apriori Algorithm
3 pages
Apriori Algorithm
No ratings yet
Apriori Algorithm
7 pages
BDA Module 5
No ratings yet
BDA Module 5
212 pages
Apriori Algorithm Example Problems
100% (1)
Apriori Algorithm Example Problems
8 pages
Big Data Analytics Unit3
No ratings yet
Big Data Analytics Unit3
27 pages
Association Rule Mining Guide
No ratings yet
Association Rule Mining Guide
67 pages
Association Rule Mining Explained
No ratings yet
Association Rule Mining Explained
5 pages
Data Science for Bookstore Revival
100% (1)
Data Science for Bookstore Revival
29 pages
Apriori Algorithm
No ratings yet
Apriori Algorithm
13 pages
Mining Frequent Patterns in Transactions
No ratings yet
Mining Frequent Patterns in Transactions
37 pages
Association Rules
No ratings yet
Association Rules
24 pages
L2: Frequent Itemsets Mining and Association Rules
No ratings yet
L2: Frequent Itemsets Mining and Association Rules
54 pages
Association Rule
No ratings yet
Association Rule
5 pages
Rule Mining
No ratings yet
Rule Mining
20 pages
Bigdata Section4
No ratings yet
Bigdata Section4
18 pages
Equent Patterns
No ratings yet
Equent Patterns
74 pages
Unit 4
No ratings yet
Unit 4
97 pages
Module 5 - Frequent Pattern Mining
No ratings yet
Module 5 - Frequent Pattern Mining
111 pages
Solutions For Tutorial Exercises Association Rule Mining.: Exercise 1. Apriori
No ratings yet
Solutions For Tutorial Exercises Association Rule Mining.: Exercise 1. Apriori
5 pages
Ass 2
No ratings yet
Ass 2
3 pages
Slides
No ratings yet
Slides
92 pages
06 FPBasic
No ratings yet
06 FPBasic
77 pages
Associationrule 1
No ratings yet
Associationrule 1
30 pages
Additional Exercises
No ratings yet
Additional Exercises
4 pages
Data Mining for Retail Insights
No ratings yet
Data Mining for Retail Insights
44 pages
Data Mining - Module2
No ratings yet
Data Mining - Module2
112 pages
Class 4-Associative Analysis
No ratings yet
Class 4-Associative Analysis
42 pages
Experiment No. 9
No ratings yet
Experiment No. 9
9 pages
Association: Market Basket Analysis
No ratings yet
Association: Market Basket Analysis
40 pages
Frequent Itemsets in Data Mining
No ratings yet
Frequent Itemsets in Data Mining
105 pages
Interesting Python
No ratings yet
Interesting Python
5 pages
Haunted UK Mines and Ghostly Legends
No ratings yet
Haunted UK Mines and Ghostly Legends
5 pages
CBSE Class 9 Maths Sample Question Paper (Set 1) 2024-25 FREE PDF
No ratings yet
CBSE Class 9 Maths Sample Question Paper (Set 1) 2024-25 FREE PDF
41 pages
Atividades 3º Bimestre Texto Todas
No ratings yet
Atividades 3º Bimestre Texto Todas
1 page
MANG3072 GRADE DESCRIPTOR - Main
No ratings yet
MANG3072 GRADE DESCRIPTOR - Main
3 pages
Walking System
No ratings yet
Walking System
2 pages
Required Reading Instincts and Their Vicissitudes
No ratings yet
Required Reading Instincts and Their Vicissitudes
7 pages
(Bail Matters) : Versus
No ratings yet
(Bail Matters) : Versus
358 pages
A Phylogenetic Study of Some Septoria Species Pathogenic To Asteraceae Based On ITS Ribosomal DNA - 0
No ratings yet
A Phylogenetic Study of Some Septoria Species Pathogenic To Asteraceae Based On ITS Ribosomal DNA - 0
9 pages
Understanding the Filipino Calendar
No ratings yet
Understanding the Filipino Calendar
3 pages
Ear Health Assessment Guide
No ratings yet
Ear Health Assessment Guide
7 pages
CBS News Poll Finds Tipping Expectations Have Grown
No ratings yet
CBS News Poll Finds Tipping Expectations Have Grown
6 pages
LT739 QCMD EQA For Molecular Infectious Disease Testing JAN25
No ratings yet
LT739 QCMD EQA For Molecular Infectious Disease Testing JAN25
84 pages
Hunger Games & Across the Alley Analysis
No ratings yet
Hunger Games & Across the Alley Analysis
9 pages
Lesson Plan Paed
No ratings yet
Lesson Plan Paed
10 pages
Legal Ethics Reviewer
No ratings yet
Legal Ethics Reviewer
6 pages
Kedah STPM 2009 Biology Marking Scheme
No ratings yet
Kedah STPM 2009 Biology Marking Scheme
14 pages
Diagnostic Test Finals
100% (1)
Diagnostic Test Finals
4 pages
Apollon Chapter 3: Infirmary Reflections
No ratings yet
Apollon Chapter 3: Infirmary Reflections
19 pages
Math Lesson Slides 2 8-2 11
No ratings yet
Math Lesson Slides 2 8-2 11
36 pages
GoldenGate Sync for DBAs
100% (1)
GoldenGate Sync for DBAs
5 pages
Companies Act 2017 (Test)
No ratings yet
Companies Act 2017 (Test)
69 pages
Styles of Communication
No ratings yet
Styles of Communication
3 pages
Beneficial Gift in Explaining The Usool of Haddadiyyah
No ratings yet
Beneficial Gift in Explaining The Usool of Haddadiyyah
19 pages
BOOK V DAVIDSON
100% (2)
BOOK V DAVIDSON
15 pages
Lesson Plan Rubrics for Assessment
No ratings yet
Lesson Plan Rubrics for Assessment
1 page
Apology and Love Letter to Duduu
No ratings yet
Apology and Love Letter to Duduu
2 pages
Fule V Legare
No ratings yet
Fule V Legare
1 page
Final Exam April 2014
No ratings yet
Final Exam April 2014
3 pages
Chemistry 211 Experiment 10
No ratings yet
Chemistry 211 Experiment 10
9 pages
St. Petri Messenger: August, 2020
No ratings yet
St. Petri Messenger: August, 2020
9 pages

Answer To Assignment 3

Uploaded by

Answer To Assignment 3

Uploaded by

ACCTG 6910, Spring 2003

DESB, University of Utah

Step 1: Identify all large 1-itemsets

Step 2: Generate Candidate 2-itemsets by join

Step 3: Identify large 2-itemsets

Step 5: Prune candidate 3-itemsets

Step 6: Identify Large 3-itemsets

Step 7: Generate candidate 4-itemsets by join

Step 8: prune candidate 4-itemsets

Step 9: Identify Large 4-itemsets

b) At the granularity of brand-item (e.g., “Sunset-Milk” and “Wonder-Bread”), please

Step 1: Identify all large 1-itemsets

Step 2: Generate candidate 2-itemsets by join

Step 3: Identify large 2-itemsets

Step 4: Generate candidate 3-itemsets by join

Step 5: Prune candidate 3-itemsets

Step 6: Identify Large 3-itemsets

Dairyland-Cheese => Sunset-Milk

Sunset-Milk => Dairyland-Cheese

Dairyland-Milk => Tasty-Pie

Tasty-Pie => Dairyland-Milk

Dairyland-Milk => Wonder-Bread

Tasty-Pie => Wonder-Bread

Dairyland-Milk ∧ Tasty-Pie => Wonder-Bread

Dairyland-Milk ∧Wonder-Bread => Tasty-Pie

Tasty-Pie ∧Wonder-Bread => Dairyland-Milk

Dairyland-Milk => Tasty-Pie ∧Wonder-Bread

Tasty-Pie => Dairyland-Milk ∧Wonder-Bread

d) Please give one recommendation (e.g., store layout or promotion) to store

Customer ID Transaction ID Items

Version 1 (no repetitive itemsets in sequences)

Step 2: Generate candidate 2-sequencies by join

Step 3: Identify large 2-sequencies

Step 4: Generate candidate 3-sequencies by join

Step 4: Prune candidate 3-sequencies

Step 5: Identify large 3-sequencies

Step 6: Generate candidate 4-sequencies by join

Step 2: Generate candidate 2-sequencies by join

Step 3: Identify large 2-sequencies

Step 4: Generate candidate 3-sequencies by join

Step 5: Identify large 3-sequencies

Step 6: Generate candidate 4-sequencies by join

Step 7: Prune candidate 4-sequencies

Step 8: Identify large 4-sequencies

Step 9: Generate candidate 5-sequencies

Question3 (25 points): Go to an ecommerce web site such as [Link] or [Link].

In [Link], when you are looking at description of a book, it also provides

You might also like