0% found this document useful (0 votes)

221 views4 pages

E-commerce Product Recommendation Challenge

An e-commerce company wants to predict the top 3 categories each user might purchase from in the future based on their past transaction data. The training dataset contains user_id, product purchased, and order value for each transaction. Participants must predict the top 3 categories for each user in the test dataset and submit it in a CSV file with user_id and the 3 predicted categories (pred3) by July 17th. Submissions will be evaluated based on mean reciprocal rank and precision metrics which measure how accurate the top 3 predictions are for each user compared to actual future purchases.

Uploaded by

Tanmay Singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

221 views4 pages

E-commerce Product Recommendation Challenge

Uploaded by

Tanmay Singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

Problem Statement

An e-commerce company wants to recommend products to its users.

The company has collected only transaction data in the past. The
training dataset has only 3 columns - user_id, Product bought and
Order value of the product. Using this dataset, predict for all the users
in the training dataset, the top 3 categories that the user might buy
from.

Training dataset sample

aov = Order Value of the product

category = Product Category where the purchase was made

What do you need to predict?

For each user, predict the top 3 probable product categories that they
may purchase from, in the future.

Timeline
DEADLINE EXTENDED

21 DAYS LEFT

SUBMISSIONS OPEN SAT JUN 26

LAST DATE SAT JUL 17

Training Data

This file contains the detailed purchasing history for every user. It has
order value and the category of the product.

Training Data Target

This file contains data for some users about the category of items they
bought in future.

Test Data

This file contains the detailed purchasing history for some users. It has
the order value and the category of the product. You have to predict
the top 3 categories that the users with these user_ids will purchase
from in the future.

Evaluation
Measurements will be based on mean relevance rank
(mrr) and precision. Both the measurements are explained here.

Mean Relevance Rank

User Reciproca |
Products in the order shown Product bought
id Rank
E-readers, Kitchen Supplies, Phones, Comics,
1 1/3
Technology books Technology Books
2 Phones, Comics, Fruits None 0
3 Groceries, Fruits, Phones None 0
Fruits, Home Decor,
4 Phones, Home Decor, readers 1/2
readers
Home Decor, Home Furnishings,
5 Phones, Books, Fruits 0
Kitchen Supplies

Technology Books is the 3rd top prediction for user_id 1 and that is

the one bought by the user - hence the reciprocal rank is 1/rank of the
right prediction which is 1/3. If there is more than one product
matching, the reciprocal rank still takes only the first matching product.
For instance - user_id 4 though both Home Decor and readers are
matching, the first match product is at position 2 and hence the
reciprocal rank is 1/2. Once we get the reciprocal ranks, we do an
average of the reciprocal ranks to get the mean reciprocal rank.
Final MRR
= ( ⅓ + ½ ) / 2 = 0.41666

Precision or Accuracy

We first find the Number of products in the prediction in each row that
matches with the number of products of the user_id. We then average
this number across all valid predictions. For the above table, precision
would look like -

User
Products in the order shown Product bought Precision
id
E-readers, Kitchen Supplies, Phones, Comics, Technology
1 1
Technology books Books
2 Phones, Comics, Fruits None NA
3 Groceries, Fruits, Phones None NA
4 Phones, Home Decor, readers Fruits, Home Decor, readers 2
Home Decor, Home Furnishings,
5 Phones, Books, Fruits NA
Kitchen Supplies

For user_id 1, one product matched and for user_id 4, two products

matched. So, accuracy is

number of items that matched / number of unique users with a

prediction.

Here it will be 3/2 = 1.5

Recall in this case, the number of items for which there is a prediction =
2/5 = 0.4

Ready to submit?

Submissions should be made in the same format as the sample

submission provided.

Sample Submission
Submissions should be made in the same format as the sample
provided.

Sample Prediction Dataset

Prediction dataset should be a .csv file with 19,981 rows (and one row for
headers) and the columns user_id and pred3 in the same format as the file
below.

BigMart Sales Prediction Python Project
No ratings yet
BigMart Sales Prediction Python Project
5 pages
Data Analysis On BigMart Sales
67% (3)
Data Analysis On BigMart Sales
17 pages
Black Friday Sales Prediction Project
No ratings yet
Black Friday Sales Prediction Project
14 pages
DSP Research Paper by Shanmukh and Meher
No ratings yet
DSP Research Paper by Shanmukh and Meher
33 pages
Sales Prediction and Product Recommendation Model Through
No ratings yet
Sales Prediction and Product Recommendation Model Through
20 pages
Final DMT Report PDF
No ratings yet
Final DMT Report PDF
27 pages
Black Friday Sales
No ratings yet
Black Friday Sales
26 pages
HET Ka FML
No ratings yet
HET Ka FML
13 pages
SS Teamproject Documentation
No ratings yet
SS Teamproject Documentation
33 pages
Retail Sales Prediction Models
No ratings yet
Retail Sales Prediction Models
68 pages
DGS IA: Inquiry Process Document (IPD)
No ratings yet
DGS IA: Inquiry Process Document (IPD)
5 pages
1142pm - 1.EPRA JOURNALS 14814
No ratings yet
1142pm - 1.EPRA JOURNALS 14814
6 pages
Group11 DL Project Presentation
No ratings yet
Group11 DL Project Presentation
19 pages
Capstone Project 1 1
33% (3)
Capstone Project 1 1
4 pages
FML Micro Project
No ratings yet
FML Micro Project
12 pages
Coursera Capstone Project
No ratings yet
Coursera Capstone Project
4 pages
Laptop Price Pred
No ratings yet
Laptop Price Pred
11 pages
E Commerce
No ratings yet
E Commerce
20 pages
Seippel MA Eemcs
No ratings yet
Seippel MA Eemcs
95 pages
Big Mart Sales Prediction Using Machine Learning Report PDF
No ratings yet
Big Mart Sales Prediction Using Machine Learning Report PDF
56 pages
Synopsis-Big Mart Sales Prediction
No ratings yet
Synopsis-Big Mart Sales Prediction
3 pages
Chetan Research Paper
No ratings yet
Chetan Research Paper
7 pages
Big Mart Project Report
No ratings yet
Big Mart Project Report
19 pages
PPIR
No ratings yet
PPIR
8 pages
Retail Sales Forecasting Model
No ratings yet
Retail Sales Forecasting Model
8 pages
Bigmart Sales Prediction Analysis
No ratings yet
Bigmart Sales Prediction Analysis
47 pages
ML Project
100% (1)
ML Project
10 pages
IJCRT2105404 Bigmart 4
No ratings yet
IJCRT2105404 Bigmart 4
4 pages
Java
No ratings yet
Java
34 pages
ML Project Stage 2
No ratings yet
ML Project Stage 2
9 pages
Group 9 Paper Presentation
No ratings yet
Group 9 Paper Presentation
24 pages
GStore Revenue Prediction Analysis
No ratings yet
GStore Revenue Prediction Analysis
15 pages
U2431791 DS7010 2324 T2 Introduction
No ratings yet
U2431791 DS7010 2324 T2 Introduction
2 pages
RP 3
No ratings yet
RP 3
12 pages
Major ppt-1
No ratings yet
Major ppt-1
13 pages
Final PBL of Aaryan & Satyam
No ratings yet
Final PBL of Aaryan & Satyam
19 pages
Electronics 13 03953 v2
No ratings yet
Electronics 13 03953 v2
29 pages
E-Retail Customer Behavior Analysis
No ratings yet
E-Retail Customer Behavior Analysis
10 pages
Lecture 11
No ratings yet
Lecture 11
50 pages
Retail Sales Prediction Report
No ratings yet
Retail Sales Prediction Report
9 pages
Machine Learning Project
No ratings yet
Machine Learning Project
10 pages
Ammmp2023 87 94
No ratings yet
Ammmp2023 87 94
8 pages
Majorpptfin
No ratings yet
Majorpptfin
19 pages
Assignment 2
No ratings yet
Assignment 2
6 pages
Mini PRJCT
No ratings yet
Mini PRJCT
11 pages
Big Mart Sales Analysis
No ratings yet
Big Mart Sales Analysis
4 pages
Machine Learning PBL
No ratings yet
Machine Learning PBL
9 pages
Black Friday Sales Analysis & Predictions
No ratings yet
Black Friday Sales Analysis & Predictions
16 pages
Final Year Project
No ratings yet
Final Year Project
41 pages
SUKUMARREVIEWPPT2
No ratings yet
SUKUMARREVIEWPPT2
24 pages
QuickBasket Basket Breakdown
No ratings yet
QuickBasket Basket Breakdown
4 pages
A Novel Approach To Optimizing Customer Profiles in Relation To Business Metrics
No ratings yet
A Novel Approach To Optimizing Customer Profiles in Relation To Business Metrics
11 pages
Big Mart Outlets
100% (2)
Big Mart Outlets
11 pages
Improvizing Big Market Sales Prediction: Meghana N
No ratings yet
Improvizing Big Market Sales Prediction: Meghana N
7 pages
Master Theses MBirkeland
No ratings yet
Master Theses MBirkeland
70 pages
Retail Sales Prediction Using Machine Learning Algorithms
No ratings yet
Retail Sales Prediction Using Machine Learning Algorithms
9 pages
Milestone Adv Test 03 Paper 01 Class 12th Phase 01-08-06 2025 Solution
No ratings yet
Milestone Adv Test 03 Paper 01 Class 12th Phase 01-08-06 2025 Solution
8 pages
Chiller Data Sheet
No ratings yet
Chiller Data Sheet
1 page
There Are Three Main Types of Production Functions Based On Returns To Scale
No ratings yet
There Are Three Main Types of Production Functions Based On Returns To Scale
3 pages
Polymer Hydrogen Bond Analysis
No ratings yet
Polymer Hydrogen Bond Analysis
8 pages
Convert Spool to PDF and Email
No ratings yet
Convert Spool to PDF and Email
7 pages
Courier 6 HX: High-Performance On-Stream Solution Analyzer System From Outokumpu Technology
No ratings yet
Courier 6 HX: High-Performance On-Stream Solution Analyzer System From Outokumpu Technology
8 pages
Silex
No ratings yet
Silex
112 pages
Powerpoint Biology 2
No ratings yet
Powerpoint Biology 2
30 pages
Galileo's Inclined Plane Experiment
No ratings yet
Galileo's Inclined Plane Experiment
9 pages
Ies PR
No ratings yet
Ies PR
25 pages
Ada 28-2008-R2013
No ratings yet
Ada 28-2008-R2013
19 pages
Individual Footings (17.12.09) EDIT by J3
No ratings yet
Individual Footings (17.12.09) EDIT by J3
32 pages
OKI MAnual
No ratings yet
OKI MAnual
1,267 pages
Charpy Impact Test Guide
No ratings yet
Charpy Impact Test Guide
5 pages
Span1 Advice
No ratings yet
Span1 Advice
3 pages
Dual-Band Flat Antenna For Polarization Diversity With High Isolation
No ratings yet
Dual-Band Flat Antenna For Polarization Diversity With High Isolation
4 pages
Transport Query - Variants &amp Layouts
No ratings yet
Transport Query - Variants &amp Layouts
15 pages
Reservoir Connectivity Insights
No ratings yet
Reservoir Connectivity Insights
42 pages
Log WAUACJ8V5E1017425 147389km 91583mi
No ratings yet
Log WAUACJ8V5E1017425 147389km 91583mi
14 pages
Forex Trading for Advanced Traders
No ratings yet
Forex Trading for Advanced Traders
10 pages
RHS 100x60x4.0
No ratings yet
RHS 100x60x4.0
2 pages
Highway Design Training Guide
No ratings yet
Highway Design Training Guide
30 pages
Organic Chemistry: Structure & Bonding
No ratings yet
Organic Chemistry: Structure & Bonding
18 pages
NMOS and CMOS Transistor Analysis
No ratings yet
NMOS and CMOS Transistor Analysis
14 pages
Modern Sensor Technologies
No ratings yet
Modern Sensor Technologies
19 pages
Iia-4. Permutations and Combinations
100% (2)
Iia-4. Permutations and Combinations
5 pages
Population-Scale Genomic Data Augmentation Based On Conditional Generative Adversarial Networks
No ratings yet
Population-Scale Genomic Data Augmentation Based On Conditional Generative Adversarial Networks
6 pages
Electrostatics DPP 13 Insp
No ratings yet
Electrostatics DPP 13 Insp
5 pages
Quiz 1
No ratings yet
Quiz 1
5 pages
Grade 7 Term 3 Energy Review
100% (1)
Grade 7 Term 3 Energy Review
9 pages

E-commerce Product Recommendation Challenge

Uploaded by

E-commerce Product Recommendation Challenge

Uploaded by

Problem Statement

An e-commerce company wants to recommend products to its users.

Training dataset sample

aov = Order Value of the product

category = Product Category where the purchase was made

What do you need to predict?

SUBMISSIONS OPEN SAT JUN 26

LAST DATE SAT JUL 17

Training Data Target

Mean Relevance Rank

Technology Books is the 3rd top prediction for user_id 1 and that is

For user_id 1, one product matched and for user_id 4, two products

number of items that matched / number of unique users with a

Here it will be 3/2 = 1.5

Submissions should be made in the same format as the sample

Sample Prediction Dataset

You might also like