SNo.
Topic
Sign
Perform the following operations on Sales.csv
file.
1 Consider the Sales.csv file and summarize
the total number of customers in each
country.
2 Calculate the cumulative sales of all
products after each month.
3 Calculate the monthly % increase or %
decrease of sales of all products in each
country.
4 Summarize the data to show which
product has maximum sales over all the
products in all the countries.
5 Use a pivot table to subtotal the total
quantity ordered per date.
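Items 2 and 3 above come down to a running total and a month-over-month percentage change. The following is only a minimal sketch in Python: the layout of Sales.csv is not given here, so the `monthly_sales` figures and both function names are hypothetical stand-ins for values aggregated from the file.

```python
from itertools import accumulate

# Hypothetical monthly totals; real values would be aggregated from Sales.csv.
monthly_sales = [("Jan", 100.0), ("Feb", 150.0), ("Mar", 120.0)]


def cumulative_sales(rows):
    """Return (month, running total of sales up to and including that month)."""
    totals = accumulate(amount for _, amount in rows)
    return [(month, total) for (month, _), total in zip(rows, totals)]


def monthly_pct_change(rows):
    """Return (month, % change versus the previous month).

    The first month has no previous month, so its change is None.
    A previous month with zero sales also yields None (undefined change).
    """
    changes = [(rows[0][0], None)] if rows else []
    for (_, prev), (month, cur) in zip(rows, rows[1:]):
        pct = None if prev == 0 else (cur - prev) / prev * 100
        changes.append((month, pct))
    return changes


if __name__ == "__main__":
    print(cumulative_sales(monthly_sales))
    print(monthly_pct_change(monthly_sales))
```

A positive value is a % increase and a negative value is a % decrease. For the per-country version of item 3, the same function can be applied to each country's monthly totals separately.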
Perform the following operations on Sales.csv
file
1. Find the maximum profit gained across all
products in each region for each month.
2. Summarize the sales of all products
over all years, broken down by quarter
and then by month (with subtotals).
3. Compare the product with the
maximum sales against every other
product, as a percentage.
Perform the following operations on
FoodSales.csv
1. Summarize the data to show which
product has shown maximum sales.
2. Summarize the data to show which
salesperson has the maximum
sales of all products in each region.
3. Given the cost and selling price of
each product, find out the maximum
profit gained by all products over
each region by each month.
4
Perform the following operations on
FoodSales.csv
1. Display the total sales of all products for
all years in all stores.
2. Compare total sales for all stores, product
by product, between year 2005 and 2006.
3. Show comparison of total sales for all
stores, product by product, between years
2005 and 2006 only for those products
with reduced sales.
4. Show comparison of sales by individual
stores, product by product, between years
2005 and 2006 only for those products
with reduced sales.
5. Show the result of the previous query with
the rows and columns swapped.
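The 2005 versus 2006 comparison in the items above (totals per product, only products with reduced sales, then rows and columns swapped) can be sketched in plain Python. This is an illustration under assumptions: the column layout of FoodSales.csv is not given here, so the `records` tuples and all function names are hypothetical.

```python
from collections import defaultdict

# Hypothetical (store, product, year, sales) records standing in for FoodSales.csv.
records = [
    ("S1", "Bread", 2005, 200), ("S1", "Bread", 2006, 150),
    ("S2", "Bread", 2005, 100), ("S2", "Bread", 2006, 120),
    ("S1", "Milk", 2005, 80), ("S1", "Milk", 2006, 90),
]


def totals_by_product_year(rows):
    """Sum sales over all stores, keyed by product and then by year."""
    totals = defaultdict(lambda: defaultdict(int))
    for _store, product, year, sales in rows:
        totals[product][year] += sales
    return totals


def reduced_products(rows, earlier=2005, later=2006):
    """Keep only products whose total sales fell from `earlier` to `later`."""
    totals = totals_by_product_year(rows)
    return {
        product: (by_year[earlier], by_year[later])
        for product, by_year in totals.items()
        if by_year[later] < by_year[earlier]
    }


def transpose(table):
    """Swap rows and columns of a rectangular list-of-lists table."""
    return [list(column) for column in zip(*table)]


if __name__ == "__main__":
    reduced = reduced_products(records)
    table = [["Product", 2005, 2006]]
    table += [[p, a, b] for p, (a, b) in reduced.items()]
    print(transpose(table))
```

Grouping on the pair (store, product) instead of the product alone gives the per-store variant asked for in item 4.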
Pentaho Data Integration Tool:
5
Read a text file in the PDI tool and show its
contents. Filter the records and write the
result to Excel files.
Convert a .csv file to relational database tables
after filtering the records.
Read all text files related to match [group1-group4]. Group the data of all the files to show
how many matches are played by each team. Discard
the attributes which are not required.
Create a mapping by reading the XML file named
countries.xml in the PDI tool and convert it into an Excel
file input for summarization. Summarize the data
to show which language has the highest percentage in
each country.
Weka Explorer:
Perform classification of weather data using
Weka Explorer and show the result in a graph.
Show under which weather conditions a match should be
played.
10
Perform Association Rule Mining on the given
data in Weka Explorer.
11
Perform k-means Clustering on the given data in
Weka Explorer to show different clusters.
INDEX
S.No
Title
P.No
Signature
Summarize the data to calculate the total number of
customers in each country who have purchased
a particular product, by quarter.
Calculate the cumulative sales of all products after
each month.
Calculate the monthly % increase or % decrease of
sales of all products in each country. Show the
products whose sales have increased over each
month.
Summarize the data to show which product has
maximum sales over all the products in all the
countries.
Compare the product with the maximum sales against
every other product, as a percentage.
Summarize the sales of all products over all years,
broken down by quarter and then by month
(with subtotals).
Summarize the data to show which salesperson has
the maximum sales of all products in each region.
Given the cost and selling price of each product, find
out the maximum profit gained by all products over
each region by each month.
Calculate the % contribution of sales of each region
to the total sales.
10
Summarize the given data in a single sheet to display
sales of all products.
11
Read a text file input in the Pentaho Data Integration
(PDI) tool and convert it into a relational database
table after filtering records.
12
Read all the text files related to the match [group1-group4]. Group the data of all the files to show how
many matches are played by each team. Discard the
attributes which are not required in the result.
13
Create a mapping by reading the XML file named
countries.xml in the PDI tool and convert it
into an Excel file output.
14
Perform classification of weather data using Weka
Explorer and show the result in a graph. Show under which
weather conditions a match should be played.
15
Perform k-means clustering on the given data to
show different clusters.
16
Perform association rule mining on the given data in
Weka Explorer.