0% found this document useful (0 votes)
78 views18 pages

DADS304 - MBA 3 - Visualization

Data Visualisation assignment

Uploaded by

Varun Asthana
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
78 views18 pages

DADS304 - MBA 3 - Visualization

Data Visualisation assignment

Uploaded by

Varun Asthana
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 18

Varun Asthana

Roll No. 2114501153


Program – Online MBA
Course – Visualization Directorate of Online Education
ASSIGNMENT

SESSION AUG/SEP 2022


PROGRAM MASTER OF BUSINESS ADMINISTRATION (MBA)
SEMESTER III
COURSE CODE & NAME DADS304 – VISUALIZATION
CREDITS 04
NUMBER OF ASSIGNMENTS & MARKS 02
30 Marks each

Note: Answer all questions. Kindly note that answers for 10 marks questions should be approximately of 400 -
450 words. Each question is followed by evaluation scheme.

Q.No Assignment Set – 1 Marks Total Marks


Questions
1. Explain how data Visualization helps in business development. 6+4 10
Write the R script to draw the Histogram of below data using GGPLOT
Library:
Age Gender
30 Male
40 Female
60 Male
80 Female
50 Male
40 Female
80 Male
30 Female
55 Male
76 Female
Give the following specification in the graph:
 Set color=green
 Set bin size -8
 Classify the histogram according to Gender
 Provide title to the chart
 Provide Axis to the chart
2. Explain Anatomy of Shiny App. Design the given UI using R Shiny 5+5 10
Varun Asthana
Roll No. 2114501153
Program – Online MBA
Course – Visualization Directorate of Online Education

3. Compare and Analyze different tools available for data visualization. 7+3 10
Write the steps to create a BMI field in Tableau assuming dataset as
follows:
S. N. Name Height Weight

Q.No Assignment Set – 2 Marks Total Marks


Questions
4. Explain different types of filters available in Tableau. 10 10
5. Explain LOD. Write the steps to deploy the dashboard in Tableau Public. 3+7 10
6. Differentiate between Microsoft PowerBI and Tableau. Also differentiate 5+5 10
between Power Query and Power Pivot.
Varun Asthana
Roll No. 2114501153
Program – Online MBA
Course – Visualization Directorate of Online Education
A1 – Data visualization is a powerful tool for business development, as it enables businesses to
analyze large and complex data sets quickly and easily. Here are some ways in which data
visualization can help in business development:

 Identify trends and patterns: Data visualization can help businesses identify trends and
patterns in their data that may be difficult to see in a tabular format. This can provide
valuable insights into customer behaviour, market trends, and business performance.
 Monitor key performance indicators (KPIs): Data visualization can help businesses monitor
and track their KPIs in real-time, making it easier to identify areas that need improvement
and take corrective action.
 Communicate insights effectively: Data visualization enables businesses to communicate
complex data insights effectively to stakeholders, including executives, employees, and
customers. This can help to build understanding, engagement, and trust.
 Support decision-making: Data visualization can provide businesses with the information
they need to make informed decisions quickly and confidently. This can be particularly useful
in fast-paced, competitive industries where timely decisions can make a significant impact
on business outcomes.
 Identify new opportunities: Data visualization can help businesses identify new
opportunities for growth and expansion by revealing trends and patterns that may not have
been apparent before. This can provide a competitive advantage and drive innovation and
strategic planning.

The visualization is as following:

> library(ggplot2)
# we have loaded the ggplot2 package

> histogram_data <-read.csv("C:\\Users\\Varun Asthana\\Documents\\R Course Files\\GGPlot_1_hi


stogram.csv")
# imported the data with the above code
# defined data file with name 'histogram_data'

> ggplot(histogram_data, aes(x=Age)) + geom_histogram()

# histogram plotted by defining age on X-axis

Output: visualization does not specify the gender. So needs improvement

As per the instruction (1) the color of the histogram is to be set to Green
Varun Asthana
Roll No. 2114501153
Program – Online MBA
Course – Visualization Directorate of Online Education
> ggplot(histogram_data, aes(x=Age)) + geom_histogram(color = 'black', fill = 'green')

As per the instruction (2) bin size is to be set to 8.


> ggplot(histogram_data, aes(x=Age)) + geom_histogram(color = 'black', fill = 'green', bins = 8)

> ggplot(histogram_data, aes(x=Age, fill = Gender)) + geom_histogram()

As per instruction (3) histogram has to be classified as per the Gender. By filling the bars classificatio
n can be done. For this we have to remove the green fill from histogram. Instead we will use ‘fill’ in a
es and shall define it by ‘Gender’

> ggplot(histogram_data, aes(x=Age, fill = Gender)) + geom_histogram(color = 'black', bins = 8)


Varun Asthana
Roll No. 2114501153
Program – Online MBA
Course – Visualization Directorate of Online Education
As per instruction (4) Main title has to be given to histogram. So first we will define our plot as ‘Assig
mentplot’. And then we will give title by using +ggtitles .
> AssignmentPlot <- ggplot(histogram_data, aes(x=Age, fill = Gender)) + geom_histogram(color = 'bla
ck', bins = 8)
> Print(AssignmentPlot + ggtitle("Gender & Age Distribution"))

> print(AssignmentPlot + labs(y = "Count of Citizens", x = "Age of Citizens"))

A2) – Structure of Shiny is as following:

First, we will load all the Shiny related packages in R

Then we create the input and output UI for the appearance


Varun Asthana
Roll No. 2114501153
Program – Online MBA
Course – Visualization Directorate of Online Education
Then we will run command to return the values to outputs. We will also tell Shiny that this is an app.

After that we will define what will be the UI components of the Shiny App.

Q3) Different tools available for visualization.


A3) There are many different tools available for data visualization, each has its own capabilities and
limitations. Below is a comparison of some of the most popular tools:

MS Excel - Excel is a widely used spreadsheet program that also includes basic data visualization
capabilities. While it is not as powerful as some of the other tools on this list, it can be a good choice
for simple charts and graphs.

Tableau - Tableau is a powerful and flexible data visualization tool. It is widely used in industry. It
allows users to create interactive dashboards, charts, and graphs, and supports a wide range of data
sources. Tableau is known for its intuitive drag-and-drop interface and its ability to handle large and
complex data sets.

Power BI - Power BI is a data visualization tool developed by Microsoft that is designed to integrate
with other Microsoft products. It allows users to create interactive reports and dashboards, and
supports a wide range of data sources. Power BI is known for its ease of use and its ability to create
visually appealing graphics.

Python - Python is a programming language that has become popular for data analysis and
visualization. There are many libraries available for Python that allow users to create charts, graphs,
and other visualizations. Some of the most popular libraries include Matplotlib, Seaborn, and Plotly.

R - R is a programming language that is widely used for statistical analysis and data visualization. Like
Python, there are many libraries available for R that allow users to create charts, graphs, and other
visualizations. Some of the most popular libraries include ggplot2, lattice, and plotly.

D3.js - D3.js is a JavaScript library for creating dynamic and interactive data visualizations in web
browsers. It is known for its ability to create complex and customized visualizations, but it requires
more coding expertise than some of the other tools on this list.

BMI Field set up and calculation


Varun Asthana
Roll No. 2114501153
Program – Online MBA
Course – Visualization Directorate of Online Education
The data is imported and is now ready to be put in any type of visualization.

The data when plotted shall appear like this:

Then we will create a calculation field

Adding BMI Field to the column:


Varun Asthana
Roll No. 2114501153
Program – Online MBA
Course – Visualization Directorate of Online Education

Q4) Explain different types of filters available in Tableau.

To access the filters let’s make a new sheet from sample data available – Superstore.
We can either apply data at the visualization level or we can apply the same at the data source level.

Data Source Filters


The Filters that are applied at the data source level is known as Data Source Filters. The data filtered
at the source level will be continued for other reports and visualizations.

Suppose I want to analyse /visualize the sales of only few regions and not all then I will go on :
 Data Source
 Filter  Add
 Select  Region  Select regions to analyse like Central & East

Filter applied will be shown like this :


Varun Asthana
Roll No. 2114501153
Program – Online MBA
Course – Visualization Directorate of Online Education
The Graph will be automatically to changed to the two selected regions

Extract Filters – this flter helps in data filteration at the local system and then it shall be applicable to
the rest of the data sets and visualizations. To do this we will go to edit extract as shown below:

Lets say that I want to visualize the Bulk Quantity sales ( > 5) in the filtered region of Central & East.

Now we will see that the filter is stored and we shall click “Create Extract”. After the the local file will
be created which can be uploaded to establish the correlation.

Dimension Filters – this is the filter that is used to filter in the visualization part. This filter is located
as shown below:
Varun Asthana
Roll No. 2114501153
Program – Online MBA
Course – Visualization Directorate of Online Education
Suppose in the below plotted graph I want to perform any slicing and want to view any particular
domain of sales then I can apply dimension filters.

To view the sales in the field of ‘Technology’ dimension filter of ‘category’ field will be applied and
‘furniture’ and ‘Office Supplies’ will be unselected from multi-select options.

We shall get a plot like this:

Radio & Multi-select Filters


The filtration shown in the previous screen provides a filter to select data for the user as well. We
can do the same by making use of ‘show filter’. We can select single value and multiple values based
on the filtering option to give it to the user.
Varun Asthana
Roll No. 2114501153
Program – Online MBA
Course – Visualization Directorate of Online Education

Measure Filter
Measure filters are applied based on the values of the fields selected in the visualization.

Below is the plot shown as per the above condition.


Varun Asthana
Roll No. 2114501153
Program – Online MBA
Course – Visualization Directorate of Online Education
Context Filter – If we create a visualization to see the top 10 states on the basis of sales then we will
be seeing a plot as below:

So here we will use “context filter” to honour the parent filter.

Q5) Explain LOD. Write the steps to deploy the dashboard in Tableau Public.

A5) Level Of Detail (LOD) allows to calculated the values at the data source level as well as at the
data visualization level. There are three types of LOD that gives control over the level of granularity
at which calculation has to be done.
 INCLUDE – it is at the most granular level
 EXCLUDE – it is at the less granular level
 FIXED – it is entirely independent level

Example of LOD – calculation can be done as following :


Varun Asthana
Roll No. 2114501153
Program – Online MBA
Course – Visualization Directorate of Online Education

The above plot shall be obtained after applying the LOD calculation

The Tableau Public login screen will open after that. The screen is shown below where you can
create online dashboard:

To connect to the data we can upload the file from the computer.

Here I have uploaded my own datasheet from the computer.


Varun Asthana
Roll No. 2114501153
Program – Online MBA
Course – Visualization Directorate of Online Education

Created this visualization of Negative feedbacks of the categories

Then we have to follow these below shared prompts to share our dashboard :
Varun Asthana
Roll No. 2114501153
Program – Online MBA
Course – Visualization Directorate of Online Education
The below field can be edited and more information can be entered by the author for the
visualization shared for public to view.

Q3) Different tools available for visualization


A3) There are many different tools available for data visualization, each has its own capabilities and
limitations. Below is a comparison of some of the most popular tools:

MS Excel - Excel is a widely used spreadsheet program that also includes basic data visualization
capabilities. While it is not as powerful as some of the other tools on this list, it can be a good choice
for simple charts and graphs.

Tableau - Tableau is a powerful and flexible data visualization tool. It is widely used in industry. It
allows users to create interactive dashboards, charts, and graphs, and supports a wide range of data
sources. Tableau is known for its intuitive drag-and-drop interface and its ability to handle large and
complex data sets.

Power BI - Power BI is a data visualization tool developed by Microsoft that is designed to integrate
with other Microsoft products. It allows users to create interactive reports and dashboards, and
supports a wide range of data sources. Power BI is known for its ease of use and its ability to create
visually appealing graphics.

Python - Python is a programming language that has become popular for data analysis and
visualization. There are many libraries available for Python that allow users to create charts, graphs,
and other visualizations. Some of the most popular libraries include Matplotlib, Seaborn, and Plotly.

R - R is a programming language that is widely used for statistical analysis and data visualization. Like
Python, there are many libraries available for R that allow users to create charts, graphs, and other
visualizations. Some of the most popular libraries include ggplot2, lattice, and plotly.

D3.js - D3.js is a JavaScript library for creating dynamic and interactive data visualizations in web
browsers. It is known for its ability to create complex and customized visualizations, but it requires
more coding expertise than some of the other tools on this list.

In general, the best tool for data visualization depends on whether it is able to analyze the data and
give user satisfaction. Tableau and Power BI are good choices for users who want a powerful and
flexible tool with a user-friendly interface, while Python and R are good choices for users who are
having programming experience and want more control over their visualizations. D3.js is a good
Varun Asthana
Roll No. 2114501153
Program – Online MBA
Course – Visualization Directorate of Online Education
choice for users who want to create highly customized and interactive visualizations for web
applications. Excel is a good choice for users who want to create basic visualizations quickly and
easily.

BMI Field set up and calculation

The data is imported and is now ready to be put in any type of visualization.

The data when plotted shall appear like this:

Then we will create a calculation field


Varun Asthana
Roll No. 2114501153
Program – Online MBA
Course – Visualization Directorate of Online Education

Adding BMI Field to the column:

Q6: Differentiate between Microsoft PowerBI and Tableau. Also differentiate between Power
Query and Power Pivot.
A6) Difference between PowerBI and Tableau
Power BI and Tableau are two of the most popular data visualization tools used for business
intelligence and data analysis. Both tools have similar functionalities and serve the same purpose,
but they differ in certain aspects. Here are some key differences between Power BI and Tableau:

Cost: Power BI is relatively cheaper than Tableau. Power BI offers a free version and a reasonably
priced pro version, while Tableau’s pricing model is generally considered more expensive.

Ease of use: Power BI is generally considered easier to use than Tableau, with a more intuitive
interface and less complex workflows. This makes it more suitable for beginners.

Customization: Tableau is generally considered more flexible and customizable than Power BI. This is
because Tableau allows for more advanced customizations, including the ability to create custom
calculations and complex data blending.

Data connectivity: Power BI is more tightly integrated with Microsoft’s suite of tools, including Excel
and Azure, making it easier to connect to and integrate data from these sources. Tableau has a
larger selection of data connectors and can connect to more diverse data sources.
Varun Asthana
Roll No. 2114501153
Program – Online MBA
Course – Visualization Directorate of Online Education
Mobile support: Power BI has better mobile support than Tableau, with a dedicated mobile app for
iOS and Android. Tableau has a mobile app as well, but it is not as well optimized for mobile devices
as Power BI.

Overall, both tools are powerful and versatile, and the choice between them ultimately comes down
to personal preference, budget, and specific needs of the user.

Difference between Power Query & Power Pivot


Power Query and Power Pivot are two key components of Microsoft's Power BI suite of tools, used
for data analysis and business intelligence. While both tools are used for data processing and
manipulation, they serve different functions:

Power Query: Power Query is a data connection and transformation tool that allows users to
connect to a wide range of data sources and transform the data into the format required for
analysis. It provides a graphical user interface for connecting to data sources, filtering and cleaning
data, and creating custom calculations. Power Query is designed to work with large datasets, and it
supports complex data transformations, such as merging and appending tables, pivoting and
unpivoting data, and removing duplicates.P

Power Pivot: Power Pivot is a data modeling tool that allows users to create complex data models
and relationships between tables. It enables users to combine data from multiple sources and create
relationships between tables using a visual interface. Power Pivot is designed to work with large
datasets, and it can handle complex calculations, such as time intelligence, DAX formulas, and
hierarchies.

In summary, Power Query is used for data transformation, while Power Pivot is used for data
modeling and creating relationships between tables. Both tools are essential for data analysis and
business intelligence, and they are often used in combination to create powerful data models and
visualizations.

You might also like