Foundation of Data Science
(CS G320)
BITS Pilani Data Visualization
Dubai Campus
Data Visualization
• Data visualization is the practice of translating information into a
visual context, such as a map or graph, to make data easier for the
human brain to understand and pull insights from.
• The main goal of data visualization is to make it easier to identify
patterns, trends and outliers in large data sets.
• It is one of the steps of the data science process
• The ability to absorb information quickly, improve insights and
make faster decisions.
BITS Pilani, Dubai Campus
Common types of data visualization
BITS Pilani, Dubai Campus
Visualizing change over time
Line chart (comparison)
It displays data points connected by a line, often used to show trends
over time or continuous data.
Bar chart (comparison and distribution)
It represents data using rectangular bars of varying lengths, commonly
used to compare categorical data.
BITS Pilani, Dubai Campus
Visualizing change over time…
Box plot (distribution and comparison)
It demonstrates graphically the locality, spread and skewness
groups of numerical data through their quartiles.
Similar to box plot
BITS Pilani, Dubai Campus
Visualizing part-to-whole composition
Pie chart
It is a way of representing data in a circular graph. Pie slices of the
chart show the relative size of the data.
Donut chart
A pie chart that features a hole in the center, to show a relevant
single numerical value with its progress.
Stacked bar chart
A bar chart where each bar is divided into multiple sub-bars. A
single bar can be used as an alternative to pie or donut chart.
BITS Pilani, Dubai Campus
Visualizing flow and processes
Funnel chart
It provides a visual representation of data
as it progresses through different stages
of a process.
Gantt chart
It is a horizontal bar chart used in project management
to visually represent a project over time.
BITS Pilani, Dubai Campus
Visualizing data distribution
Histogram
Similar to a bar chart, but used when a variable takes continuous numeric values.
The range is divided into bins for aggregating counts.
BITS Pilani, Dubai Campus
Visualizing relationships between variables
Scatter plot
It is used to show relationship between two numerical variables
Bubble chart
Scatter plot with point size dictated by a third numeric variable
Heatmap
Extension of bar charts and histograms to two variables, each of
which can be categorical or numerical
BITS Pilani, Dubai Campus
Popular data visualization tools
• Tableau
• Infogram
• ChartBlocks
• D3.js
• Google Charts
• Fusion Charts
• Chart.js
BITS Pilani, Dubai Campus
Thank you
BITS Pilani, Dubai Campus