DSBDA Practical 8 Tutorial

Uploaded by

kausubhk999999

Practical 8

Tutorial
In this practical we will work with the Titanic dataset and draw several kinds of plots.
The practical is simple and can be completed by following the manual. The final code is
given on the last page; check it if you run into any errors.
First, open Anaconda and launch Spyder.
We will use the Titanic dataset, which ships with the Seaborn library.
Let's see what the Titanic dataset looks like. Execute the following script:

import pandas as pd

import numpy as np

import matplotlib.pyplot as plt

import seaborn as sns

dataset = sns.load_dataset('titanic')

dataset.head()

The dataset contains 891 rows and 15 columns and contains information about the passengers
who boarded the unfortunate Titanic ship. The original task is to predict whether or not the
passenger survived depending upon different features such as their age, ticket, cabin they
boarded, the class of the ticket, etc. We will use the Seaborn library to see if we can find any
patterns in the data.
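
Before plotting, it can help to check the size of the table and how many values are missing in each column. The following is a minimal sketch: the `summarize` helper and the tiny `sample` frame are hypothetical stand-ins so that the example runs without downloading anything. In Spyder you would pass the frame returned by `sns.load_dataset('titanic')` instead.

```python
import numpy as np
import pandas as pd


def summarize(df):
    """Return (rows, columns) and the number of missing values per column."""
    return df.shape, df.isna().sum()


# Hypothetical stand-in for a few rows of the Titanic table.
sample = pd.DataFrame({
    "survived": [0, 1, 1, 0],
    "sex": ["male", "female", "female", "male"],
    "age": [22.0, 38.0, np.nan, 35.0],
    "fare": [7.25, 71.28, 7.92, 8.05],
})

shape, missing = summarize(sample)
print(shape)    # number of rows and columns
print(missing)  # missing values per column
```

Columns with many missing values (age is one of them in the Titanic data) will show fewer points in the plots than there are rows.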

1. Finding patterns of data.

Patterns in the data can be found with the help of different types of plots.
The types of plots are:

A. Distribution Plots

a. Dist-Plot

b. Joint Plot

c. Rug Plot

B. Categorical Plots

a. Bar Plot

b. Count Plot

c. Box Plot

d. Violin Plot

C. Advanced Plots

a. Strip Plot

b. Swarm Plot

D. Matrix Plots

a. Heat Map

b. Cluster Map

A. Distribution Plots:

These plots help us visualise the distribution of the data. We can use them to understand the
mean, median, range, variance, deviation, etc. of the data.

a. Distplot

● Dist plot gives us the histogram of the selected continuous variable.

● It is an example of a univariate analysis.

● We can change the number of bins, i.e. the number of vertical bars in the histogram.

import seaborn as sns

sns.distplot(x = dataset['age'], bins = 10)

The line that you see represents the kernel density estimate. You can remove this line by
passing False for the kde parameter, as shown below:

sns.distplot(dataset['age'], bins = 10, kde=False)

Here the x-axis is the age and the y-axis displays the frequency. For example, for bins = 10, there
are around 50 people aged 0 to 10.

Note: distplot() is deprecated in Seaborn 0.11 and later, so it prints a warning there; histplot()
and displot() are the replacements.
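
To see what the bins parameter means in numbers, the bar heights can be computed without drawing anything. This is a minimal sketch using NumPy's `np.histogram` on a small hypothetical array of ages (not the real column); it illustrates equal-width binning and is not a description of Seaborn's internals.

```python
import numpy as np

# Hypothetical ages; NaN marks a passenger whose age is unknown.
ages = np.array([2, 4, 9, 18, 22, 25, 29, 31, 35, 40, 47, 58, 80, np.nan])

# Drop missing values first: NaN cannot be assigned to a bin.
known = ages[~np.isnan(ages)]

# bins=10 splits the range [min, max] into 10 equal-width intervals.
counts, edges = np.histogram(known, bins=10)

# Print each interval with the number of ages that fall inside it.
for left, right, n in zip(edges[:-1], edges[1:], counts):
    print(f"{left:5.1f} to {right:5.1f}: {n}")
```

Each printed row corresponds to one vertical bar of the histogram: the interval is the bar's position on the x-axis and the count is its height.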

b. Joint Plot

● It combines the dist plots of two variables.

● It is an example of bivariate analysis.

● We additionally obtain a scatter plot of the two variables that shows the relationship
between them. We can change the scatter plot into a hexagonal plot, where the higher the
colour intensity, the greater the number of observations.

import seaborn as sns

# For Plot 1

sns.jointplot(x = dataset['age'], y = dataset['fare'], kind ='scatter')

# For Plot 2

sns.jointplot(x = dataset['age'], y = dataset['fare'], kind = 'hex')

● From the output, you can see that a joint plot has three parts: a distribution plot at the
top for the column on the x-axis, a distribution plot on the right for the column on the
y-axis, and a scatter plot in between that shows the joint distribution of both
columns. You can see that there is no clear correlation between the ages and the fares.

● You can change the type of the joint plot by passing a value for the kind parameter. For
instance, if instead of a scatter plot you want to display the distribution of data in the
form of a hexagonal plot, you can pass the value hex for the kind parameter.
● In the hexagonal plot, the hexagon with the largest number of points gets the darkest colour. So
if you look at the above plot, you can see that most of the passengers are between
the ages of 20 and 30, and most of them paid between 10 and 50 for their tickets.

c. The Rug Plot

The rugplot() function draws a small bar along the x-axis for each point in the dataset. To plot a
rug plot, you need to pass the column to plot. Let's plot a rug plot for fare.

sns.rugplot(dataset['fare'])

From the output, you can see that most of the instances for the fares have values between 0 and
100.

These are some of the most commonly used distribution plots offered by Python's Seaborn
library. Let's see some of the categorical plots in the Seaborn library.

B. Categorical Plots:
Categorical plots, as the name suggests, are normally used to plot categorical data. They
plot the values in a categorical column against another categorical column or a numeric
column. Let's see some of the most commonly used categorical plots.

a. The Bar Plot

The barplot() is used to display the mean value for each value in a categorical column, against a
numeric column. The first parameter is the categorical column, the second parameter is the
numeric column while the third parameter is the dataset. For instance, if you want to know the
mean value of the age of the male and female passengers, you can use the bar plot as follows.
sns.barplot(x='sex', y='age', data=dataset)

From the output, you can see that the average age of male passengers is a little over 30,
while the average age of female passengers is around 28.

In addition to finding the average, the bar plot can also be used to calculate other aggregate values
for each category. To do so, you need to pass the aggregate function to the estimator. For
instance, you can calculate the standard deviation for the age of each gender as follows:

import numpy as np

import matplotlib.pyplot as plt

import seaborn as sns

sns.barplot(x='sex', y='age', data=dataset, estimator=np.std)

Notice that in the above script we use the std function from the NumPy library to calculate
the standard deviation of the ages of male and female passengers.
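
The numbers behind the bars can also be computed directly with a pandas group-by. The following is a minimal sketch on a small hypothetical frame (the values are made up, not taken from the Titanic data); `ddof=0` is chosen here so that the result matches what `np.std` returns on a plain array.

```python
import pandas as pd

# Hypothetical stand-in for the 'sex' and 'age' columns.
df = pd.DataFrame({
    "sex": ["male", "male", "female", "female", "female"],
    "age": [20.0, 40.0, 30.0, 30.0, 30.0],
})

grouped = df.groupby("sex")["age"]

mean_age = grouped.mean()      # the mean is the default bar height in a bar plot
std_age = grouped.std(ddof=0)  # population standard deviation per category

print(mean_age)
print(std_age)
```

Comparing these tables with the bar heights is a quick way to check that a plot is being read correctly.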
b. The Count Plot
The count plot is similar to the bar plot; however, it displays the count of the categories in a specific
column. For instance, if we want to count the number of male and female passengers, we can do
so using the count plot as follows:

sns.countplot(x='sex', data=dataset)

c. The Box Plot

The box plot is used to display the distribution of a numeric column, split by category, in the form of quartiles.
The line inside the box shows the median value. The span from the lower whisker to the bottom
of the box covers the first quartile, i.e. the lowest 25% of the values. From the bottom of the box to the median lies the
second quartile. From the median to the top of the box lies the third quartile, and finally
from the top of the box to the top whisker lies the last quartile.

Now let's plot a box plot that displays the distribution of age with respect to each gender.
You need to pass the categorical column as the first parameter (which is sex in our case) and the
numeric column (age in our case) as the second parameter. Finally, the dataset is passed as the
third parameter. Take a look at the following script:

sns.boxplot(x='sex', y='age', data=dataset)

Let's try to understand the box plot for females. The first quartile starts at around 1 and ends at
20, which means that 25% of the female passengers are aged between 1 and 20. The second quartile
starts at around 20 and ends at around 28, which means that 25% of the female passengers are aged
between 20 and 28. Similarly, the third quartile lies between 28 and 38, hence 25% of the
female passengers are aged within this range, and finally the fourth or last quartile starts at 38 and ends
around 64.

If any passengers fall outside the whiskers, they are
called outliers and are represented by dots on the box plot.
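
The quartiles and the outlier limits that a box plot draws can be reproduced by hand. This is a minimal sketch on hypothetical ages, assuming the usual 1.5 × IQR rule for the whiskers (the default in Seaborn's box plot); the variable names are only illustrative.

```python
import pandas as pd

# Hypothetical ages with one extreme value.
ages = pd.Series([2, 4, 6, 8, 10, 12, 14, 16, 100])

q1 = ages.quantile(0.25)      # bottom edge of the box
median = ages.quantile(0.50)  # line inside the box
q3 = ages.quantile(0.75)      # top edge of the box
iqr = q3 - q1                 # height of the box (interquartile range)

# Points further than 1.5 * IQR from the box are treated as outliers.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outliers = ages[(ages < lower_fence) | (ages > upper_fence)]

print(q1, median, q3)
print(outliers.tolist())
```

Here only the value 100 lies beyond the upper fence, so it is the one point that would be drawn as a dot.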

You can make your box plots more informative by adding another layer of distribution. For instance, if
you want to see the box plots for the age of passengers of both genders, along with the information
about whether or not they survived, you can pass survived as the value of the hue parameter as
shown below:

sns.boxplot(x='sex', y='age', data=dataset, hue="survived")

Now, in addition to the information about the age of each gender, you can also see the distribution
of the passengers who survived. For instance, you can see that among the male passengers,
those who survived tended to be younger than those who did not. Similarly, you can see that
the variation in age among female passengers who did not survive is much greater than among
the surviving female passengers.

d. The Violin Plot

The violin plot is similar to the box plot; however, instead of only summarising the data by quartiles, it
shows the estimated density of the data along the numeric axis. The violinplot() function is used to plot
the violin plot. Like the box plot, the first parameter is the categorical column, the second
parameter is the numeric column, and the third parameter is the dataset.

Let's plot a violin plot that displays the distribution for the age with respect to each gender.

sns.violinplot(x='sex', y='age', data=dataset)

You can see from the figure above that violin plots provide much more information about the data
than the box plot. Instead of plotting only the quartiles, the violin plot shows how the data
is spread over the whole range. The area where the violin plot is thicker has a
higher number of instances for that age. For instance, from the violin plot for males, it is clearly
evident that the number of passengers aged between 20 and 40 is higher than in all the other
age brackets.

Like box plots, you can also add another categorical variable to the violin plot using the hue
parameter as shown below:
sns.violinplot(x='sex', y='age', data=dataset, hue='survived')
Department of Computer Engineering Subject : DSBDAL

Now you can see a lot of information on the violin plot. For instance, if you look at the bottom of
the violin plot for the males who survived (left-orange), you can see that it is thicker than the
bottom of the violin plot for the males who didn't survive (left-blue). This means that young
passengers make up a larger share of the male survivors than of the males who
did not survive.

C. Advanced Plots:

a. The Strip Plot

The strip plot draws a scatter plot where one of the variables is categorical. We have seen a scatter
plot in the joint plot section, where we had two numeric variables. The strip
plot is different in that one of the variables is categorical, and for each category
in the categorical variable, you will see a scatter plot with respect to the numeric column.

The stripplot() function is used to plot the strip plot. Like the box plot, the first parameter is the
categorical column, the second parameter is the numeric column, and the third parameter is the
dataset. Look at the following script:

sns.stripplot(x='sex', y='age', data=dataset, jitter=False)

SNJB’s Late Sau. K B Jain College of Engineering, Chandwad Dist. Nashik, MS

You can see the scattered plots of age for both males and females. The data points look like strips.
It is difficult to comprehend the distribution of data in this form. To better comprehend the data,
pass True for the jitter parameter which adds some random noise to the data. Look at the
following script:

sns.stripplot(x='sex', y='age', data=dataset, jitter=True)

Now you have a better view of the distribution of age across the genders.
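
The idea behind jitter can be shown with a few lines of NumPy. This is only an illustration of the concept under assumed values (the `width` of 0.1 and the six points are hypothetical), not Seaborn's actual implementation: each point keeps its age, and its horizontal position is shifted by a small random amount so that points with similar ages no longer sit on top of each other.

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed so the result is repeatable

# Category positions on the x-axis: 0 for 'male', 1 for 'female' (hypothetical data).
x = np.array([0, 0, 0, 1, 1, 1], dtype=float)

width = 0.1  # assumed maximum horizontal shift
x_jittered = x + rng.uniform(-width, width, size=x.shape)

print(x_jittered)
```

Only the x positions change; the y values (the ages) are left untouched, so the plot still shows the true data.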

Like violin and box plots, you can add an additional categorical column to the strip plot using the hue
parameter as shown below:

sns.stripplot(x='sex', y='age', data=dataset, jitter=True, hue='survived')

b. The Swarm Plot
The swarm plot is a combination of the strip and the violin plots. In a swarm plot, the points
are adjusted in such a way that they don't overlap. Let's plot a swarm plot for the distribution of
age against gender. The swarmplot() function is used to plot the swarm plot. Like the box plot, the
first parameter is the categorical column, the second parameter is the numeric column, and the
third parameter is the dataset. Look at the following script:

sns.swarmplot(x='sex', y='age', data=dataset)

You can clearly see that the above plot contains scattered data points like the strip plot and the
data points are not overlapping. Rather they are arranged to give a view similar to that of a violin
plot.

Let's add another categorical column to the swarm plot using the hue parameter.

sns.swarmplot(x='sex', y='age', data=dataset, hue='survived')

From the output, it is evident that the ratio of surviving males is lower than the ratio of surviving
females: for the male plot, there are more blue points and fewer orange points. On the other
hand, for females, there are more orange points (surviving) than blue points (not surviving).
Another observation is that amongst males aged less than 10, more passengers survived
than didn't.
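
The survival ratio read off the swarm plot can be confirmed with a group-by. This is a minimal sketch on a small hypothetical frame; it relies on the survived column holding 0 and 1, so that its mean per group is the fraction of survivors.

```python
import pandas as pd

# Hypothetical stand-in for the 'sex' and 'survived' columns (1 = survived).
df = pd.DataFrame({
    "sex": ["male", "male", "male", "male",
            "female", "female", "female", "female"],
    "survived": [0, 0, 0, 1, 1, 1, 1, 0],
})

# The mean of a 0/1 column is the share of ones in each group.
survival_rate = df.groupby("sex")["survived"].mean()

print(survival_rate)
```

Running the same two lines on the real dataset gives the exact ratios that the coloured points only suggest.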

D. Matrix Plots:
Matrix plots are plots that show data in the form of rows and columns. Heat maps are
the prime example of matrix plots.

a. Heat Maps
Heat maps are normally used to plot the correlation between numeric columns in the form of a matrix.
It is important to mention here that to draw matrix plots, you need to have meaningful
information on the rows as well as the columns. Let's print the first five rows of the Titanic dataset to see
whether both the row and column headers have meaningful information. Execute the following script:

import pandas as pd

import numpy as np

import matplotlib.pyplot as plt

import seaborn as sns

dataset = sns.load_dataset('titanic')

dataset.head()

From the output, you can see that the column headers contain useful information such as
whether the passenger survived, their age, fare, etc. However, the row headers only contain the indexes 0, 1, 2,
etc. To plot matrix plots, we need useful information in both the column and row headers. One way
to do this is to call the corr() method on the dataset. The corr() method returns the correlation
between all the numeric columns of the dataset. Because the Titanic dataset also contains text columns,
pass numeric_only=True (available from pandas 1.5; without it, pandas 2.0 and later raise an error).
Execute the following script:

dataset.corr(numeric_only=True)

In the output, you will see that both the columns and the rows have meaningful header
information, as shown below:
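
The shape of the correlation table can be checked on a small example. This is a minimal sketch on a hypothetical five-row frame (the values are illustrative only), assuming pandas 1.5 or later so that the `numeric_only` argument is available.

```python
import pandas as pd

# Hypothetical rows with three numeric columns and one text column.
df = pd.DataFrame({
    "age": [22.0, 38.0, 26.0, 35.0, 54.0],
    "fare": [7.25, 71.28, 7.92, 53.10, 51.86],
    "pclass": [3, 1, 3, 1, 1],
    "sex": ["male", "female", "female", "female", "male"],
})

# Text columns are skipped; every remaining column is compared with every other.
corr = df.corr(numeric_only=True)

print(corr)
```

The result has the same names on its rows and its columns, which is exactly the "meaningful header on both sides" that a matrix plot needs.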

Now to create a heat map with these correlation values, call the heatmap() function
and pass it your correlation dataframe. Look at the following script:

corr = dataset.corr(numeric_only=True)

sns.heatmap(corr)

From the output, it can be seen that the heatmap draws a box for every
combination of row and column. The colour of each box depends on its value along a colour gradient. For
instance, in the above image, if there is a high correlation between two features, the
corresponding cell is light (close to white); on the other hand, if the correlation is low or negative, the
corresponding cell is dark.

The correlation values can also be plotted on the heatmap by passing True for the annot
parameter. Execute the following script to see this in action:

corr = dataset.corr(numeric_only=True)

sns.heatmap(corr, annot=True)

You can also change the colour of the heatmap by passing an argument for the cmap
parameter. For example, the following script uses the 'coolwarm' colour map:

corr = dataset.corr(numeric_only=True)

sns.heatmap(corr, cmap='coolwarm')

b. Cluster Map:
In addition to the heat map, another commonly used matrix plot is the cluster map. The cluster
map uses hierarchical clustering to cluster the rows and columns of the matrix.
Let's plot a cluster map for the number of passengers who travelled in a specific month of a specific
year. Execute the following script:
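
The script for this step is not reproduced in the text, so the following is only a minimal sketch of the idea. It assumes a small hypothetical table of passenger counts with months as rows and years as columns (the numbers are made up), and it needs SciPy installed, since Seaborn uses it for the clustering.

```python
import matplotlib
matplotlib.use("Agg")  # render off-screen; drop this line in Spyder to see the figure

import pandas as pd
import seaborn as sns

# Hypothetical passenger counts: rows are months, columns are years.
table = pd.DataFrame(
    {
        1958: [340, 318, 362, 491],
        1959: [360, 342, 406, 548],
        1960: [417, 391, 419, 622],
    },
    index=["Jan", "Feb", "Mar", "Jul"],
    dtype=float,
)

# Draw the heat map with rows and columns reordered by hierarchical clustering.
grid = sns.clustermap(table)

# Row labels in the order chosen by the clustering.
print(list(grid.data2d.index))
```

Rows and columns with similar values end up next to each other, which is why the order in the figure usually differs from the order in the table.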
4. Check how the price of the ticket (column name: 'fare') for each passenger is distributed
by plotting a histogram.
import seaborn as sns

dataset = sns.load_dataset('titanic')

sns.histplot(dataset['fare'], kde=False, bins=10)

From the histogram, it is seen that around 730 passengers paid a fare in the first bin (roughly 0 to 50),
about 100 passengers paid a fare in the second bin (roughly 50 to 100), and so on.

Lastly, save the file and the histogram by clicking on the Save all plots button
above the console window.
Conclusion-
Seaborn is an advanced data visualisation library built on top of the Matplotlib library.
In this assignment, we looked at how to draw distribution and categorical
plots using the Seaborn library. We also saw how to draw strip, swarm and matrix plots in Seaborn,
and how to plot a histogram of the ticket fares.
