KANIKA CHOUHAN
Key Pandas Functions in Data Analysis
kanikachouhan1515
kanikachouhan1515@[Link]
1. Data Loading
pd.read_csv(): Load data from a CSV file.
pd.read_excel(): Load data from an Excel file.
pd.read_sql(): Load data from a SQL database.
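As a minimal sketch of the loaders above: the CSV text, the table name `scores`, and the in-memory SQLite database are hypothetical stand-ins chosen so the example runs without any external files.

```python
import io
import sqlite3

import pandas as pd

# Hypothetical CSV content; in practice you would pass a file path instead.
csv_text = "name,score\nAsha,90\nRavi,85\n"
df_csv = pd.read_csv(io.StringIO(csv_text))

# Hypothetical in-memory SQLite database standing in for a real one.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE scores (name TEXT, score INTEGER)")
conn.executemany("INSERT INTO scores VALUES (?, ?)", [("Asha", 90), ("Ravi", 85)])
conn.commit()
df_sql = pd.read_sql("SELECT * FROM scores", conn)
conn.close()

# pd.read_excel("scores.xlsx") follows the same pattern, but it needs an
# Excel engine such as openpyxl installed, so it is not executed here.
print(df_csv)
print(df_sql)
```

Both calls return an ordinary DataFrame, so everything in the later sections applies regardless of where the data came from.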
2. Data Exploration
df.head(): Display the first few rows of the DataFrame.
df.tail(): Display the last few rows of the DataFrame.
df.info(): Get a summary of the DataFrame, including data types and non-null counts.
df.describe(): Generate descriptive statistics for numerical columns.
df.shape: Get the number of rows and columns in the DataFrame.
df.columns: List all column names.
df.dtypes: Get data types of all columns.
df['col'].value_counts(): Count how often each unique value occurs in a column.
df.nunique(): Count distinct values in each column.
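The exploration calls above can be tried on a small frame. This is only a sketch: the `city` and `sales` columns and their values are made up for illustration.

```python
import pandas as pd

# Hypothetical sample data with one missing value.
df = pd.DataFrame({
    "city": ["Delhi", "Mumbai", "Delhi", "Pune"],
    "sales": [100, 150, 120, None],
})

print(df.head(2))        # first two rows
print(df.tail(2))        # last two rows
df.info()                # dtypes and non-null counts (prints directly)
print(df.describe())     # statistics for the numeric column
print(df.shape)          # (rows, columns)
print(list(df.columns))  # column names
print(df.dtypes)         # data type of each column

# Occurrences of each unique value in one column.
city_counts = df["city"].value_counts()
# Number of distinct non-missing values per column.
distinct = df.nunique()
print(city_counts)
print(distinct)
```

Note that `df.shape`, `df.columns`, and `df.dtypes` are attributes, so they are used without parentheses.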
3. Data Cleaning
df.dropna(): Remove missing values.
df.fillna(): Fill missing values.
df.drop_duplicates(): Remove duplicate rows.
df.rename(): Rename columns.
df.replace(): Replace specific values.
df.astype(): Change data types of columns.
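A short sketch of how the cleaning steps above might be chained. The column names, the "N/A" placeholder, and the fill value "0" are assumptions made for this example only.

```python
import pandas as pd

# Hypothetical messy data: a duplicate row, a missing age, a placeholder city.
df = pd.DataFrame({
    "Name": ["Asha", "Ravi", "Ravi", "Meera"],
    "Age": ["25", "31", "31", None],
    "City": ["delhi", "Pune", "Pune", "N/A"],
})

deduped = df.drop_duplicates()          # drops the repeated "Ravi" row
dropped = deduped.dropna()              # alternative: drop rows with any missing value
filled = deduped.fillna({"Age": "0"})   # or fill the missing age instead
renamed = filled.rename(columns={"Name": "name", "Age": "age", "City": "city"})
replaced = renamed.replace({"city": {"N/A": "Unknown"}})
typed = replaced.astype({"age": "int64"})  # strings -> integers

print(typed)
```

Each method returns a new DataFrame by default, which is why the results are assigned to new names rather than relying on in-place changes.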
4. Data Manipulation
df.sort_values(): Sort DataFrame by column values.
df.groupby(): Group data by a column and perform aggregate operations.
df.pivot_table(): Create a pivot table for more advanced grouping and summarizing.
df.melt(): Unpivot DataFrame from wide to long format.
df.merge(): Merge DataFrames based on common columns or indexes.
df.join(): Join DataFrames on indexes.
df.pivot_table(..., columns='columns_column', aggfunc='sum')
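The manipulation functions in this section can be sketched on two small frames. The `sales` and `managers` tables, their column names, and their values are hypothetical and exist only to make the example runnable.

```python
import pandas as pd

# Hypothetical sales data and a lookup table of managers.
sales = pd.DataFrame({
    "region": ["North", "North", "South", "South"],
    "quarter": ["Q1", "Q2", "Q1", "Q2"],
    "revenue": [100, 120, 80, 90],
})
managers = pd.DataFrame({"region": ["North", "South"], "manager": ["Asha", "Ravi"]})

# Sort rows by revenue, largest first.
sorted_sales = sales.sort_values("revenue", ascending=False)

# Total revenue per region.
totals = sales.groupby("region")["revenue"].sum()

# Regions as rows, quarters as columns, summed revenue in the cells.
pivot = sales.pivot_table(values="revenue", index="region",
                          columns="quarter", aggfunc="sum")

# Back from wide to long format.
long_form = pivot.reset_index().melt(id_vars="region", var_name="quarter",
                                     value_name="revenue")

# merge matches on a shared column; join matches on the index.
merged = sales.merge(managers, on="region", how="left")
joined = totals.to_frame("total").join(managers.set_index("region"))

print(pivot)
print(joined)
```

As a rough guide, `merge` is the more general choice when the key is an ordinary column, while `join` is convenient when both frames are already indexed by the key.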
5. Data Transformation
df.apply(): Apply a function to each row or column of a DataFrame, or to each element of a Series.
df['col'].map(): Map values of a Series using a dictionary or function.
df['col'].str: Access and manipulate string values.
df.cumsum(), df.cumprod(), df.cummax(), df.cummin(): Perform cumulative operations.
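A minimal sketch of the transformations above. The columns `name`, `grade`, and `score`, and the grade-to-points mapping, are invented for illustration.

```python
import pandas as pd

# Hypothetical data with untidy names.
df = pd.DataFrame({
    "name": [" asha ", "ravi", "meera"],
    "grade": ["A", "B", "A"],
    "score": [10, 20, 30],
})

# Element-wise apply on a Series.
df["score_doubled"] = df["score"].apply(lambda x: x * 2)

# Row-wise apply on a DataFrame (axis=1 passes each row to the function).
df["row_total"] = df[["score", "score_doubled"]].apply(lambda row: row.sum(), axis=1)

# Map values through a dictionary.
df["grade_points"] = df["grade"].map({"A": 4, "B": 3})

# String methods through the .str accessor.
df["name_clean"] = df["name"].str.strip().str.title()

# Cumulative operations.
df["running_total"] = df["score"].cumsum()
df["running_product"] = df["score"].cumprod()
df["running_max"] = df["score"].cummax()
df["running_min"] = df["score"].cummin()

print(df)
```

For simple arithmetic, built-in vectorized operations such as `df["score"] * 2` are usually faster than `apply`; the lambda is used here only to show the call shape.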
6. Data Analysis
df.corr(): Compute the correlation matrix for numerical columns.
df.cov(): Compute covariance matrix for numerical columns.
df.agg(): Perform multiple aggregate operations on columns.
df.quantile(): Calculate quantiles for numerical data.
df.sample(): Randomly sample rows.
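The analysis functions above, sketched on a tiny made-up dataset. The `hours` and `score` columns are hypothetical, and `random_state=0` is set only so the sample is reproducible.

```python
import pandas as pd

# Hypothetical data where score rises linearly with hours.
df = pd.DataFrame({
    "hours": [1, 2, 3, 4, 5],
    "score": [52, 54, 56, 58, 60],
})

corr = df.corr()                          # pairwise correlation matrix
cov = df.cov()                            # pairwise covariance matrix
summary = df.agg(["mean", "min", "max"])  # several aggregates at once
median_score = df["score"].quantile(0.5)  # 50th percentile
sample = df.sample(n=2, random_state=0)   # two random rows

print(corr)
print(cov)
print(summary)
print(median_score)
print(sample)
```

`agg` also accepts a dictionary such as `{"hours": "mean", "score": ["min", "max"]}` when different columns need different aggregates.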
7. Data Visualization
df.plot(): Basic plotting, which integrates with matplotlib for charts like line plots, histograms, etc.
df.hist(): Quickly plot histograms of numerical columns.
df.boxplot(): Generate box plots for numerical columns.
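A small sketch of the plotting calls above. It assumes matplotlib is installed; the `height` and `weight` columns are invented, and the non-interactive "Agg" backend is chosen here only so the script runs without a display.

```python
import matplotlib
matplotlib.use("Agg")  # assumption: render off-screen instead of opening a window

import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical numeric data.
df = pd.DataFrame({
    "height": [150, 160, 165, 170, 180],
    "weight": [50, 55, 62, 68, 80],
})

# Line plot with one line per numeric column.
line_ax = df.plot(kind="line")

# One histogram per numeric column.
hist_axes = df.hist(bins=5)

# Box plots for all numeric columns, drawn on a fresh figure.
fig, ax = plt.subplots()
box_ax = df.boxplot(ax=ax)

# Optionally write a figure to disk instead of showing it:
# fig.savefig("boxplot.png")
```

In a notebook or an interactive session you would normally skip the backend line and call `plt.show()` to display the figures.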