0% found this document useful (0 votes)

144 views10 pages

Pandas Cheatsheet

The document is a cheat sheet for using the Pandas library in Python, detailing various functions for importing, exporting, inspecting, selecting, cleaning, sorting, filtering, grouping, merging, and visualizing data. It provides code snippets for common operations such as reading CSV files, filtering data, and creating plots. This resource serves as a quick reference for users looking to perform data manipulation and analysis with Pandas.

Uploaded by

konihe5892

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

144 views10 pages

Pandas Cheatsheet

Uploaded by

konihe5892

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

coding_knowladge

Harry

Pandas
CheatSheet
coding_knowladge
Harry

Import Export Data

pd.read_csv(filename): Read data from a CSV file.
pd.read_table(filename): Read data from a
delimited text file.
pd.read_excel(filename): Read data from an Excel
file.
pd.read_sql(query, connection_object): Read
data from a SQL table/database.
pd.read_json(json_string): Read data from a
JSON formatted string, URL, or file.
pd.read_html(url): Parse an HTML URL, string, or file
to extract tables to a list of DataFrames.
pd.DataFrame(dict): Create a DataFrame from a
dictionary (keys as column names, values as lists).
df.to_csv(filename): Write to a CSV file.
df.to_excel(filename): Write to an Excel file.
df.to_sql(table_nm, connection_object): Write to
a SQL table.
df.to_json(filename): Write to a file in JSON format.
coding_knowladge
Harry

Inspect Data
df.head(): View the first 5 rows of the DataFrame.

df.tail(): View the last 5 rows of the DataFrame.

df.sample(): View the random 5 rows of the

DataFrame.

df.shape: Get the dimensions of the DataFrame.

df.info(): Get a concise summary of the

DataFrame.

df.describe(): Summary statistics for numerical

columns.

df.dtypes: Check data types of columns.

df.columns: List column names.

df.index: Display the index range.

comment “pands” and get it’s

complete pdf in your DM 📌
coding_knowladge
Harry

Select Index Data

df['column']: Select a single column.

df[['col1', 'col2']]: Select multiple columns.

df.iloc[0]: Select the first row by position.

df.loc[0]: Select the first row by index label.

df.iloc[0, 0]: Select a specific element by position.

df.loc[0, 'column']: Select a specific element by

label.

df[df['col'] > 5]: Filter rows where column > 5.

df.iloc[0:5, 0:2]: Slice rows and columns.

df.set_index('column'): Set a column as the index.

coding_knowladge
Harry

Cleaning Data

df.isnull(): Check for null values.

df.notnull(): Check for non-null values.

df.dropna(): Drop rows with null values.

df.fillna(value): Replace null values with a specific

value.

df.replace(1, 'one'): Replace specific values.

df.rename(columns={'old': 'new'}): Rename

columns.

df.astype('int'): Change data type of a column.

df.drop_duplicates(): Remove duplicate rows.

df.reset_index(): Reset the index.

coding_knowladge
Harry

Sort Filter Data

df.sort_values('col'): Sort by column in ascending
order.

df.sort_values('col', ascending=False): Sort by

column in descending order.

df.sort_values(['col1', 'col2'], ascending=[True,

False]): Sort by multiple columns.

df[df['col'] > 5]: Filter rows based on condition.

df.query('col > 5'): Filter using a query string.

df.sample(5): Randomly select 5 rows.

df.nlargest(3, 'col'): Get top 3 rows by column.

df.nsmallest(3, 'col'): Get bottom 3 rows by column.

df.filter(like='part'): Filter columns by substring.

coding_knowladge
Harry

Group Data
df.groupby('col'): Group by a column.

df.groupby('col').mean(): Mean of groups.

df.groupby('col').sum(): Sum of groups.

df.groupby('col').count(): Count non-null values in

groups.

df.groupby('col') ['other_col'].max(): Max value in

another column for groups.

df.pivot_table(values='col', index='group',
aggfunc='mean'): Create a pivot table.

df.agg({'col1': 'mean', 'col2': 'sum'}): Aggregate

multiple columns.

df.apply(np.mean): Apply a function to columns.

df.transform(lambda x: x + 10): Transform data

column-wise.
coding_knowladge
Harry

Merge Join Data

pd.concat([df1, df2]): Concatenate DataFrames
vertically.

pd.concat([df1, df2], axis=1): Concatenate

DataFrames horizontally.

df1.merge(df2, on='key'): Merge two DataFrames

on a key.

df1.join(df2): SQL-style join.

df1.append(df2): Append rows of one DataFrame

to another.

pd.merge(df1, df2, how='outer', on='key'): Outer

join.

pd.merge(df1, df2, how='inner', on='key'): Inner

join.

pd.merge(df1, df2, how='left', on='key'): Left join.

pd.merge(df1, df2, how='right', on='key'): Right

join.
coding_knowladge
Harry

Statistical Operations

df.mean(): Column-wise mean.

df.median(): Column-wise median.

df.std(): Column-wise standard deviation.

df.var(): Column-wise variance.

df.sum(): Column-wise sum.

df.min(): Column-wise minimum.

df.max(): Column-wise maximum.

df.count(): Count of non-null values per column.

df.corr(): Correlation matrix.

comment “panda” and get it’s

complete pdf in your DM 📌
coding_knowladge
Harry

Data Visualization

df.plot(kind='line'): Line plot.

df.plot(kind='bar'): Vertical bar plot.

df.plot(kind='barh'): Horizontal bar plot.

df.plot(kind='hist'): Histogram.

df.plot(kind='box'): Box plot.

df.plot(kind='kde'): Kernel density estimation

plot.

df.plot(kind='pie', y='col'): Pie chart.

df.plot.scatter(x='c1', y='c2'): Scatter plot.

df.plot(kind='area'): Area plot.

comment “panda” and get it’s

complete pdf in your DM 📌

Pandas Cheat Sheet
No ratings yet
Pandas Cheat Sheet
11 pages
Imp Pandas Cheatsheet
No ratings yet
Imp Pandas Cheatsheet
11 pages
Essential Pandas DataFrame Operations
No ratings yet
Essential Pandas DataFrame Operations
20 pages
Pyspark Cheatsheet
No ratings yet
Pyspark Cheatsheet
10 pages
Pandas Cheat Sheet for Data Science
No ratings yet
Pandas Cheat Sheet for Data Science
5 pages
Pandas Cheat Sheet PDF
67% (3)
Pandas Cheat Sheet PDF
1 page
Python Cheat Sheet Code Academy
100% (1)
Python Cheat Sheet Code Academy
1 page
Data Science Cheat Sheet: KEY Imports
100% (1)
Data Science Cheat Sheet: KEY Imports
1 page
PANDAS Cheatsheet
No ratings yet
PANDAS Cheatsheet
4 pages
Python Data Science Cheat Sheet
0% (1)
Python Data Science Cheat Sheet
3 pages
Introduction to Pandas DataFrames
No ratings yet
Introduction to Pandas DataFrames
25 pages
Learn Data Analysis With Pandas - Introduction
No ratings yet
Learn Data Analysis With Pandas - Introduction
2 pages
Pandas Commands
No ratings yet
Pandas Commands
3 pages
Pandas Guide
No ratings yet
Pandas Guide
50 pages
Pandas DataFrame Notes
100% (1)
Pandas DataFrame Notes
10 pages
Python Data Science Cheat Sheet
100% (2)
Python Data Science Cheat Sheet
6 pages
Pandas Cheat Sheet
100% (1)
Pandas Cheat Sheet
2 pages
Introduction To Pandas
No ratings yet
Introduction To Pandas
27 pages
Data Aggregation and Group Operations
No ratings yet
Data Aggregation and Group Operations
34 pages
Pandas
No ratings yet
Pandas
13 pages
Essential Pandas Operations Guide
No ratings yet
Essential Pandas Operations Guide
8 pages
Chapter Notes - Data Handling Using Pandas DataFrame
No ratings yet
Chapter Notes - Data Handling Using Pandas DataFrame
16 pages
7.2 - Data Frame Basics - mp4
No ratings yet
7.2 - Data Frame Basics - mp4
3 pages
Pandas DataFrames & Jupyter Guide
No ratings yet
Pandas DataFrames & Jupyter Guide
10 pages
Pandas and Python
No ratings yet
Pandas and Python
24 pages
Pandas DataFrame Notes
No ratings yet
Pandas DataFrame Notes
13 pages
Pandas DataFrame Cheat Sheet
No ratings yet
Pandas DataFrame Cheat Sheet
4 pages
Pandas DataFrame Cheat Sheet
100% (1)
Pandas DataFrame Cheat Sheet
10 pages
05 Pandas Data Frames
No ratings yet
05 Pandas Data Frames
33 pages
Pandas Cheat Sheet Free Resources At: Dataquest - Io/guide
No ratings yet
Pandas Cheat Sheet Free Resources At: Dataquest - Io/guide
7 pages
Pandas Data Wrangling Cheat Sheet
100% (2)
Pandas Data Wrangling Cheat Sheet
6 pages
Ainotes
No ratings yet
Ainotes
5 pages
Pandas Essentials for Data Scientists
No ratings yet
Pandas Essentials for Data Scientists
22 pages
Python Data Science: Pandas & ML Basics
100% (1)
Python Data Science: Pandas & ML Basics
41 pages
Introduction to Pandas for Data Analysis
No ratings yet
Introduction to Pandas for Data Analysis
12 pages
Pandas DataFrame Notes
No ratings yet
Pandas DataFrame Notes
10 pages
File Ip
No ratings yet
File Ip
22 pages
Pandas DataFrame Cheat Sheet Guide
No ratings yet
Pandas DataFrame Cheat Sheet Guide
12 pages
Cheat Sheet: Python For Data Science
No ratings yet
Cheat Sheet: Python For Data Science
1 page
Overview of Pandas DataFrames
No ratings yet
Overview of Pandas DataFrames
21 pages
Pandas
No ratings yet
Pandas
5 pages
Python & Pandas Cheat Sheet Guide
No ratings yet
Python & Pandas Cheat Sheet Guide
11 pages
Cheat Sheet: The Pandas Dataframe Object: Preliminaries Get Your Data Into A Dataframe
100% (1)
Cheat Sheet: The Pandas Dataframe Object: Preliminaries Get Your Data Into A Dataframe
12 pages
Python Cheat Sheet: Pandas - Numpy - Sklearn Matplotlib - Seaborn BS4 - Selenium - Scrapy
100% (4)
Python Cheat Sheet: Pandas - Numpy - Sklearn Matplotlib - Seaborn BS4 - Selenium - Scrapy
11 pages
Lecture Week2
No ratings yet
Lecture Week2
72 pages
Unit - 4 - Part 2
No ratings yet
Unit - 4 - Part 2
36 pages
Pandas
No ratings yet
Pandas
27 pages
Python Cheat Sheet For Excel Users
100% (2)
Python Cheat Sheet For Excel Users
5 pages
Pandas Cheatsheet
No ratings yet
Pandas Cheatsheet
1 page
Python Pandas Cheat Sheet Guide
No ratings yet
Python Pandas Cheat Sheet Guide
11 pages
Pandas
No ratings yet
Pandas
13 pages
AccuMark CAD Database Updates Guide
No ratings yet
AccuMark CAD Database Updates Guide
23 pages
PHP Nikit
No ratings yet
PHP Nikit
7 pages
COBOL File Status Codes Guide
No ratings yet
COBOL File Status Codes Guide
3 pages
Database Management System Lesson Plan
No ratings yet
Database Management System Lesson Plan
3 pages
Data Warehousing for IT Students
No ratings yet
Data Warehousing for IT Students
64 pages
List of Applicants 20210701 0930
No ratings yet
List of Applicants 20210701 0930
107 pages
FoDB - Lab 3
No ratings yet
FoDB - Lab 3
15 pages
(Project Name) : Inception: High-Level Technical Design Template
No ratings yet
(Project Name) : Inception: High-Level Technical Design Template
4 pages
BAED-PROG3114 Written Work 1 20OVER20
0% (1)
BAED-PROG3114 Written Work 1 20OVER20
13 pages
Database Concepts
No ratings yet
Database Concepts
25 pages
Introduction to Oracle Hyperion Essbase
No ratings yet
Introduction to Oracle Hyperion Essbase
36 pages
NUMANTRA SAP HANA Training - Module - I
No ratings yet
NUMANTRA SAP HANA Training - Module - I
71 pages
HDB 6.5.1 JDBC Eng
No ratings yet
HDB 6.5.1 JDBC Eng
92 pages
Schedules
No ratings yet
Schedules
10 pages
Lakshya Excel 70 Formulas
No ratings yet
Lakshya Excel 70 Formulas
5 pages
Data Profiling with IBM Quality Stage
No ratings yet
Data Profiling with IBM Quality Stage
2 pages
Project Write-Up in Online Student Management System
No ratings yet
Project Write-Up in Online Student Management System
7 pages
MDN 1605DG
No ratings yet
MDN 1605DG
88 pages
Oracle Database Data Files Instances Creating A Database
No ratings yet
Oracle Database Data Files Instances Creating A Database
39 pages
Data Analytics-Introduction: manish@IIITA
No ratings yet
Data Analytics-Introduction: manish@IIITA
26 pages
Grade 13 Information and Communication Technology 1st Term Test Paper 2020 North Western Province
No ratings yet
Grade 13 Information and Communication Technology 1st Term Test Paper 2020 North Western Province
28 pages
How To Use Object Relational Mapping in Node - Js - Optimize Database Interactions With Sequelize ORM
No ratings yet
How To Use Object Relational Mapping in Node - Js - Optimize Database Interactions With Sequelize ORM
12 pages
Lec-1.2 Database Management System Unit-1 BCS-501 DBMS Aktu 3rd Year Aktu Exams - Multi Atoms Plus (720p, h264, Youtube)
No ratings yet
Lec-1.2 Database Management System Unit-1 BCS-501 DBMS Aktu 3rd Year Aktu Exams - Multi Atoms Plus (720p, h264, Youtube)
3 pages
EER Modelling Concepts and Errors
100% (1)
EER Modelling Concepts and Errors
5 pages
IPP KCC For Print
No ratings yet
IPP KCC For Print
3 pages
Chapter-4 Database Recovery
No ratings yet
Chapter-4 Database Recovery
32 pages
Factsheet Jedox All in One Eng
No ratings yet
Factsheet Jedox All in One Eng
9 pages
90 (Informatics Practices)
No ratings yet
90 (Informatics Practices)
12 pages
Data Engineering & Analysis Training
No ratings yet
Data Engineering & Analysis Training
4 pages
Assignment 3
No ratings yet
Assignment 3
16 pages

Pandas Cheatsheet

Uploaded by

Pandas Cheatsheet

Uploaded by

coding_knowladge

Import Export Data

df.tail(): View the last 5 rows of the DataFrame.

df.sample(): View the random 5 rows of the

df.shape: Get the dimensions of the DataFrame.

df.info(): Get a concise summary of the

df.describe(): Summary statistics for numerical

df.dtypes: Check data types of columns.

df.columns: List column names.

df.index: Display the index range.

comment “pands” and get it’s

Select Index Data

df['column']: Select a single column.

df[['col1', 'col2']]: Select multiple columns.

df.iloc[0]: Select the first row by position.

df.loc[0]: Select the first row by index label.

df.iloc[0, 0]: Select a specific element by position.

df.loc[0, 'column']: Select a specific element by

df[df['col'] > 5]: Filter rows where column > 5.

df.iloc[0:5, 0:2]: Slice rows and columns.

df.set_index('column'): Set a column as the index.

df.isnull(): Check for null values.

df.notnull(): Check for non-null values.

df.dropna(): Drop rows with null values.

df.fillna(value): Replace null values with a specific

df.replace(1, 'one'): Replace specific values.

df.rename(columns={'old': 'new'}): Rename

df.astype('int'): Change data type of a column.

df.drop_duplicates(): Remove duplicate rows.

df.reset_index(): Reset the index.

Sort Filter Data

df.sort_values('col', ascending=False): Sort by

df.sort_values(['col1', 'col2'], ascending=[True,

df[df['col'] > 5]: Filter rows based on condition.

df.query('col > 5'): Filter using a query string.

df.sample(5): Randomly select 5 rows.

df.nlargest(3, 'col'): Get top 3 rows by column.

df.nsmallest(3, 'col'): Get bottom 3 rows by column.

df.filter(like='part'): Filter columns by substring.

df.groupby('col').mean(): Mean of groups.

df.groupby('col').sum(): Sum of groups.

df.groupby('col').count(): Count non-null values in

df.groupby('col') ['other_col'].max(): Max value in

df.agg({'col1': 'mean', 'col2': 'sum'}): Aggregate

df.apply(np.mean): Apply a function to columns.

df.transform(lambda x: x + 10): Transform data

Merge Join Data

pd.concat([df1, df2], axis=1): Concatenate

df1.merge(df2, on='key'): Merge two DataFrames

df1.join(df2): SQL-style join.

df1.append(df2): Append rows of one DataFrame

pd.merge(df1, df2, how='outer', on='key'): Outer

pd.merge(df1, df2, how='inner', on='key'): Inner

pd.merge(df1, df2, how='left', on='key'): Left join.

pd.merge(df1, df2, how='right', on='key'): Right

df.mean(): Column-wise mean.

df.median(): Column-wise median.

df.std(): Column-wise standard deviation.

df.var(): Column-wise variance.

df.sum(): Column-wise sum.

df.min(): Column-wise minimum.

df.max(): Column-wise maximum.

df.count(): Count of non-null values per column.

df.corr(): Correlation matrix.

comment “panda” and get it’s

df.plot(kind='line'): Line plot.

df.plot(kind='bar'): Vertical bar plot.

df.plot(kind='barh'): Horizontal bar plot.

df.plot(kind='box'): Box plot.

df.plot(kind='kde'): Kernel density estimation

df.plot(kind='pie', y='col'): Pie chart.

df.plot.scatter(x='c1', y='c2'): Scatter plot.

df.plot(kind='area'): Area plot.

comment “panda” and get it’s

You might also like