The Pandas Study Guide provides an overview of essential Pandas functionalities, including importing data, exploring DataFrames, selecting and filtering data, and performing aggregation and mathematical operations. It covers practical examples and tips for data cleaning, creating new columns, and managing missing values. Key methods such as .loc, .iloc, and groupby() are highlighted for effective data manipulation.

Uploaded by

tazshakas

📘 Pandas Study Guide

🔧 1. Pandas Basics
a. Importing Pandas

import pandas as pd

Always start with this line to use pandas.

b. Reading CSV Files

df = pd.read_csv("url_or_path")

# Example: build a DataFrame from a dictionary instead of a file

data = {
'regiment': ['Nighthawks', 'Dragoons', 'Scouts'],
'deaths': [523, 234, 62],
'origin': ['Arizona', 'Iowa', 'Oregon']
}
df = pd.DataFrame(data)

# Example with tab-separated data

url = 'https://raw.githubusercontent.com/justmarkham/DAT8/master/data/chipotle.tsv'
chipo = pd.read_csv(url, sep='\t')

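read_csv : Loads a delimited text file into a DataFrame. The default separator is a comma; pass sep='\t' for tab-separated data.

For a local file, pass its path: pd.read_csv("data.csv").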
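To try read_csv without downloading anything, the sketch below parses a small tab-separated string held in memory. The column names and values are made up for illustration, and io.StringIO simply stands in for a file path or URL.

```python
import io
import pandas as pd

# Hypothetical tab-separated data; a real file path or URL would go here instead.
tsv_text = "Team\tGoals\nGermany\t10\nSpain\t12\n"

# sep='\t' tells read_csv to split fields on tabs rather than commas.
teams = pd.read_csv(io.StringIO(tsv_text), sep="\t")
print(teams.shape)  # tuple of (rows, columns)
```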

🧾 2. DataFrame Exploration
a. Displaying Rows

df.head() # First 5 rows

df.tail() # Last 5 rows

b. Basic Info

df.info() # Summary (columns, nulls, types)

df.describe() # Statistical summary for numeric columns

c. Shape and Columns

df.shape # Tuple of (rows, columns)

df.columns # Index of column names (use list(df.columns) for a plain list)

d. Sorting Data

df.sort_values(by='Goals', ascending=False) # Sort by Goals, descending

df.sort_values(by=['Group', 'Goals'], ascending=[True, False]) # Multi-column sort

sort_values : Sorts the DataFrame by one or more columns.

ascending=True for ascending, False for descending.
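A minimal sketch of the multi-column sort, using made-up teams and values. It also shows that sort_values returns a new DataFrame rather than reordering the original.

```python
import pandas as pd

# Made-up values, for illustration only.
df = pd.DataFrame({
    "Team": ["A", "B", "C", "D"],
    "Group": ["Y", "X", "X", "Y"],
    "Goals": [3, 5, 7, 4],
})

# Group ascending first, then Goals descending within each group.
ranked = df.sort_values(by=["Group", "Goals"], ascending=[True, False])
print(ranked)

# df itself keeps its original row order because inplace was not requested.
print(df)
```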

🔍 3. Selecting and Filtering

a. Selecting Columns

df['Goals'] # Single column (returns Series)

df[['Team', 'Goals']] # Multiple columns (returns DataFrame)

b. Filtering Rows

df[df['Goals'] > 5] # Filter by condition

df[df['Team'] == 'Germany'] # Exact match
df[df['Team'].str.startswith('G')] # Filter rows where Team starts with 'G'

str.startswith : Filters string columns based on prefix (case-sensitive).

c. Conditional with isin

df[df['Team'].isin(['Germany', 'Spain'])]
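The filters above all work the same way: the inner expression builds a boolean Series (a mask), and df[mask] keeps the rows where it is True. The sketch below, with made-up values, also combines two masks using &, which is a standard pandas idiom although the guide does not list it.

```python
import pandas as pd

# Made-up values, for illustration only.
df = pd.DataFrame({
    "Team": ["Germany", "Greece", "Spain", "Italy"],
    "Goals": [10, 5, 12, 6],
})

# Each comparison yields a boolean Series aligned with df's rows.
mask = df["Goals"] > 5
high_scorers = df[mask]

# Combine masks with & (and) or | (or); wrap each condition in parentheses.
g_and_high = df[(df["Team"].str.startswith("G")) & (df["Goals"] > 5)]

# isin keeps rows whose value appears in the given list.
picked = df[df["Team"].isin(["Germany", "Spain"])]
```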

d. Selecting with .loc (Label-based)

df.loc[0] # Select row by index label

df.loc[0:2, ['Team', 'Goals']] # Select rows 0-2 and specific columns
df.loc[df['Goals'] > 5, 'Team'] # Select Team column where Goals > 5

.loc : Access rows and columns by labels or conditions.

e. Selecting with .iloc (Position-based)

df.iloc[0] # Select first row

df.iloc[0:3, 1:3] # Select rows 0-2 and columns 1-2

.iloc : Access rows and columns by integer positions.
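One difference that is easy to miss: a .loc slice includes its end label, while an .iloc slice excludes its end position, like ordinary Python slicing. A small sketch with made-up values and the default integer index:

```python
import pandas as pd

# Made-up values, for illustration only.
df = pd.DataFrame({
    "Team": ["Germany", "Spain", "Italy", "Greece"],
    "Goals": [10, 12, 6, 5],
    "Group": ["B", "C", "C", "A"],
})

by_label = df.loc[0:2, ["Team", "Goals"]]  # labels 0, 1 and 2 (end included)
by_position = df.iloc[0:2, 0:2]            # positions 0 and 1 (end excluded)

print(len(by_label), len(by_position))
```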

📐 4. Aggregation and Grouping

a. Using len() to Count

num_teams = len(df) # Number of rows (one row per team)

b. Grouping

df.groupby('Group')['Goals'].mean() # Mean goals per group
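The groupby call can be read as three steps: split the rows by Group, pick the Goals column, then aggregate within each group. A sketch with made-up values; the agg call is an extra illustration of computing several statistics at once and is not part of the guide itself.

```python
import pandas as pd

# Made-up values, for illustration only.
df = pd.DataFrame({
    "Team": ["A", "B", "C", "D"],
    "Group": ["X", "X", "Y", "Y"],
    "Goals": [4, 6, 2, 9],
})

# Result is a Series indexed by the group labels.
mean_goals = df.groupby("Group")["Goals"].mean()
print(mean_goals)

# agg computes several statistics per group in one call.
summary = df.groupby("Group")["Goals"].agg(["sum", "mean", "max"])
print(summary)
```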

c. Applying Functions with .apply

df['Goals_Doubled'] = df['Goals'].apply(lambda x: x * 2) # Double each Goals value

df['Team_Category'] = df['Team'].apply(lambda x: 'Elite' if x in ['Germany', 'Spai

.apply : Applies a function to each element of a Series, or to each column or row of a DataFrame.
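A runnable sketch of a conditional .apply on a Series. The data is made up, and the fallback label 'Other' and the contents of elite_teams are assumptions chosen for illustration, not values from the guide.

```python
import pandas as pd

# Made-up values, for illustration only.
df = pd.DataFrame({"Team": ["Germany", "Spain", "Greece"], "Goals": [10, 12, 5]})

elite_teams = ["Germany", "Spain"]  # assumed list, for illustration

# 'Other' is a hypothetical fallback label.
df["Team_Category"] = df["Team"].apply(
    lambda x: "Elite" if x in elite_teams else "Other"
)

# For simple arithmetic, a vectorized expression gives the same values as .apply.
df["Goals_Doubled"] = df["Goals"] * 2
```

For plain arithmetic the vectorized form is usually preferable; .apply is most useful when the logic does not map onto built-in column operations.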

🧮 5. Math and Stats

df['Goals'].sum() # Total
df['Goals'].mean() # Average
df['Goals'].max() # Max value
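These reductions skip missing values by default. A small sketch with a made-up Series containing one NaN:

```python
import numpy as np
import pandas as pd

# Made-up values with one missing entry.
goals = pd.Series([2, 4, np.nan, 6], name="Goals")

total = goals.sum()     # 12.0, the NaN is skipped
average = goals.mean()  # 4.0, mean of the three present values
largest = goals.max()   # 6.0
```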

✂️ 6. Column Operations

a. Creating New Columns

df['Goals per Match'] = df['Goals'] / df['Matches Played']

b. Renaming Columns

df.rename(columns={'old': 'new'}, inplace=True)

c. Setting Index

df.set_index('Team', inplace=True) # Set Team column as index

df.reset_index(inplace=True) # Reset index to default

set_index : Sets a column as the DataFrame index.

reset_index : Restores the default integer index; the old index becomes a regular column unless drop=True is passed.
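A short round trip, with made-up values, showing what the index change buys: once Team is the index, rows can be looked up by team name with .loc.

```python
import pandas as pd

# Made-up values, for illustration only.
df = pd.DataFrame({"Team": ["Germany", "Spain"], "Goals": [10, 12]})

# Without inplace=True, set_index returns a new DataFrame indexed by Team.
by_team = df.set_index("Team")
spain_goals = by_team.loc["Spain", "Goals"]  # label lookup by team name

# reset_index moves Team back into a regular column and restores 0, 1, ...
restored = by_team.reset_index()
```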

🧼 7. Data Cleaning
a. Checking for Missing Values

df.isnull().sum() # Count of missing values in each column

b. Dropping Columns or Rows

df.drop(columns=['Red Cards'], inplace=True) # Drop a column by name

df.dropna(inplace=True) # Drop every row that contains a missing value
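Because dropna() with no arguments discards any row with even one missing value, it helps to count the gaps first. A sketch with made-up data; the subset argument shown at the end restricts the check to chosen columns.

```python
import numpy as np
import pandas as pd

# Made-up values with two missing entries.
df = pd.DataFrame({
    "Team": ["Germany", "Spain", "Italy"],
    "Goals": [10, np.nan, 6],
    "Red Cards": [1, 0, np.nan],
})

missing_per_column = df.isnull().sum()  # count of NaN values in each column

# No arguments: remove every row that contains at least one NaN.
complete_rows = df.dropna()

# subset limits the check to the listed columns.
has_goals = df.dropna(subset=["Goals"])
```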

c. Removing Duplicates

df.drop_duplicates(subset=['Team'], keep='first', inplace=True)

drop_duplicates : Removes duplicate rows based on specified columns.

subset : Columns to check for duplicates.
keep='first' : Keeps the first occurrence; use keep='last' to keep the last one, or keep=False to drop every duplicated row.
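The three keep settings side by side, on a made-up frame in which Germany appears twice:

```python
import pandas as pd

# Made-up values, for illustration only.
df = pd.DataFrame({
    "Team": ["Germany", "Spain", "Germany"],
    "Goals": [10, 12, 11],
})

first_kept = df.drop_duplicates(subset=["Team"], keep="first")  # keeps rows 0 and 1
last_kept = df.drop_duplicates(subset=["Team"], keep="last")    # keeps rows 1 and 2
none_kept = df.drop_duplicates(subset=["Team"], keep=False)     # keeps row 1 only
```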

📚 Examples You Should Try

# 1. Get teams with more than 6 goals
df[df['Goals'] > 6]

# 2. Find number of teams in the dataset

len(df)

# 3. Get top 3 teams with most yellow cards

df.sort_values(by='Yellow Cards', ascending=False).head(3)

# 4. Teams that received no red cards

df[df['Red Cards'] == 0]

# 5. Create a new column: 'Goals per Match'

df['Goals per Match'] = df['Goals'] / df['Matches Played']

# 6. Filter teams starting with 'S'

df[df['Team'].str.startswith('S')]

# 7. Select first 2 rows and 'Team', 'Goals' columns using .loc

df.loc[0:1, ['Team', 'Goals']]

# 8. Select first 2 rows and first 2 columns using .iloc

df.iloc[0:2, 0:2]

# 9. Double goals using .apply

df['Goals_Doubled'] = df['Goals'].apply(lambda x: x * 2)

# 10. Remove duplicate teams

df.drop_duplicates(subset=['Team'], keep='first')

# 11. Set 'Team' as index and select rows where Goals > 5
df.set_index('Team').loc[lambda d: d['Goals'] > 5] # Callable so the mask uses the new Team index
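A note on example 11: a mask built from df carries df's original integer index, so pandas generally cannot line it up with a frame that has just been re-indexed by Team. Two ways to keep the mask and the frame on the same index, sketched with made-up values:

```python
import pandas as pd

# Made-up values, for illustration only.
df = pd.DataFrame({"Team": ["Germany", "Spain", "Greece"], "Goals": [10, 12, 5]})

# Option 1: pass a callable; .loc calls it with the re-indexed frame.
top = df.set_index("Team").loc[lambda d: d["Goals"] > 5]

# Option 2: re-index first, then build the mask from the new frame.
by_team = df.set_index("Team")
top_again = by_team[by_team["Goals"] > 5]
```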

✅ Tips to Remember
df[df['col'] > value] is your main tool for filtering.
Use .head() to preview data often.
Missing values (NaN) are skipped by most aggregations such as sum() and mean(), and NaN never compares equal to anything, so always check with .isnull().sum().
Column operations ( df['new'] = ... ) let you engineer features quickly.
groupby() is powerful for grouped aggregation (like finding averages by category).
Use .loc for label-based selection, .iloc for position-based selection.
sort_values helps organize data; combine with head() for top/bottom rows.
.apply is flexible for custom transformations.
Use drop_duplicates to ensure unique data entries.
set_index is useful for making a column the index for easier lookups.
