0% found this document useful (0 votes)

18 views6 pages

Week 2

Uploaded by

srujjanbelamgi12

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views6 pages

Week 2

Uploaded by

srujjanbelamgi12

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

import pandas as pd

# DataFrame 1
data1 = {'Name': ['Pankaj', 'Meghna', 'Lisa'],
'Country': ['India', 'India', 'USA'],
'Role': ['CEO', 'CTO', 'CTO']}
df1 = [Link](data1)
# DataFrame 2
data2 = {'ID': [1, 2, 3],
'Name': ['Pankaj', 'Anupam', 'Amit']}
df2 = [Link](data2)
print("DataFrame 1:")
print(df1)
print("\nDataFrame 2:")
print(df2)

DataFrame 1:
Name Country Role
0 Pankaj India CEO
1 Meghna India CTO
2 Lisa USA CTO

DataFrame 2:
ID Name
0 1 Pankaj
1 2 Anupam
2 3 Amit

result_row = [Link](df1, df2, on='Name')

print(result_row)

Name Country Role ID

0 Pankaj India CEO 1

# Left Join
result_left = [Link](df1, df2, on='Name', how='left')
print("\nResult Left Join:")
print(result_left)
# Right Join
result_right = [Link](df1, df2, on='Name', how='right')
print("\nResult Right Join:")
print(result_right)
# Outer Join

result_outer = [Link](df1, df2, on='Name', how='outer')

print("\nResult Outer Join:")
print(result_outer)

Result Left Join:

Name Country Role ID
0 Pankaj India CEO 1.0
1 Meghna India CTO NaN
2 Lisa USA CTO NaN

Result Right Join:

Name Country Role ID
0 Pankaj India CEO 1
1 Anupam NaN NaN 2
2 Amit NaN NaN 3

Result Outer Join:

Name Country Role ID
0 Amit NaN NaN 3.0
1 Anupam NaN NaN 2.0
2 Lisa USA CTO NaN
3 Meghna India CTO NaN
4 Pankaj India CEO 1.0

result_outer = [Link](df1, df2, on='Name', how='outer')

print("\nResult Outer Join:")
print(result_outer)

Result Left Join:

Name Country Role ID
0 Pankaj India CEO 1.0
1 Meghna India CTO NaN
2 Lisa USA CTO NaN

Result Right Join:

Name Country Role ID
0 Pankaj India CEO 1
1 Anupam NaN NaN 2
2 Amit NaN NaN 3

Result Outer Join:

Name Country Role ID
0 Amit NaN NaN 3.0
1 Anupam NaN NaN 2.0
2 Lisa USA CTO NaN
3 Meghna India CTO NaN
4 Pankaj India CEO 1.0

# Sales Dictionary and Region Dictionary

sales_dict = {'ID': [1, 2, 3, 4],
'Amount': [100, 200, 300, 400]}
region_dict = {'ID': [1, 2, 3, 5],
'Region': ['East', 'West', 'North', 'South']}
# Create DataFrames
sales_df = [Link].from_dict(sales_dict)
region_df = [Link].from_dict(region_dict)
print("Sales DataFrame:")
print(sales_df)
print("\nRegion DataFrame:")
print(region_df)

Sales DataFrame:
ID Amount
0 1 100
1 2 200
2 3 300
3 4 400

Region DataFrame:
ID Region
0 1 East
1 2 West
2 3 North
3 5 South

# b) Merging with Inner Join

result_inner = [Link](sales_df, region_df, on='ID', how='inner')
print("\nInner Join:")
print(result_inner)
# c) Merging with Left Join
result_left = [Link](sales_df, region_df, on='ID', how='left')
print("\nLeft Join:")
print(result_left)
# d) Merging with Right Join
result_right = [Link](sales_df, region_df, on='ID', how='right')
print("\nRight Join:")
print(result_right)
# e) Merging with Outer Join
result_outer = [Link](sales_df, region_df, on='ID', how='outer')
print("\nOuter Join:")
print(result_outer)
Inner Join:
ID Amount Region
0 1 100 East
1 2 200 West
2 3 300 North

Left Join:
ID Amount Region
0 1 100 East
1 2 200 West
2 3 300 North
3 4 400 NaN

Right Join:
ID Amount Region
0 1 100.0 East
1 2 200.0 West
2 3 300.0 North
3 5 NaN South

Outer Join:
ID Amount Region
0 1 100.0 East
1 2 200.0 West
2 3 300.0 North
3 4 400.0 NaN
4 5 NaN South

import numpy as np
import pandas as pd
# Data with Missing Values
data = {'A': [1, [Link], 3, 4],
'B': [5, 6, [Link], 8],
'C': [[Link], [Link], 9, 10]}
df = [Link](data)
print("Original DataFrame:")
print(df)
# 1. Drop rows with any missing value
print("\nDrop rows with any missing values:")
print([Link]())
# 2. Drop columns with at least one missing value
print("\nDrop columns with at least one missing value:")
print([Link](axis=1))
# 3. Drop rows/columns with all missing values
print("\nDrop rows/columns with all missing values:")
print([Link](how='all'))
# 4. Drop rows/columns based on threshold (at least 2 non-NaN values)
print("\nDrop rows/columns based on threshold:")
print([Link](thresh=2))
# 5. Replace NaN with the previous value (Forward Fill)
print("\nReplace NaN with the previous value:")
print([Link]()) # Using ffill() instead of fillna(method='pad')
# 6. Replace NaN with the previous value, limit=1 (Forward Fill with Limit)
print("\nReplace NaN with the previous value, limit=1:")
print([Link](limit=1)) # Using ffill() with limit
# 7. Replace NaN with the next value (Backward Fill)
print("\nReplace NaN with the forward value:")
print([Link]()) # Using bfill() instead of fillna(method='bfill')

Original DataFrame:
A B C
0 1.0 5.0 NaN
1 NaN 6.0 NaN
2 3.0 NaN 9.0
3 4.0 8.0 10.0

Drop rows with any missing values:

A B C
3 4.0 8.0 10.0

Drop columns with at least one missing value:

Empty DataFrame
Columns: []
Index: [0, 1, 2, 3]
Drop rows/columns with all missing values:
A B C
0 1.0 5.0 NaN
1 NaN 6.0 NaN
2 3.0 NaN 9.0
3 4.0 8.0 10.0

Drop rows/columns based on threshold:

A B C
0 1.0 5.0 NaN
2 3.0 NaN 9.0
3 4.0 8.0 10.0

Replace NaN with the previous value:

A B C
0 1.0 5.0 NaN
1 1.0 6.0 NaN
2 3.0 6.0 9.0
3 4.0 8.0 10.0

Replace NaN with the previous value, limit=1:

A B C
0 1.0 5.0 NaN
1 1.0 6.0 NaN
2 3.0 6.0 9.0
3 4.0 8.0 10.0

Replace NaN with the forward value:

A B C
0 1.0 5.0 9.0
1 3.0 6.0 9.0
2 3.0 8.0 9.0
3 4.0 8.0 10.0

import pandas as pd

fruit = { 'orange' : [3,2,0,1], 'apple' : [0,3,7,2], 'grapes' : [7,14,6,15] }

df1 = [Link](fruit)
df1

orange apple grapes

0 3 0 7

1 2 3 14

2 0 7 6

3 1 2 15

Next steps: Generate code with df1

toggle_off View recommended plots New interactive sheet

fruit = { 'grapes' : [13,12,10,2,55,98], 'mango' : [10,13,17,2,9,76], 'banana' : [20,23,27,4,[Link],[Link]]} # Added [Link]

df2 = [Link](fruit)
df2

grapes mango banana

0 13 10 20.0

1 12 13 23.0

2 10 17 27.0

3 2 2 4.0

4 55 9 NaN

5 98 76 NaN

Next steps: Generate code with df2

toggle_off View recommended plots New interactive sheet

df2 = [Link]([Link][2])
df2
grapes mango banana

0 13 10 20.0

1 12 13 23.0

3 2 2 4.0

4 55 9 NaN

5 98 76 NaN

Next steps: Generate code with df2

toggle_off View recommended plots New interactive sheet

[Link]((df1, df2), axis = 0)

orange apple grapes mango banana

0 3.0 0.0 7 NaN NaN

1 2.0 3.0 14 NaN NaN

2 0.0 7.0 6 NaN NaN

3 1.0 2.0 15 NaN NaN

0 NaN NaN 13 10.0 20.0

1 NaN NaN 12 13.0 23.0

3 NaN NaN 2 2.0 4.0

4 NaN NaN 55 9.0 NaN

5 NaN NaN 98 76.0 NaN

df1

orange apple grapes

0 3 0 7

1 2 3 14

2 0 7 6

3 1 2 15

Next steps: Generate code with df1

toggle_off View recommended plots New interactive sheet

[Link]([df1, df2], ignore_index=True)

orange apple grapes mango banana

0 3.0 0.0 7 NaN NaN

1 2.0 3.0 14 NaN NaN

2 0.0 7.0 6 NaN NaN

3 1.0 2.0 15 NaN NaN

4 NaN NaN 13 10.0 20.0

5 NaN NaN 12 13.0 23.0

6 NaN NaN 2 2.0 4.0

7 NaN NaN 55 9.0 NaN

8 NaN NaN 98 76.0 NaN

%%time
df = [Link](columns=['A'])
for i in range(30):
# Instead of append, use concat to add rows
df = [Link]([df, [Link]([{'A': i*2}])], ignore_index=True)

CPU times: user 17.4 ms, sys: 0 ns, total: 17.4 ms

Wall time: 16.7 ms

%%time
df = [Link]([[Link]([i*2], columns=['A']) for i in range(30)], ignore_index=True)

CPU times: user 11.4 ms, sys: 1.04 ms, total: 12.5 ms
Wall time: 39.6 ms

Start coding or generate with AI.

Exp 3
No ratings yet
Exp 3
10 pages
Top Machine Learning Artificial Intelligence AI Data Science Cheat Sheets ForML & Deep Learning Engineers
No ratings yet
Top Machine Learning Artificial Intelligence AI Data Science Cheat Sheets ForML & Deep Learning Engineers
14 pages
Pandas Data Wrangling Cheatsheet Datacamp PDF
No ratings yet
Pandas Data Wrangling Cheatsheet Datacamp PDF
1 page
DSP Unit-5 Updated
No ratings yet
DSP Unit-5 Updated
23 pages
Edp 3
No ratings yet
Edp 3
16 pages
Unit3 - 3) Pandas - Ipynb - Colab
No ratings yet
Unit3 - 3) Pandas - Ipynb - Colab
11 pages
Pandas For Python Pro Level Cheat Sheet
No ratings yet
Pandas For Python Pro Level Cheat Sheet
14 pages
Essential Pandas Cheat Sheet Guide
No ratings yet
Essential Pandas Cheat Sheet Guide
5 pages
Pandas Part-2
No ratings yet
Pandas Part-2
9 pages
Chapter 2 Python Pandas - II
No ratings yet
Chapter 2 Python Pandas - II
19 pages
Data Integration and Missing Values Analysis
No ratings yet
Data Integration and Missing Values Analysis
23 pages
Handling Duplicates in DataFrames
No ratings yet
Handling Duplicates in DataFrames
7 pages
Pandas
No ratings yet
Pandas
44 pages
Pandas Cheat Sheet for Data Manipulation
No ratings yet
Pandas Cheat Sheet for Data Manipulation
1 page
Module - d2
No ratings yet
Module - d2
41 pages
Content Pandas Cheat Sheet
No ratings yet
Content Pandas Cheat Sheet
9 pages
Pandas Cheat Sheet for Data Science
No ratings yet
Pandas Cheat Sheet for Data Science
1 page
Pandas Cheat Sheet
100% (1)
Pandas Cheat Sheet
2 pages
Pandas Cheat Sheet for Data Science
No ratings yet
Pandas Cheat Sheet for Data Science
1 page
Pandas Python For Data Science
100% (1)
Pandas Python For Data Science
1 page
Unit 4 1
No ratings yet
Unit 4 1
3 pages
Introduction to Pandas DataFrames
100% (1)
Introduction to Pandas DataFrames
21 pages
Pandaspythonfordatascience
No ratings yet
Pandaspythonfordatascience
1 page
Revision Notes DataFrame XII IP
No ratings yet
Revision Notes DataFrame XII IP
8 pages
Pandas Dataframe Cheat Sheet
No ratings yet
Pandas Dataframe Cheat Sheet
3 pages
Pandas Data Wrangling Cheat Sheet
100% (2)
Pandas Data Wrangling Cheat Sheet
6 pages
Learn Pandas
No ratings yet
Learn Pandas
37 pages
Pandas Moderate
No ratings yet
Pandas Moderate
15 pages
07 Data Wrangling
No ratings yet
07 Data Wrangling
51 pages
Data Wrangling with Pandas
No ratings yet
Data Wrangling with Pandas
16 pages
4th Unit Answer Bank
No ratings yet
4th Unit Answer Bank
40 pages
Unit 4 DSE
No ratings yet
Unit 4 DSE
9 pages
9.9.24 Revision
No ratings yet
9.9.24 Revision
9 pages
Pandas DataFrame Cheat Sheet
No ratings yet
Pandas DataFrame Cheat Sheet
6 pages
Different Methods of Plotting
No ratings yet
Different Methods of Plotting
4 pages
Python Data Science Cheat Sheet
97% (33)
Python Data Science Cheat Sheet
11 pages
Pandas Cheat Sheet
No ratings yet
Pandas Cheat Sheet
2 pages
Pandas
No ratings yet
Pandas
26 pages
Unit 3 Python B.SC IT
No ratings yet
Unit 3 Python B.SC IT
18 pages
WEBINTEL GUIDED LAB ACTIVITY Introduction To Pandas
No ratings yet
WEBINTEL GUIDED LAB ACTIVITY Introduction To Pandas
1 page
Ch-2 - Panda - Part-1 - 2nd - Day
No ratings yet
Ch-2 - Panda - Part-1 - 2nd - Day
4 pages
Pandas Cheat Sheet
No ratings yet
Pandas Cheat Sheet
2 pages
Exp 6
No ratings yet
Exp 6
9 pages
Python For DS Unit4
No ratings yet
Python For DS Unit4
11 pages
Python Interviews
No ratings yet
Python Interviews
154 pages
What Can You Do With Dataframes Using Pandas?: Pandas Is A High-Level Data Manipulation Tool Developed by Wes Mckinney
No ratings yet
What Can You Do With Dataframes Using Pandas?: Pandas Is A High-Level Data Manipulation Tool Developed by Wes Mckinney
10 pages
Pandas Cheat Sheet
85% (13)
Pandas Cheat Sheet
2 pages
Numpy Programs
No ratings yet
Numpy Programs
3 pages
Num Py Lab Part-1-2
No ratings yet
Num Py Lab Part-1-2
2 pages
Network Address Translation NAT
No ratings yet
Network Address Translation NAT
10 pages
Katalon Demo ST
No ratings yet
Katalon Demo ST
15 pages
Sociology Quantitative & Qualitative Research Guide
No ratings yet
Sociology Quantitative & Qualitative Research Guide
22 pages
Historicist Conceptions of Rationality
No ratings yet
Historicist Conceptions of Rationality
17 pages
Thill Ebc11 tb05
No ratings yet
Thill Ebc11 tb05
34 pages
Radner & Shepp 1996 - Risk Vs Profit Potentials A Model For Corporate Strategies (JEDC)
No ratings yet
Radner & Shepp 1996 - Risk Vs Profit Potentials A Model For Corporate Strategies (JEDC)
21 pages
The System - How To Take Control of Your Vices
No ratings yet
The System - How To Take Control of Your Vices
6 pages
Jaquar Company Overview and Market Research
100% (1)
Jaquar Company Overview and Market Research
10 pages
Creative Writing in Drama: Lessons 1-4
No ratings yet
Creative Writing in Drama: Lessons 1-4
17 pages
No. 4 Find Solution Using Simplex (Bigm) Method Max Z 3X - 4Y + 3Z Subject To - X + Y + Z 0 and X, Y, Z 0 Solution: Problem Is
No ratings yet
No. 4 Find Solution Using Simplex (Bigm) Method Max Z 3X - 4Y + 3Z Subject To - X + Y + Z 0 and X, Y, Z 0 Solution: Problem Is
5 pages
The Marketing Environment Determines The Success of Marketing Strategies.
No ratings yet
The Marketing Environment Determines The Success of Marketing Strategies.
14 pages
En Aep000004068
No ratings yet
En Aep000004068
29 pages
Term 3 Investigation Grade 7
100% (1)
Term 3 Investigation Grade 7
7 pages
ZHU - DESHMUKH - 2003 - Application Bayesian Decision Networks
No ratings yet
ZHU - DESHMUKH - 2003 - Application Bayesian Decision Networks
13 pages
PET For Schools Listening Part 3
No ratings yet
PET For Schools Listening Part 3
10 pages
2017 International Biology Competition Results
No ratings yet
2017 International Biology Competition Results
1 page
Front Cover Dac 21903
No ratings yet
Front Cover Dac 21903
3 pages
Proposed SAM Interfaces: January 13, 2011
No ratings yet
Proposed SAM Interfaces: January 13, 2011
9 pages
MICA Checklist
No ratings yet
MICA Checklist
5 pages
Half Girlfriend: Struggles and Success
No ratings yet
Half Girlfriend: Struggles and Success
6 pages
West Bengal Environment Report
No ratings yet
West Bengal Environment Report
404 pages
Personal Data Anonymization
No ratings yet
Personal Data Anonymization
7 pages
OTC 4854 Ultimate Strength of Tubular Joints Subjected To Combined Loads
No ratings yet
OTC 4854 Ultimate Strength of Tubular Joints Subjected To Combined Loads
10 pages
SSM63817 - Smart Key Programming Advice SSM March 2013 v1
No ratings yet
SSM63817 - Smart Key Programming Advice SSM March 2013 v1
5 pages
Precision Bearings Guide
No ratings yet
Precision Bearings Guide
46 pages
Lesson Plan 2
No ratings yet
Lesson Plan 2
3 pages
Case Details
No ratings yet
Case Details
2 pages
1.1 Social Stratification - Class, Status and Power by Peter Worsely
100% (3)
1.1 Social Stratification - Class, Status and Power by Peter Worsely
6 pages
RSLTE001 - System Program Cell Level - RSLTE-LNBTS-2-Day-rslte LTE17A Reports RSLTE001 XML-2018 03-27-06!40!24 955
No ratings yet
RSLTE001 - System Program Cell Level - RSLTE-LNBTS-2-Day-rslte LTE17A Reports RSLTE001 XML-2018 03-27-06!40!24 955
1,000 pages
The Setup PDF
No ratings yet
The Setup PDF
921 pages
Consumer Attitudes: Motorola in Bangladesh
No ratings yet
Consumer Attitudes: Motorola in Bangladesh
28 pages
McLoyd, Vonnie C.. (1998) - Socioeconomic Disadvantage and Child Development.. American Psychologist, 53
No ratings yet
McLoyd, Vonnie C.. (1998) - Socioeconomic Disadvantage and Child Development.. American Psychologist, 53
20 pages

Week 2

Uploaded by

Week 2

Uploaded by

import pandas as pd

result_row = [Link](df1, df2, on='Name')

Name Country Role ID

result_outer = [Link](df1, df2, on='Name', how='outer')

Result Left Join:

Result Right Join:

Result Outer Join:

result_outer = [Link](df1, df2, on='Name', how='outer')

Result Left Join:

Result Right Join:

Result Outer Join:

# Sales Dictionary and Region Dictionary

# b) Merging with Inner Join

Drop rows with any missing values:

Drop columns with at least one missing value:

Drop rows/columns based on threshold:

Replace NaN with the previous value:

Replace NaN with the previous value, limit=1:

Replace NaN with the forward value:

fruit = { 'orange' : [3,2,0,1], 'apple' : [0,3,7,2], 'grapes' : [7,14,6,15] }

orange apple grapes

Next steps: Generate code with df1

fruit = { 'grapes' : [13,12,10,2,55,98], 'mango' : [10,13,17,2,9,76], 'banana' : [20,23,27,4,[Link],[Link]]} # Added [Link]

grapes mango banana

Next steps: Generate code with df2

Next steps: Generate code with df2

[Link]((df1, df2), axis = 0)

orange apple grapes mango banana

0 3.0 0.0 7 NaN NaN

1 2.0 3.0 14 NaN NaN

2 0.0 7.0 6 NaN NaN

3 1.0 2.0 15 NaN NaN

0 NaN NaN 13 10.0 20.0

1 NaN NaN 12 13.0 23.0

3 NaN NaN 2 2.0 4.0

4 NaN NaN 55 9.0 NaN

5 NaN NaN 98 76.0 NaN

orange apple grapes

Next steps: Generate code with df1

[Link]([df1, df2], ignore_index=True)

orange apple grapes mango banana

0 3.0 0.0 7 NaN NaN

1 2.0 3.0 14 NaN NaN

2 0.0 7.0 6 NaN NaN

3 1.0 2.0 15 NaN NaN

4 NaN NaN 13 10.0 20.0

5 NaN NaN 12 13.0 23.0

6 NaN NaN 2 2.0 4.0

7 NaN NaN 55 9.0 NaN

8 NaN NaN 98 76.0 NaN

CPU times: user 17.4 ms, sys: 0 ns, total: 17.4 ms

Start coding or generate with AI.

You might also like