0% found this document useful (0 votes)

21 views9 pages

Exp 6

The document provides a comprehensive guide on performing various operations using the pandas library in Python, including creating DataFrames from different data structures, concatenating DataFrames, setting conditions for filtering data, and adding new columns. It also covers techniques for handling missing values, sorting, and grouping data. Each operation is illustrated with example code and outputs to demonstrate functionality.

Uploaded by

23981a4603

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views9 pages

Exp 6

Uploaded by

23981a4603

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Experiment – 6: Perform following operations using pandas

 Creating dataframe
 Concat()
 Setting conditions
 Adding a new column

Creating dataframe
Pandas DataFrames are two-dimensional, tabular data structures with labeled axes (rows and
columns). Here are several ways to create them:

Program – 1: From a Dictionary

import pandas as pd
data = {
"names": ['A', 'B', 'C'],
"rollno": [101, 102, 103]
}
df = pd.DataFrame(data)
print(df)
Output:
names rollno
0 A 50
1 B 40
2 C 45

Program – 2: From a json

import pandas as pd

data = {
"names":{
"0":"A",
"1":"B",
"2":"C",
},
" rollno":{
"0":101,
"1":102,
"2":103,
}
}
df = pd.DataFrame(data)
print(df)

Output:

names rollno
0 A 50
1 B 40
2 C 45
Program – 3: From a NumPy Array

import numpy as np
import pandas as pd

data = np.array([[1, 2, 3], [4, 5, 6], [7, 8, 9]])

# Changing row labels(index’s) and adding column labels

df = pd.DataFrame(data,index=["I","II","III"], columns=['A', 'B', 'C'])
print(df)

Output:
A B C
I 1 2 3
II 4 5 6
III 7 8 9

Concat()

The pd.concat() function is used to concatenate (combine) pandas objects (DataFrames or Series)
along a particular axis (rows or columns). It's one of the primary tools for combining data in pandas.
Program -1: Vertical Concatenation (Stacking DataFrames)

# Concatenating DataFrames with Same Columns

import pandas as pd

d1 = {
'A': [1, 2, 3],
'B': [4, 5, 6]
}
d2 = {
'A': [7, 8, 9],
'B': [10, 11, 12]
}
df1 = pd.DataFrame(d1)
df2 = pd.DataFrame(d2)
result = pd.concat([df1, df2])
print(result)

Output:
A B
0 1 4
1 2 5
2 3 6
0 7 10
1 8 11
2 9 12
Program -2 : Avoiding duplicate indexes
import pandas as pd

d1 = {
'A': [1, 2, 3],
'B': [4, 5, 6]
}
d2 = {
'A': [7, 8, 9],
'B': [10, 11, 12]
}
df1 = pd.DataFrame(d1)
df2 = pd.DataFrame(d2)
result = pd.concat([df1, df2],ignore_index=True)
print(result)

Output:
A B
0 1 4
1 2 5
2 3 6
3 7 10
4 8 11
5 9 12

Program -3 : Avoiding duplicate indexes

import pandas as pd

d1 = {
'A': [1, 2, 3],
'B': [4, 5, 6]
}
d2 = {
'A': [7, 8, 9],
'B': [10, 11, 12]
}
df1 = pd.DataFrame(d1,index=[0, 1, 2])
df2 = pd.DataFrame(d2,index=[3, 4, 5])
result = pd.concat([df1, df2])
print(result)

Output:
A B
0 1 4
1 2 5
2 3 6
3 7 10
4 8 11
5 9 12
Program – 4:
# Concatenating DataFrames with Different Column Names
import pandas as pd

d1 = {
'A': [1, 2, 3],
'B': [4, 5, 6]
}
d2 = {
'C': [7, 8, 9],
'D': [10, 11, 12]
}
df1 = pd.DataFrame(d1)
df2 = pd.DataFrame(d2)
result = pd.concat([df1, df2],ignore_index=True) # Handling Duplicate Index Values
print(result)

Output:
A B C D
0 1.0 4.0 NaN NaN
1 2.0 5.0 NaN NaN
2 3.0 6.0 NaN NaN
3 NaN NaN 7.0 10.0
4 NaN NaN 8.0 11.0
5 NaN NaN 9.0 12.0
Program – 5: # Concatenating DataFrames Horizontally

import pandas as pd
d1 = {
'A': [1, 2, 3],
'B': [4, 5, 6]
}
d2 = {
'C': [7, 8, 9],
'D': [10, 11, 12]
}
df1 = pd.DataFrame(d1)
df2 = pd.DataFrame(d2)
result = pd.concat([df1, df2],axis=1)
print(result)

Output:

A B C D
0 1 4 7 10
1 2 5 8 11
2 3 6 9 12
Setting conditions:
Pandas provides several ways to set conditions to filter or modify DataFrames and Series. Here are
the main methods:
Program:
1.Boolean Indexing: The most common way to filter data based on conditions:

Program:
import pandas as pd

data = {
'A': [1, 2, 3, 4],
'B': ['a', 'b', 'c', 'd'],
'C': [10, 20, 30, 40]
}
# Create a sample DataFrame
df = pd.DataFrame(data)
print("Original Data\n",df)

# Filter rows where column A > 2

r = df[df['A'] > 2]
print("Simple Condition:\n",f)

# Multiple conditions (use & for AND, | for OR, ~ for NOT)
r1 = df[(df['A'] > 1) & (df['C'] < 40)]
print("Compound Condition:\n",f1)

Output:
Original Data
A B C
0 1 a 10
1 2 b 20
2 3 c 30
3 4 d 40
Simple Condition:
A B C
2 3 c 30
3 4 d 40
Compound Condition:
A B C
1 2 b 20
2 3 c 30

2. query() Method
Filter using a query string:
result = df.query('A > 2 and C != 40')
result
Output:
A B C
A B C
2 3 c 30

3. where() Method

Keeps values where condition is True, replaces others with NaN (or specified value):
# Replace values not meeting condition with NaN
result = df.where(df['A'] > 2)
result
Output:
A B C
0 NaN NaN NaN
1 NaN NaN NaN
2 3.0 c 30.0
3 4.0 d 40.0

# Replace with a specific value

result = df.where(df['A'] > 2, other=-1)
result
Output:
A B C
0 -1 -1 -1
1 -1 -1 -1
2 3 c 30
3 4 d 40

4. loc[] for Conditional Selection and Assignment

Select or modify data based on conditions:

# Select data
result = df.loc[df['A'] > 2, ['B', 'C']]
result
Output:
B C
2 c 30
3 d 40

# Modify data based on condition

df.loc[df['A'] > 2, 'C'] = 99
df
Output:
A B C
A B C
0 1 a 10
1 2 b 20
2 3 c 99
3 4 d 99

5. String Conditions
For string operations:
# Rows where column B starts with 'a'
result = df[df['B'].str.startswith('a')]
result
Output:
A B C
0 1 a 10

# Rows where column B ends with 'd'

result = df[df['B'].str.startswith('d')]
result
Output:
A B C
3 4 d 99

# Contains pattern
result = df[df['B'].str.contains('b|c')]
result
Output:
A B C
1 2 b 20
2 3 c 99

Adding a new column:

There are several ways to add a new column to a pandas DataFrame. Here are the most common
methods:

1. Direct Assignment:
The simplest way to add a new column:

Program:
import pandas as pd

data = {
'A': [1, 2, 3],
'B': [4, 5, 6]
}
df = pd.DataFrame(data)

# Add new column 'C' with a scalar value

df['C'] = 10 # All rows get value 10
df
Output:
A B C
0 1 4 10
1 2 5 10
2 3 6 10

# Add new column with a list/array

df['D'] = [7, 8, 9]
df
Output:
A B C D
0 1 4 10 7
1 2 5 10 8
2 3 6 10 9

# Add column based on calculation

df['E'] = df['A'] + df['B']
df
Output:
A B C D E
0 1 4 10 7 5
1 2 5 10 8 7
2 3 6 10 9 9

2. Insert at Specific Position

# Insert column 'I' at position 1 (0-based index)

df.insert(1, 'I', [100, 200, 300])
df
Output:
A I B C D E
0 1 100 4 10 7 5
1 2 200 5 10 8 7
2 3 300 6 10 9 9

3. Adding Columns from Other DataFrames

df2 = pd.DataFrame({'M': [11, 12, 13]})

df['M'] = df2['M'] # Must have same length/index
df
Output:
A I B C D E M
0 1 100 4 10 7 5 11
1 2 200 5 10 8 7 12
2 3 300 6 10 9 9 13

Experiment – 7: Perform following operations using pandas

• Filling NaN with string
• Sorting based on column values
• groupby()

Filling NaN with string:

Important Points:
1. After filling NaN with strings, numeric columns will be converted to object dtype
2. Use inplace=True parameter to modify the original DataFrame instead of creating a copy
3. For mixed-type columns, be careful about type consistency
4. Consider using pd.NA instead of np.nan for newer pandas versions with string data

There are several ways to replace NaN (missing) values with strings in a pandas DataFrame. Here
are the most common methods:
1. Using fillna() Method:
Program:
import pandas as pd
data = {
'A': [1, 2, np.nan, 4],
'B': ['x', np.nan, 'y', np.nan],
'C': [np.nan, 'a', 'b', 'c']
}
# Create sample DataFrame with NaN values
df = pd.DataFrame(data)
print(df)
# Fill all NaN values with a string
result = df.fillna('missing')
result

Output:
A B C
0 1.0 x NaN
1 2.0 NaN a
2 NaN y b
3 4.0 NaN c

Unit 4 DSE
No ratings yet
Unit 4 DSE
9 pages
Python 2.1.2
No ratings yet
Python 2.1.2
7 pages
Pandas DataFrame Merging Guide
No ratings yet
Pandas DataFrame Merging Guide
62 pages
Chapter 2 Python Pandas - II
No ratings yet
Chapter 2 Python Pandas - II
19 pages
Lab Session 06: Perform Following Operations Using Pandas
No ratings yet
Lab Session 06: Perform Following Operations Using Pandas
5 pages
Unit3 - 3) Pandas - Ipynb - Colab
No ratings yet
Unit3 - 3) Pandas - Ipynb - Colab
11 pages
UnitIV 1
No ratings yet
UnitIV 1
4 pages
Pandas Series and DataFrame Guide
No ratings yet
Pandas Series and DataFrame Guide
98 pages
PDF&Rendition 1
No ratings yet
PDF&Rendition 1
47 pages
Python Interviews
No ratings yet
Python Interviews
154 pages
Python and SQL Programming Index
No ratings yet
Python and SQL Programming Index
29 pages
Introduction to Pandas DataFrames
100% (1)
Introduction to Pandas DataFrames
21 pages
Lab Session 06: Perform Following Operations Using Pandas Lab Session 06: Perform Following Operations Using Pandas
No ratings yet
Lab Session 06: Perform Following Operations Using Pandas Lab Session 06: Perform Following Operations Using Pandas
5 pages
Pandas Notes
No ratings yet
Pandas Notes
20 pages
Pandas
No ratings yet
Pandas
44 pages
Exp 3
No ratings yet
Exp 3
10 pages
The Pandas Series Object-Print
No ratings yet
The Pandas Series Object-Print
16 pages
Data Frame Demo
No ratings yet
Data Frame Demo
73 pages
Pandas & PyNumS Essentials
No ratings yet
Pandas & PyNumS Essentials
10 pages
Lab 9
No ratings yet
Lab 9
9 pages
Dataframe Ip
No ratings yet
Dataframe Ip
75 pages
Pandas Cheat Sheet
100% (1)
Pandas Cheat Sheet
2 pages
Python 2.1.3
No ratings yet
Python 2.1.3
6 pages
Pandas Tutorial
No ratings yet
Pandas Tutorial
9 pages
Edp 3
No ratings yet
Edp 3
16 pages
DataFrame Notes1
No ratings yet
DataFrame Notes1
32 pages
DSP Unit-5 Updated
No ratings yet
DSP Unit-5 Updated
23 pages
Python Cheat Sheet For Excel Users
No ratings yet
Python Cheat Sheet For Excel Users
5 pages
Data Wrangling with Pandas
No ratings yet
Data Wrangling with Pandas
16 pages
Lecture 9 Pandas
No ratings yet
Lecture 9 Pandas
176 pages
SBLC 1
No ratings yet
SBLC 1
23 pages
Pandas Adding Modifying Rows
No ratings yet
Pandas Adding Modifying Rows
3 pages
DS Practical
No ratings yet
DS Practical
30 pages
Pandas Introduction: What Is Python Pandas Used For?
No ratings yet
Pandas Introduction: What Is Python Pandas Used For?
28 pages
Python Pandas Dataframe: Parameter & Description
No ratings yet
Python Pandas Dataframe: Parameter & Description
12 pages
LIst of Practicals 2024 - 25 Class Xii
No ratings yet
LIst of Practicals 2024 - 25 Class Xii
10 pages
Pandas Data Structures and Operations
No ratings yet
Pandas Data Structures and Operations
36 pages
Data Frame Notes1
No ratings yet
Data Frame Notes1
7 pages
DataFrame Creation and Operations in Pandas
No ratings yet
DataFrame Creation and Operations in Pandas
15 pages
Python Pandas-Data Frames
No ratings yet
Python Pandas-Data Frames
41 pages
Pandas DataFrame1
No ratings yet
Pandas DataFrame1
22 pages
Python Cheat Sheet For Excel Users
100% (2)
Python Cheat Sheet For Excel Users
5 pages
12 IP Pandas DataFrame - Question Bank
No ratings yet
12 IP Pandas DataFrame - Question Bank
10 pages
Data Wrangling & Analysis Guide
100% (1)
Data Wrangling & Analysis Guide
36 pages
Pandas+With+Python+ +DATAhill+Solutions
No ratings yet
Pandas+With+Python+ +DATAhill+Solutions
24 pages
Advanced Pandas Operations Guide
No ratings yet
Advanced Pandas Operations Guide
64 pages
Pandas DataFrame Notes
No ratings yet
Pandas DataFrame Notes
13 pages
Introduction To Pandas in Data Analytics
No ratings yet
Introduction To Pandas in Data Analytics
12 pages
Pandas Data Wrangling Cheat Sheet
100% (2)
Pandas Data Wrangling Cheat Sheet
6 pages
Revision Notes DataFrame XII IP
No ratings yet
Revision Notes DataFrame XII IP
8 pages
Python Data Structures and Libraries Guide
No ratings yet
Python Data Structures and Libraries Guide
7 pages
Data Analysis With Python
No ratings yet
Data Analysis With Python
60 pages
Unit 1 Python Programming-Ii
No ratings yet
Unit 1 Python Programming-Ii
15 pages
Pandas
No ratings yet
Pandas
27 pages
Python Cheat Sheet: Pandas - Numpy - Sklearn Matplotlib - Seaborn BS4 - Selenium - Scrapy
100% (4)
Python Cheat Sheet: Pandas - Numpy - Sklearn Matplotlib - Seaborn BS4 - Selenium - Scrapy
11 pages
Pandas for Data Analysis Beginners
No ratings yet
Pandas for Data Analysis Beginners
89 pages
2.3-2 Settiing-Up Wireless Access Point
100% (2)
2.3-2 Settiing-Up Wireless Access Point
19 pages
Lecture 4 - Am Circuits
No ratings yet
Lecture 4 - Am Circuits
47 pages
Stabilized Mud Blocks
No ratings yet
Stabilized Mud Blocks
5 pages
Colregs Reviewer
No ratings yet
Colregs Reviewer
39 pages
Essay On India's Space Programme in 400 Words
No ratings yet
Essay On India's Space Programme in 400 Words
3 pages
Carrier Condensador 38ckc
No ratings yet
Carrier Condensador 38ckc
36 pages
Chitirai 09
No ratings yet
Chitirai 09
21 pages
Notebooks
No ratings yet
Notebooks
2 pages
Pronunciation and Stress Practice Test
50% (2)
Pronunciation and Stress Practice Test
8 pages
Structural Design Manual
83% (6)
Structural Design Manual
639 pages
Literary Analysis for Students
No ratings yet
Literary Analysis for Students
3 pages
Myocarditis Cases
No ratings yet
Myocarditis Cases
6 pages
Test 5
No ratings yet
Test 5
6 pages
Mechanical Engineering Thesis
50% (2)
Mechanical Engineering Thesis
97 pages
Industrial Pharmacy by Lachman 4th Edition
No ratings yet
Industrial Pharmacy by Lachman 4th Edition
1,311 pages
R481200E
No ratings yet
R481200E
2 pages
Conditional-Formulas Gempesaw
No ratings yet
Conditional-Formulas Gempesaw
9 pages
Macramé and Basketry Instructional Plan
No ratings yet
Macramé and Basketry Instructional Plan
8 pages
Germanisher Document
No ratings yet
Germanisher Document
70 pages
Datasheet - Type K Thermocouple
No ratings yet
Datasheet - Type K Thermocouple
2 pages
Tech Pack 2
No ratings yet
Tech Pack 2
7 pages
Ruiz Llacsahuanga Et Al Prevalence of Listeria Species On Food Contact Surfaces in Washington State Apple Packinghouses
No ratings yet
Ruiz Llacsahuanga Et Al Prevalence of Listeria Species On Food Contact Surfaces in Washington State Apple Packinghouses
13 pages
Model 88 System: High Speed. High Use Applications
No ratings yet
Model 88 System: High Speed. High Use Applications
2 pages
Bajaj Two Wheelers Company Overview
No ratings yet
Bajaj Two Wheelers Company Overview
22 pages
B.Tech R20 IQuestion Bank CC
No ratings yet
B.Tech R20 IQuestion Bank CC
3 pages
Greek Mythology: Dragon Combat Analysis
No ratings yet
Greek Mythology: Dragon Combat Analysis
13 pages
Unit 5 - Dr.D.umanandhini (Autosaved)
No ratings yet
Unit 5 - Dr.D.umanandhini (Autosaved)
77 pages
TrixiaVillasotoo Casestudy
No ratings yet
TrixiaVillasotoo Casestudy
26 pages
HELMI English Cardi
100% (3)
HELMI English Cardi
5 pages
Palletpack 460: Function Package
No ratings yet
Palletpack 460: Function Package
2 pages

Exp 6

Uploaded by

Exp 6

Uploaded by

Experiment – 6: Perform following operations using pandas

Program – 1: From a Dictionary

Program – 2: From a json

data = np.array([[1, 2, 3], [4, 5, 6], [7, 8, 9]])

# Changing row labels(index’s) and adding column labels

# Concatenating DataFrames with Same Columns

Program -3 : Avoiding duplicate indexes

# Filter rows where column A > 2

# Replace with a specific value

4. loc[] for Conditional Selection and Assignment

Select or modify data based on conditions:

# Modify data based on condition

# Rows where column B ends with 'd'

Adding a new column:

# Add new column 'C' with a scalar value

# Add new column with a list/array

# Add column based on calculation

2. Insert at Specific Position

# Insert column 'I' at position 1 (0-based index)

3. Adding Columns from Other DataFrames

df2 = pd.DataFrame({'M': [11, 12, 13]})

Experiment – 7: Perform following operations using pandas

Filling NaN with string:

You might also like