Pandas Practice

The document outlines a practice lab for using the Pandas library in Python, focusing on creating DataFrames and Series, as well as selecting and slicing data. It includes exercises on using the loc() and iloc() functions for data selection, along with practical coding examples. The lab aims to enhance understanding of data manipulation using Pandas within a 30-minute timeframe.

Uploaded by

mktpvh

Pandas_Practice

March 6, 2025

1 Practice Lab: Selecting data in a Dataframe

Estimated time needed: 30 minutes

1.1 Objectives
After completing this lab you will be able to:
• Use Pandas Library to create DataFrame and Series
• Locate data in the DataFrame using loc() and iloc() functions
• Use slicing

1.1.1 Exercise 1: Pandas: DataFrame and Series

Pandas is a popular library for data analysis built on top of the Python programming language.
Pandas provides two main data structures for manipulating data:
• DataFrame
• Series
A DataFrame is a two-dimensional data structure, i.e., data is aligned in a tabular fashion in rows
and columns.
• A Pandas DataFrame is often created by loading a dataset from existing storage.
• Storage can be a SQL database, a CSV file, an Excel file, etc.
• It can also be created from lists, dictionaries, or a list of dictionaries.
A Series represents a one-dimensional array of indexed data. It has two main components:
1. An array of actual data.
2. An associated array of indexes or data labels.
The index is used to access individual data values. You can also get a column of a DataFrame as a
Series. You can think of a Pandas Series as a one-dimensional DataFrame, that is, a single labeled column.
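
As a minimal sketch of the two structures described above, a Series can be built from a list plus an index, and a DataFrame from a list of dictionaries. The variable names and sample values here are illustrative assumptions and are not part of the lab's own cells.

```python
import pandas as pd

# Illustrative data only: a Series holds values plus an index of labels
salaries = pd.Series([100000, 80000, 50000], index=['Rose', 'John', 'Jane'])

# The index label gives access to an individual value
rose_salary = salaries['Rose']

# A DataFrame built from a list of dictionaries: one dictionary per row
rows = [{'Name': 'Rose', 'ID': 1}, {'Name': 'John', 'ID': 2}]
people = pd.DataFrame(rows)

# Selecting one column with single brackets returns a Series
name_column = people['Name']

print(rose_salary)
print(type(name_column).__name__)
print(people.shape)
```

The dictionary keys become the column labels, and each dictionary in the list contributes one row.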

[1]: !pip install pandas

Collecting pandas
  Downloading pandas-2.2.3-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (89 kB)
Collecting numpy>=1.26.0 (from pandas)
  Downloading numpy-2.2.3-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (62 kB)
Requirement already satisfied: python-dateutil>=2.8.2 in /opt/conda/lib/python3.12/site-packages (from pandas) (2.9.0.post0)
Requirement already satisfied: pytz>=2020.1 in /opt/conda/lib/python3.12/site-packages (from pandas) (2024.2)
Collecting tzdata>=2022.7 (from pandas)
  Downloading tzdata-2025.1-py2.py3-none-any.whl.metadata (1.4 kB)
Requirement already satisfied: six>=1.5 in /opt/conda/lib/python3.12/site-packages (from python-dateutil>=2.8.2->pandas) (1.17.0)
Downloading pandas-2.2.3-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (12.7 MB)
Downloading numpy-2.2.3-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (16.1 MB)
Downloading tzdata-2025.1-py2.py3-none-any.whl (346 kB)
Installing collected packages: tzdata, numpy, pandas
Successfully installed numpy-2.2.3 pandas-2.2.3 tzdata-2025.1

[2]: # let us import the Pandas Library

import pandas as pd

Once you’ve imported pandas, you can then use the functions built in it to create and analyze data.
In this practice lab, we will learn how to create a DataFrame out of a dictionary.
Let us consider a dictionary ‘x’ with keys and values as shown below.
We then create a dataframe from the dictionary using the function pd.DataFrame(dict)

[3]: # Define a dictionary 'x'
x = {'Name': ['Rose', 'John', 'Jane', 'Mary'],
     'ID': [1, 2, 3, 4],
     'Department': ['Architect Group', 'Software Group', 'Design Team', 'Infrastructure'],
     'Salary': [100000, 80000, 50000, 60000]}

# Cast the dictionary to a DataFrame
df = pd.DataFrame(x)

# Display the result
df

[3]:    Name  ID       Department  Salary
     0  Rose   1  Architect Group  100000
     1  John   2   Software Group   80000
     2  Jane   3      Design Team   50000
     3  Mary   4   Infrastructure   60000

We can see the direct correspondence between the dictionary and the table. The keys correspond to
the column labels, and each list of values fills that column, one entry per row.

Column Selection: To select a column in a Pandas DataFrame, we can access it by its column name.
Let's retrieve the data present in the ID column.

[4]: #Retrieving the "ID" column and assigning it to a variable x

x = df[['ID']]
x

[4]: ID
0 1
1 2
2 3
3 4

Let’s use the type() function and check the type of the variable.

[5]: #check the type of x

type(x)

[5]: pandas.core.frame.DataFrame

The output shows us that the type of the variable is a DataFrame object.

Access to multiple columns: Let us retrieve the data for the Department, Salary and ID columns.

[6]: # Retrieve the Department, Salary and ID columns and assign them to a variable z

z = df[['Department','Salary','ID']]
z

[6]: Department Salary ID

0 Architect Group 100000 1
1 Software Group 80000 2
2 Design Team 50000 3
3 Infrastructure 60000 4

1.1.2 Try it yourself

Problem 1: Create a dataframe to display the result as below:
[7]: #write your code here

Click here for the solution

a = {'Student':['David', 'Samuel', 'Terry', 'Evan'],
'Age':['27', '24', '22', '32'],
'Country':['UK', 'Canada', 'China', 'USA'],
'Course':['Python','Data Structures','Machine Learning','Web Development'],
'Marks':['85','72','89','76']}
df1 = pd.DataFrame(a)
df1

Problem 2: Retrieve the Marks column and assign it to a variable b

[8]: #write your code here

Click here for the solution

b = df1[['Marks']]
b

Problem 3: Retrieve the Country and Course columns and assign it to a variable c
[9]: #write your code here

Click here for the solution

c = df1[['Country','Course']]
c

To view the column as a series, just use one bracket:

[10]: # Get the Student column as a series Object

x = df1['Student']
x

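---------------------------------------------------------------------------
NameError                                 Traceback (most recent call last)
Cell In[10], line 3
      1 # Get the Student column as a series Object
----> 3 x = df1['Student']
      4 x

NameError: name 'df1' is not defined

This NameError occurs because df1 is only created in the Problem 1 solution above, which was not run in this session. Run that solution first, then re-run this cell to get the Student column as a Series.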

[ ]: #check the type of x

type(x)

Once x holds the Student column, type(x) returns pandas.core.series.Series, which shows that the variable is a Series object.
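
The difference between single and double brackets can be checked side by side. The following is a small sketch that rebuilds two columns of the lab's df so that it runs on its own; the names as_series and as_frame are illustrative assumptions.

```python
import pandas as pd

# Two columns of the lab's data, rebuilt so the example is self-contained
df = pd.DataFrame({'Name': ['Rose', 'John', 'Jane', 'Mary'],
                   'ID': [1, 2, 3, 4]})

as_series = df['ID']     # single brackets with one column name -> Series
as_frame = df[['ID']]    # double brackets (a list of column names) -> DataFrame

print(type(as_series).__name__)
print(type(as_frame).__name__)
print(as_series.shape, as_frame.shape)
```

The Series is one-dimensional with four values, while the DataFrame keeps a two-dimensional shape of four rows by one column.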

1.1.3 Exercise 2: loc() and iloc() functions
loc[] is a label-based selection method, which means that we pass the name (label) of the row
or column that we want to select. When a range is passed, loc[] includes its last element.
Note that although this lab writes loc() and iloc(), both are indexers and are used with square
brackets rather than called like functions.
Simple syntax for your understanding:
• loc[row_label, column_label]
iloc[] is an integer-position-based selection method, which means that we pass an integer position
to select a specific row/column. When a range is passed, iloc[] does not include its last element.
Simple syntax for your understanding:
• iloc[row_index, column_index]
Let us see some examples of both.

[ ]: # Access the value on the first row and the first column

df.iloc[0, 0]

[ ]: # Access the value on the first row and the third column

df.iloc[0,2]

[ ]: # Access the value in the row labeled 0 and the column named 'Salary'

df.loc[0, 'Salary']
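
The cells above select single values. The range behaviour described earlier (loc includes the last element of a range, iloc does not) can be sketched as follows. The DataFrame repeats the lab's data so the example runs on its own; by_label and by_position are illustrative names.

```python
import pandas as pd

df = pd.DataFrame({'Name': ['Rose', 'John', 'Jane', 'Mary'],
                   'ID': [1, 2, 3, 4],
                   'Department': ['Architect Group', 'Software Group',
                                  'Design Team', 'Infrastructure'],
                   'Salary': [100000, 80000, 50000, 60000]})

# Label-based: rows labeled 0, 1, 2 and columns Name through ID (both ends included)
by_label = df.loc[0:2, 'Name':'ID']

# Position-based: rows at positions 0, 1 and columns at positions 0, 1 (stop excluded)
by_position = df.iloc[0:2, 0:2]

print(by_label.shape)      # (3, 2)
print(by_position.shape)   # (2, 2)
```

The same 0:2 range therefore yields three rows with loc but only two rows with iloc.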

Let us create a new dataframe called ‘df2’ and assign ‘df’ to it. Now, let us set the “Name” column
as an index column using the method set_index().

[ ]: df2=df
df2=df2.set_index("Name")
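
One detail worth checking here: by default set_index returns a new DataFrame and leaves the original untouched, and the column positions shift once Name moves into the index. The sketch below rebuilds the lab's data to illustrate this; salary_pos_df and salary_pos_df2 are illustrative names.

```python
import pandas as pd

df = pd.DataFrame({'Name': ['Rose', 'John', 'Jane', 'Mary'],
                   'ID': [1, 2, 3, 4],
                   'Department': ['Architect Group', 'Software Group',
                                  'Design Team', 'Infrastructure'],
                   'Salary': [100000, 80000, 50000, 60000]})

# set_index returns a new DataFrame by default; df itself is not modified
df2 = df.set_index('Name')

print(list(df.columns))    # ['Name', 'ID', 'Department', 'Salary']
print(list(df2.columns))   # ['ID', 'Department', 'Salary']
print(list(df2.index))     # ['Rose', 'John', 'Jane', 'Mary']

# Column positions shift once Name leaves the columns
salary_pos_df = df.columns.get_loc('Salary')     # 3
salary_pos_df2 = df2.columns.get_loc('Salary')   # 2
```

This matters when using iloc on df2, because a column's integer position in df2 is one less than its position in df.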

[ ]: #To display the first 5 rows of new dataframe

df2.head()

[ ]: # Now, let us access a value using the row label (Name) and the column name

df2.loc['Jane', 'Salary']

1.1.4 Try it yourself

Use the loc() function to get the Department of Jane in the newly created dataframe df2.

[ ]: #write your code here

Click here for the solution

df2.loc['Jane', 'Department']

Use the iloc() function to get the Salary of Mary in the newly created dataframe df2.

[ ]: #write your code here

Click here for the solution

df2.iloc[3,2]

1.1.5 Exercise 3: Slicing

Slicing uses the [] operator to select a set of rows and/or columns from a DataFrame.
To slice out a set of rows, you use this syntax: data[start:stop].
Here, start is the index of the first row to select, and stop is the index one step
BEYOND the last row you want to select. You can slice by integer position or by label
(row labels and column names).
NOTE: When slicing in pandas, the start bound is included in the output.
So if you want to select rows 0, 1, and 2, your code would look like this: df.iloc[0:3].
This tells Python to start at index 0 and select rows 0, 1, and 2, up to but not including 3.
NOTE: Labels must be found in the DataFrame or you will get a KeyError.
Indexing by labels (i.e., using loc()) differs from indexing by integers (i.e., using iloc()). With loc(),
both the start bound and the stop bound are inclusive. When using loc(), integers can be used,
but the integers refer to the index label and not the position.
For example, selecting 1:4 with loc() returns the rows labeled 1 through 4, while selecting 1:4 with
iloc() returns the rows at positions 1, 2, and 3, so the two results differ.
We can also select a specific data value using a row and column location within the DataFrame
and iloc indexing.
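
To make the 1:4 comparison concrete, here is a small sketch that indexes the lab's data by ID, so the integer labels run from 1 to 4 instead of matching the positions 0 to 3. The name df_by_id and the lookup of label 7 are illustrative assumptions, not part of the lab.

```python
import pandas as pd

# The lab's data, indexed by ID so that the integer labels are 1, 2, 3, 4
df = pd.DataFrame({'Name': ['Rose', 'John', 'Jane', 'Mary'],
                   'ID': [1, 2, 3, 4],
                   'Salary': [100000, 80000, 50000, 60000]})
df_by_id = df.set_index('ID')

label_slice = df_by_id.loc[1:4]       # labels 1 through 4, stop included
position_slice = df_by_id.iloc[1:4]   # positions 1, 2, 3, stop excluded

print(list(label_slice['Name']))      # ['Rose', 'John', 'Jane', 'Mary']
print(list(position_slice['Name']))   # ['John', 'Jane', 'Mary']

# A single label that is not in the index raises KeyError
try:
    df_by_id.loc[7]
    missing_label_raised = False
except KeyError:
    missing_label_raised = True
print(missing_label_raised)           # True
```

With loc the integers are matched against the index labels, so all four rows come back; with iloc they are positions, so the first row is skipped.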

[ ]: # let us do the slicing using old dataframe df

df.iloc[0:2, 0:3]

[ ]: # Slice the old dataframe df with loc, selecting the rows with index labels 0, 1 and 2

df.loc[0:2,'ID':'Department']

[ ]: # Slice the new dataframe df2 with loc, where the index is Name, selecting the labels Rose, John and Jane

df2.loc['Rose':'Jane', 'ID':'Department']

Try it yourself
Using the loc() function, slice the old dataframe df to retrieve the Name, ID and Department
for the rows with index labels 2 and 3.

[ ]: # Write your code below and press Shift+Enter to execute

Click here for the solution

df.loc[2:3,'Name':'Department']

Congratulations, you have completed this lesson and the practice lab on Pandas

Date (YYYY-MM-DD)    Version    Changed By             Change Description
2022-03-31           0.1        Appalabhaktula Hema    Created initial version
