0% found this document useful (0 votes)

5 views12 pages

Dataframes-I (Create - Selection)

The document provides an overview of Pandas DataFrames, which are two-dimensional objects used to represent data in rows and columns, similar to MySQL tables. It details various methods for creating DataFrames using lists, dictionaries, and numpy arrays, as well as techniques for selecting rows and columns using label and integer location. Additionally, it explains how to display specific rows and columns, including using functions like head() and tail().

Uploaded by

RAHUL BARUAH

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views12 pages

Dataframes-I (Create - Selection)

Uploaded by

RAHUL BARUAH

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 12

PANDAS DATA FRAME

It is a two dimensional object that is used to represent data in rows and columns.
It is similar to our mysql tables. Once we store data in this format, we can perform
various operations that are useful in analyzing and understanding the data. It can
contain heterogeneous data. The size and data of a dataframe are mutable ie.
they can change.

Admno Name Class Section Marks Column

100 Sushmita 12 A 78 names
101 Sarika 12 A 84
102 Aman 12 B 90
103 Kartavya 12 C 70 Row

Column
Dataframe has row and column index
A dataframe can be created using any of the following:
1. Lists
2. Dictionary
3. Numpy 2D array
4. Series
CREATION OF DATAFRAMES
Ways to create a dataframe:

a) Creating an empty dataframe

import pandas as pd
df=pd.DataFrame()
print(df)

Output:
Empty Dataframe
Columns: [ ]
Index: [ ]

b) Creating a dataframe
Example 1: (Using Lists)
import pandas as pd
d=[12,13,14,15,16]
df=pd.DataFrame(d)
print(df)

Output:
0 Default Column Name
0 12
1 13
2 14
3 15
4 16

Example 2: (Using Sublists)

import pandas as pd
data=[[‘ aman’, 45],[ 'vishal', 56],[ 'soniya', 67]]
df=pd.DataFrame(data,columns=[‘name’,’age’])
print(df)

Output:
name age
0 aman 45
1 vishal 56
2 soniya 67
Example 3: (Using dtype)
import pandas as pd
data=[[‘aman’, 45],[ 'vishal', 56],[ 'soniya', 67]]
df=pd.DataFrame(data,columns=[‘name’,’age’], dtype=float)
print(df)

Output:
name age
0 aman 45.0
1 vishal 56.0
2 soniya 67.0

c) Creating a dataframe using dictionary

Example 1. (With default index)

import pandas as pd
dict1={'names':['aman','vishal','soniya','parth'],'marks':[45,56,67,78]}
df=pd.DataFrame(dict1)
print(df)

Output:
names marks
0 aman 45
1 vishal 56
2 soniya 67
3 parth 78

Example 2. (With specific index)

import pandas as pd
dict1={'names':['aman','vishal','soniya','parth'],'marks':[45,56,67,78]}
df=pd.DataFrame(dict1, index=[100,101,102,103])
print(df)

Output:
names marks
100 aman 45
101 vishal 56
102 soniya 67
103 parth 78
 Creating dataframe from list of dictionary
import pandas as pd
list1=[
{'name':'sushmita', 'surname':'Ghosh'},
{'name':'lakshay', 'surname':'Mehta'},
{'name':'Amir', 'surname':'khan'},
{'name':'Kapil', 'surname':'Dev'}]
df=pd.DataFrame(list1)
print(df)

Output:
name surname
0 sushmita Ghosh
1 lakshay Mehta
2 Amir khan
3 Kapil Dev
Consider the following code to create a dataframe named df, which will be
used as a reference for all the operations on dataframe done below:-

import pandas as pd
dict1={
'names':['aman','vishal','soniya','parth','sushant','Umang'] ,
'marks':[45,56,67,78,80,89] ,
's_class':[11,12,12,12,10,10] ,
'sec':['a','a','e','d','c','d']
}
df=pd.DataFrame(dict1, index=[100,101,102,103,104,105])
print(df)

Output:
SELECTION OF DATA FROM A DATAFRAME

ROWS SELECTION

a) Selection by Label: Rows can be selected by passing row label to a .loc()

function.

Example 1: Selecting Single row label

>>> print(df.loc[101])
Output:
names vishal
marks 56
s_class 12
sec a
Name: 101, dtype: object

Example 2: Selecting Multiple row labels

>>> print(df.loc[[103,104,105]])
Output:
names marks s_class sec
103 parth 78 12 d
104 sushant 80 10 c
105 Umang 89 10 d
b) Selection by Integer location: Rows can be selected by passing integer
location to an iloc() function.

Example 1: Selecting single row index

>>> print(df.iloc[2])
Output:
names soniya
marks 67
s_class 12
sec e
Name: 102, dtype: object

Example 2: Selecting multiple row index

>>> print(df.iloc[[2,4,5]])
Output:
names marks s_class sec
102 soniya 67 12 e
104 sushant 80 10 c
105 Umang 89 10 d
c) Slice Rows: Multiple rows can be selected using ‘ : ’ operator.
Example 1:
>>> print(df[2:4])
Output:
names marks s_class sec
102 soniya 67 12 e
103 parth 78 12 d

Example 2: Use of step value

>>> print(df[2:6:2])
Output:
names marks s_class sec
102 soniya 67 12 e
104 sushant 80 10 c

Example 3: Multiple rows can also be selected by using iloc()

>>> print(df.iloc[2:6:2])
Output:
names marks s_class sec
102 soniya 67 12 e
104 sushant 80 10 c
d) head() and tail ()
head() returns the first n rows (observe the index values). The default number
of elements to display is five, but you may pass a custom number.
Example :
>>> print(df.head(3))
Output
names marks s_class sec
100 aman 45 11 a
101 vishal 56 12 a
102 soniya 67 12 e
tail() returns the last n rows (observe the index values). The default number of
elements to display is five, but you may pass a custom number.
>>> print(df.tail(4))
Output
names marks s_class sec
102 soniya 67 12 e
103 parth 78 12 d
104 sushant 80 10 c
105 Umang 89 10 d
COLUMN SELECTION:

a) To display the contents of a particular column from the DataFrame we

write:
df [‘col_name’])
OR
df.col_name

Example 1:
>>>print(df[‘names’])
Output:
100 aman
101 vishal
102 soniya
103 parth
104 sushant
105 Umang
Name: names, dtype: object

Example 2:
>>>print(df.sec)
Output:
100 a
101 a
102 e
103 d
104 c
105 d
Name: sec, dtype: object
b)To access multiple columns we can write as:
df[ [‘col1’,’col2’, ……] ]

Example:
>>>print(df[['marks','sec']])
Output:
marks sec
100 45 a
101 56 a
102 67 e
103 78 d
104 80 c
105 89 d
SELECTING ROWS AND COLUMNS SIMULTANEOUSLY USING .loc()
Example:
>>> print(df.loc[[101,102],['names','sec']])
Output:
names sec
101 vishal a
102 soniya e

Creating DataFrames in Python Pandas
No ratings yet
Creating DataFrames in Python Pandas
10 pages
Pandas DataFrame Guide for Informatics
No ratings yet
Pandas DataFrame Guide for Informatics
11 pages
DataFrame Ac Win Final
No ratings yet
DataFrame Ac Win Final
30 pages
Creating DataFrames with Pandas
No ratings yet
Creating DataFrames with Pandas
43 pages
Chapter Notes - Data Handling Using Pandas DataFrame
No ratings yet
Chapter Notes - Data Handling Using Pandas DataFrame
16 pages
Pandas and Python
No ratings yet
Pandas and Python
24 pages
Accessing Data From DataFrame
No ratings yet
Accessing Data From DataFrame
4 pages
Pandas DataFrame: Syntax and Usage
No ratings yet
Pandas DataFrame: Syntax and Usage
70 pages
Dataframe Ip
No ratings yet
Dataframe Ip
75 pages
Pandas Dataframe
No ratings yet
Pandas Dataframe
8 pages
Pandas
No ratings yet
Pandas
5 pages
Understanding Pandas DataFrames in Python
No ratings yet
Understanding Pandas DataFrames in Python
35 pages
SBLC 1
No ratings yet
SBLC 1
23 pages
Data Science Notes Unit-1 Part - 2
No ratings yet
Data Science Notes Unit-1 Part - 2
22 pages
05 Pandas Data Frames
No ratings yet
05 Pandas Data Frames
33 pages
Data Frames
No ratings yet
Data Frames
60 pages
Pandas DataFrame1
No ratings yet
Pandas DataFrame1
22 pages
Python Pandas-Data Frames
No ratings yet
Python Pandas-Data Frames
41 pages
IP 12th Chapter 3
No ratings yet
IP 12th Chapter 3
9 pages
DataFrame Creation and Operations in Pandas
No ratings yet
DataFrame Creation and Operations in Pandas
15 pages
DataFrame Notes1
No ratings yet
DataFrame Notes1
32 pages
Chapter 1 - Part 2 - DataFrame
No ratings yet
Chapter 1 - Part 2 - DataFrame
48 pages
Pandas Practice
No ratings yet
Pandas Practice
7 pages
Pandas
No ratings yet
Pandas
8 pages
XII IP Resource Material - DataFrame
No ratings yet
XII IP Resource Material - DataFrame
22 pages
Data Frame
No ratings yet
Data Frame
17 pages
Pandas DataFrame Basics Guide
No ratings yet
Pandas DataFrame Basics Guide
4 pages
For Assignment-3 (Final - Pandas - Lab)
No ratings yet
For Assignment-3 (Final - Pandas - Lab)
40 pages
Pandas Guide
No ratings yet
Pandas Guide
50 pages
Pandas
No ratings yet
Pandas
27 pages
Python
No ratings yet
Python
16 pages
Data Aggregation and Group Operations
No ratings yet
Data Aggregation and Group Operations
34 pages
Python Data Science: Pandas & ML Basics
100% (1)
Python Data Science: Pandas & ML Basics
41 pages
Data Handing Using Pandas-I
100% (2)
Data Handing Using Pandas-I
46 pages
Pandas Module Overview and Usage Guide
No ratings yet
Pandas Module Overview and Usage Guide
15 pages
Lab 9
No ratings yet
Lab 9
9 pages
Pandas DataFrame Basics
No ratings yet
Pandas DataFrame Basics
48 pages
Creating and Using Pandas Dataframes
No ratings yet
Creating and Using Pandas Dataframes
14 pages
Data Frame Demo
No ratings yet
Data Frame Demo
73 pages
Understanding Pandas Data Structures
No ratings yet
Understanding Pandas Data Structures
56 pages
1 - Indexing in Pandas
No ratings yet
1 - Indexing in Pandas
8 pages
Data Analysis with Pandas
No ratings yet
Data Analysis with Pandas
31 pages
Lecture 1 On DataFrame
No ratings yet
Lecture 1 On DataFrame
4 pages
Pandas
No ratings yet
Pandas
13 pages
Pandas Data Analysis Techniques
No ratings yet
Pandas Data Analysis Techniques
8 pages
Starting Out With Pandas - Ext
No ratings yet
Starting Out With Pandas - Ext
18 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
16 pages
Python Pandas DataFrame Guide
No ratings yet
Python Pandas DataFrame Guide
53 pages
Pandas Introduction: What Is Python Pandas Used For?
No ratings yet
Pandas Introduction: What Is Python Pandas Used For?
28 pages
Unit 2 notes-II
No ratings yet
Unit 2 notes-II
47 pages
Unit-4Introduction To Pandas
No ratings yet
Unit-4Introduction To Pandas
44 pages
Data Handling Using Pandas-1
No ratings yet
Data Handling Using Pandas-1
60 pages
Pandas (PPT 5)
No ratings yet
Pandas (PPT 5)
16 pages
Pandas: DataFrames & Series Guide
No ratings yet
Pandas: DataFrames & Series Guide
2 pages
Pandas Notes
No ratings yet
Pandas Notes
20 pages
Ainotes Dataframe
No ratings yet
Ainotes Dataframe
5 pages
Data Frames
No ratings yet
Data Frames
42 pages
Universal Design Principles Guide
No ratings yet
Universal Design Principles Guide
40 pages
AR & VR Seminar Report
No ratings yet
AR & VR Seminar Report
10 pages
Familiarization With AVR Trainer Kit and Related Software: United International University
100% (2)
Familiarization With AVR Trainer Kit and Related Software: United International University
18 pages
AI TH-312 WS: Face & Temp Recognition Camera
No ratings yet
AI TH-312 WS: Face & Temp Recognition Camera
7 pages
History of Operating Systems
No ratings yet
History of Operating Systems
7 pages
BPR and Prototyping Tools Guide
No ratings yet
BPR and Prototyping Tools Guide
8 pages
Bt20cse170 Internship Report
No ratings yet
Bt20cse170 Internship Report
27 pages
End Term Paper on Data Warehousing
No ratings yet
End Term Paper on Data Warehousing
6 pages
Ford FANUC R30iA R30iB NextGen E83
No ratings yet
Ford FANUC R30iA R30iB NextGen E83
83 pages
CP Erakshak Project
No ratings yet
CP Erakshak Project
61 pages
5600T230 - Issue 004 - L24ei L36ei Scanner Quick Start Guide - English
No ratings yet
5600T230 - Issue 004 - L24ei L36ei Scanner Quick Start Guide - English
4 pages
Moisture Tester User Guide
No ratings yet
Moisture Tester User Guide
28 pages
Cisco Huawei
No ratings yet
Cisco Huawei
7 pages
Slot Machine Game in Python
No ratings yet
Slot Machine Game in Python
2 pages
LG4300auditgard Series
No ratings yet
LG4300auditgard Series
4 pages
MOVIDRIVE B-Manual ISYNC 06-2005 EN
No ratings yet
MOVIDRIVE B-Manual ISYNC 06-2005 EN
120 pages
SolidWorks 2019 Shortcuts Guide
No ratings yet
SolidWorks 2019 Shortcuts Guide
13 pages
MP Lab
No ratings yet
MP Lab
1 page
Cybersecurity 2025 Presentation 20250825142457
No ratings yet
Cybersecurity 2025 Presentation 20250825142457
12 pages
Brute-Forcing Stay-Logged-In Cookies
No ratings yet
Brute-Forcing Stay-Logged-In Cookies
11 pages
Digital Alarm Clock Complete Guide
No ratings yet
Digital Alarm Clock Complete Guide
14 pages
Essential Mac Keyboard Shortcuts Guide
No ratings yet
Essential Mac Keyboard Shortcuts Guide
1 page
CA08103003Z EN INT - Apr2016 PDF
No ratings yet
CA08103003Z EN INT - Apr2016 PDF
316 pages
Study of TCP & UDP Performance DNS, FTP, WEB & Email Multi Server Configuration Using Cisco Packet Tracer. Theory
No ratings yet
Study of TCP & UDP Performance DNS, FTP, WEB & Email Multi Server Configuration Using Cisco Packet Tracer. Theory
25 pages
Priority Expiration Cache Design
No ratings yet
Priority Expiration Cache Design
3 pages
Resume Manish
No ratings yet
Resume Manish
1 page
Tableau - Table Calculations - Primer
75% (4)
Tableau - Table Calculations - Primer
28 pages
Pam Admin 12.6 Exercise Guide Ilt
No ratings yet
Pam Admin 12.6 Exercise Guide Ilt
280 pages
OpenPIV - Open Source Particle Image Velocimetry, Python Version
No ratings yet
OpenPIV - Open Source Particle Image Velocimetry, Python Version
19 pages
Odoo Marketplace Integration Simplified
No ratings yet
Odoo Marketplace Integration Simplified
33 pages

Dataframes-I (Create - Selection)

Uploaded by

Dataframes-I (Create - Selection)

Uploaded by

PANDAS DATA FRAME

Admno Name Class Section Marks Column

a) Creating an empty dataframe

Example 2: (Using Sublists)

c) Creating a dataframe using dictionary

Example 2. (With specific index)

a) Selection by Label: Rows can be selected by passing row label to a .loc()

Example 1: Selecting Single row label

Example 2: Selecting Multiple row labels

Example 1: Selecting single row index

Example 2: Selecting multiple row index

Example 2: Use of step value

Example 3: Multiple rows can also be selected by using iloc()

a) To display the contents of a particular column from the DataFrame we

You might also like