Introduction To Pandas

Pandas is a Python library for data manipulation and analysis, introducing two main data structures: Series and DataFrame. It allows efficient handling of large datasets, data cleaning, and preprocessing, and can be installed via pip. The document also covers how to create and access Series, along with attributes and CRUD operations on Series objects.

Uploaded by

kartikyadav102003


Introduction to Pandas

The name Pandas is derived from "panel data". Pandas is a Python library used for data
manipulation and analysis, and it provides a convenient way to analyze
and clean data.
The Pandas library introduces two new data structures to Python,
Series and DataFrame, both of which are built on top of NumPy.

Why Use Pandas?

1. Handle Large Data Efficiently
2. Tabular Data Representation
3. Data Cleaning and Preprocessing
4. Free and Open-Source

Install Pandas
To install pandas, you need Python and pip installed on your system. If you
have both installed already, you can install pandas by entering
the following command in the terminal:

pip install pandas

Import Pandas in Python

We can import Pandas in Python using the import statement.

import pandas as pd

The code above imports the pandas library into our program with the alias pd.

After this import statement, we can use Pandas functions and objects by
calling them through pd.
For example, you can create a Pandas DataFrame in your program
using pd.DataFrame().
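
As a minimal sketch of calling pd.DataFrame() through the alias, the example below builds a small table from a dictionary. The column names and values are made up for illustration and do not come from these notes.

```python
import pandas as pd

# hypothetical sample data, used only for illustration
df = pd.DataFrame({"name": ["Asha", "Ravi"], "age": [30, 25]})

print(df.shape)          # (number of rows, number of columns)
print(list(df.columns))  # the column labels
```

Each key of the dictionary becomes a column, and each column behaves like a Series.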

Pandas Series
A Pandas Series is a one-dimensional labeled array-like object that can
hold data of any type.

A Pandas Series can be thought of as a column in a spreadsheet or a
single column of a DataFrame. It consists of two main components: the
labels and the data.

For example,

0 'John'
1 30
2 6.2
3 False
dtype: object

Here, the Series has two components: the labels (0, 1, 2 and 3) and the data
('John', 30, 6.2, False).
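
The two components can be inspected separately. The sketch below rebuilds the mixed-type Series shown above and reads its labels and data through the index and values attributes; the variable names are chosen only for this illustration.

```python
import pandas as pd

# a Series holding values of different types, as in the example above
mixed = pd.Series(["John", 30, 6.2, False])

labels = list(mixed.index)   # the label component
values = list(mixed.values)  # the data component

print(labels)
print(values)
print(mixed.dtype)  # mixed types are stored with the generic object dtype
```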

Create a Pandas Series

There are multiple ways to create a Pandas Series, but the most common
way is by using a Python list. Let's see an example of creating a Series
using a list:
import pandas as pd

# create a list
data = [10, 20, 30, 40, 50]

# create a series from the list

my_series = pd.Series(data)
print(my_series)

Output

0 10
1 20
2 30
3 40
4 50
dtype: int64
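
A list is only one possible starting point. As a hedged sketch of two other common options, the example below creates one Series from a NumPy array and another from a single value repeated over a set of labels. The variable names and sample values are illustrative assumptions.

```python
import numpy as np
import pandas as pd

# from a NumPy array: the array's float dtype carries over to the Series
from_array = pd.Series(np.array([1.5, 2.5, 3.5]))

# from a single value: the scalar is repeated once for every label in index
from_scalar = pd.Series(5, index=["a", "b", "c"])

print(from_array)
print(from_scalar)
```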

Labels
The labels in a Pandas Series are index numbers by default. As in a DataFrame
or an array, the index numbers in a Series start from 0.

Such labels can be used to access a specific value. For example,

import pandas as pd

# create a list
data = [10, 20, 30, 40, 50]

# create a series from the list

my_series = pd.Series(data)

# display third value in the series

print(my_series[2])

Output

30

We can also specify labels while creating the series by using
the index argument of the Series() constructor. For example,
import pandas as pd

# create a list
a = [1, 3, 5]

# create a series and specify labels

my_series = pd.Series(a, index = ["x", "y", "z"])

print(my_series)

Output

x 1
y 3
z 5
dtype: int64
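
Once custom labels are set, they can be used for lookups in the same way as the default numbers. A minimal sketch, reusing the labels x, y and z from the example above:

```python
import pandas as pd

my_series = pd.Series([1, 3, 5], index=["x", "y", "z"])

# look up a value by its custom label
middle = my_series["y"]
print(middle)

# the in operator tests the labels, not the stored values
print("x" in my_series)
print(1 in my_series)
```

Note that the membership test checks the labels, so 1 is reported as absent even though it is one of the stored values.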

Create Series From a Python Dictionary

You can also create a Pandas Series from a Python dictionary. For
example,
import pandas as pd

# create a dictionary
grades = {"Semester1": 3.25, "Semester2": 3.28, "Semester3": 3.75}

# create a series from the dictionary

my_series = pd.Series(grades)
# create a second series that keeps only the selected keys
my_specific_series = pd.Series(grades, index = ["Semester1", "Semester2"])
# display both series
print(my_series)
print("SPECIFIC SERIES -")
print(my_specific_series)

Output

Semester1 3.25
Semester2 3.28
Semester3 3.75
dtype: float64
SPECIFIC SERIES -
Semester1 3.25
Semester2 3.28
dtype: float64
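
When the index argument names a key that is not in the dictionary, pandas keeps the label and fills its value with NaN. The sketch below illustrates this with "Semester4", a hypothetical label that does not appear in the grades dictionary.

```python
import pandas as pd

grades = {"Semester1": 3.25, "Semester2": 3.28, "Semester3": 3.75}

# "Semester4" is a hypothetical label with no matching dictionary key
partial = pd.Series(grades, index=["Semester1", "Semester4"])

print(partial)
print(partial.isna().tolist())  # marks which labels received NaN
```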
Accessing Elements of a Series

There are two ways to access the elements of a Series:
1. Accessing elements by position: Every element has an integer position that starts from 0.
Use the index operator [ ] with an integer to access a single element. With the default
labels (0, 1, 2, ...) the position and the label are the same number. To access multiple
elements, use a slice.

import pandas as pd
import numpy as np
# creating simple array
data = np.array(['g','e','e','k','s','f','o','r','g','e','e','k','s'])
ser = pd.Series(data)
# retrieve the first five elements
print(ser[:5])

Output

0 g
1 e
2 e
3 k
4 s
dtype: object
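
Positional access can also be written with the iloc indexer, which always counts positions regardless of what the labels are. The following is a minimal sketch on the same letters; building the Series from list("geeksforgeeks") is a shortcut chosen for this illustration.

```python
import pandas as pd

ser = pd.Series(list("geeksforgeeks"))

first = ser.iloc[0]         # a single element by position
first_five = ser.iloc[:5]   # positions 0 to 4
last_three = ser.iloc[-3:]  # negative positions count from the end

print(first)
print(first_five.tolist())
print(last_three.tolist())
```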

2. Accessing elements using a label (index):

A Series is like a fixed-size dictionary in that you can get and set values by index label.
To access an element by label, pass its index label to the index operator [ ].
Accessing a single element using an index label

import pandas as pd
import numpy as np
# creating simple array
data = np.array(['g','e','e','k','s','f','o','r','g','e','e','k','s'])
# use custom integer labels from 10 to 22
ser = pd.Series(data, index=[10,11,12,13,14,15,16,17,18,19,20,21,22])
# accessing an element using its index label
print(ser[16])
Output :
o
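
Getting and setting by label can be made explicit with the loc indexer. The sketch below uses a shortened version of the Series above (five letters with labels 10 to 14, an assumption made to keep the example small) and shows a read, an update, and two multi-label selections.

```python
import pandas as pd

ser = pd.Series(list("geeks"), index=[10, 11, 12, 13, 14])

before = ser.loc[12]         # get a value by label
ser.loc[12] = "x"            # set a value by label
picked = ser.loc[[10, 12]]   # select several labels at once
sliced = ser.loc[10:12]      # a label slice includes both end labels

print(before)
print(picked.tolist())
print(sliced.tolist())
```

Unlike a positional slice, the label slice 10:12 returns three elements because the end label is included.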

Series object attributes

A Series attribute is any piece of information about a Series object, such as its
size or data type. Below are some of the attributes that you can use to get
information about a Series object:

Attribute               Description

Series.index            The index (labels) of the Series.
Series.shape            A tuple giving the shape of the data.
Series.dtype            The data type of the data.
Series.size             The number of elements in the data.
Series.empty            True if the Series object is empty, otherwise False.
Series.hasnans          True if there are any NaN values, otherwise False.
Series.nbytes           The number of bytes in the data.
Series.ndim             The number of dimensions in the data.
Series.dtype.itemsize   The size in bytes of one item of the data type (older notes
                        call this Series.itemsize, which recent pandas versions no
                        longer provide).

Example - Retrieving the index array, data array, type (dtype) and size of type (itemsize) of a Series object

import pandas as pd
x = pd.Series(data=[2, 4, 6, 8])
y = pd.Series(data=[11.2, 18.6, 22.5], index=['a', 'b', 'c'])
print(x.index)
print(x.values)
print(y.index)
print(y.values)
print(x.dtype)
# Series.itemsize is not available in recent pandas versions; read it from the dtype
print(x.dtype.itemsize)
print(y.dtype)
print(y.dtype.itemsize)

Output

RangeIndex(start=0, stop=4, step=1)

[2 4 6 8]
Index(['a', 'b', 'c'], dtype='object')
[11.2 18.6 22.5]
int64
8
float64
8

Retrieving Dimension, Size and Number of bytes:

import pandas as pd
a = pd.Series(data=[1, 2, 3, 4])
b = pd.Series(data=[4.9, 8.2, 5.6], index=['x', 'y', 'z'])
print(a.ndim, b.ndim)
print(a.size, b.size)
print(a.nbytes, b.nbytes)

Output

1 1
4 3
32 24
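
The byte counts above follow from the other attributes: the number of bytes equals the number of elements multiplied by the size of one item. A minimal sketch that checks this relationship and also shows shape, which the examples above do not print:

```python
import pandas as pd

a = pd.Series(data=[1, 2, 3, 4])
b = pd.Series(data=[4.9, 8.2, 5.6], index=['x', 'y', 'z'])

# shape is a one-element tuple because a Series has a single dimension
print(a.shape, b.shape)

# nbytes = number of elements * bytes per element
print(a.size * a.dtype.itemsize == a.nbytes)
print(b.size * b.dtype.itemsize == b.nbytes)
```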
Checking Emptiness and Presence of NaNs
To check whether a Series object is empty, you can use the empty attribute. Similarly, to
check whether a Series object contains any NaN values, you can use
the hasnans attribute.

Example

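import numpy as np
import pandas as pd
# np.nan is the spelling accepted by every NumPy version (np.NaN was removed in NumPy 2.0)
a = pd.Series(data=[1, 2, 3, np.nan])
b = pd.Series(data=[4.9, 8.2, 5.6], index=['x', 'y', 'z'])
# an explicit dtype avoids a warning that some pandas versions raise for an empty Series
c = pd.Series(dtype='float64')
print(a.empty, b.empty, c.empty)
print(a.hasnans, b.hasnans, c.hasnans)
# len() counts every element; count() skips NaN values
print(len(a), len(b))
print(a.count(), b.count())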

Output

False False True

True False False
4 3
3 3
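
The gap between len() and count() in the output is the number of NaN values. As a small sketch under that observation, the example below counts the missing values directly with isna() and then removes them with dropna(); the variable names are illustrative.

```python
import numpy as np
import pandas as pd

a = pd.Series(data=[1, 2, 3, np.nan])

missing = int(a.isna().sum())  # number of NaN values
cleaned = a.dropna()           # a new Series without the NaN values

print(len(a), a.count(), missing)
print(cleaned.tolist())
```

dropna() returns a new Series and leaves the original unchanged.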

CRUD Operations on a Pandas Series

import pandas as pd

# Create operation
Phymarks = pd.Series([70, 80, 90, 100])
print("Printing the original Series")
print(Phymarks)

print("Accessing the first element")
print(Phymarks[0])  # Read operation

print("Modifying the first element as 57")
Phymarks[0] = 57  # Update operation
print(Phymarks)

print("Deleting an element")
del Phymarks[3]  # Delete operation
print(Phymarks)

print("Print marks greater than 80")
print(Phymarks[Phymarks > 80])

Output

Printing the original Series
0 70
1 80
2 90
3 100
dtype: int64
Accessing the first element
70
Modifying the first element as 57
0 57
1 80
2 90
3 100
dtype: int64
Deleting an element
0 57
1 80
2 90
dtype: int64
Print marks greater than 80
2 90
dtype: int64
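
Two related operations are worth a sketch: adding a new element to an existing Series, and deleting without changing the original. The example below assumes a fresh Series named marks (a name chosen for this illustration), adds an element by assigning to a new label through loc, and removes it again with drop(), which returns a new Series.

```python
import pandas as pd

marks = pd.Series([70, 80, 90, 100])

# assigning to a label that does not exist yet adds a new element
marks.loc[4] = 95
print(marks.tolist())

# drop() returns a new Series; marks itself still has five elements
without_last = marks.drop(4)
print(without_last.tolist())
print(len(marks))
```

drop() is a reasonable choice when the original data should stay intact, while del modifies the Series in place.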

