0% found this document useful (0 votes)

21 views6 pages

Data Analytic Assignment

Uploaded by

Ayush Shishirrr

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views6 pages

Data Analytic Assignment

Uploaded by

Ayush Shishirrr

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

shishir-data-analytic-assignment

November 14, 2023

link text Assignment-2 for Batch A ____________________________________________

Submited By- SHISHIR RANJAN _____________________________________________
Roll no. - 2312res600 _____________________________________________________
Email - [email protected] / [email protected]
_________________________________________________________________
colab Link - https://colab.research.google.com/drive/1KsKS7z0bBBVhd6S9LUJKftj7n3MWEBS4?usp=sharing

[18]: from google.colab import drive

drive.mount('/content/drive')

Drive already mounted at /content/drive; to attempt to forcibly remount, call

drive.mount("/content/drive", force_remount=True).

[19]: # Importing the numpy library

import numpy as np

# 1st--> Create an array of five elements and print its value.

arr = np.array([1, 2, 3, 4, 5])
print("Array is ", arr)
print("____________________________________")
#submitted by shishir ranjan

Array is [1 2 3 4 5]
____________________________________
QUESTION (1) COMPLETED ________________________________________________
QUESTION (2) .

[20]: # 2nd--> Use the above array elements to print the 1st and 2nd elements.
print("1st Element:", arr[0])
print("2nd Element:", arr[1])
print("____________________________________")
#submitted by shishir ranjan

1st Element: 1
2nd Element: 2
____________________________________

1
QUESTION (2) COMPLETED ________________________________________________
QUESTION (3).

[21]: # 3rd--> Get the third and fourth elements from the array and add them.
third_element = arr[2]
fourth_element = arr[3]
sum_third_fourth = third_element + fourth_element
print("Sum of 3rd and 4th elements:", sum_third_fourth)
print("____________________________________")
#submitted by shishir ranjan

Sum of 3rd and 4th elements: 7

____________________________________
QUESTION (3) COMPLETED ________________________________________________
QUESTION (4).

[22]: # 4th--> Print the elements from 1 to 5 from the above array.
print("Elements from 1 to 5:", arr[0:5])
print("____________________________________")
#submitted by shishir ranjan

Elements from 1 to 5: [1 2 3 4 5]
____________________________________
QUESTION (5) COMPLETED ________________________________________________
QUESTION (5).

[23]: # 5th--> Get the data type of the array.

data_type = arr.dtype
print("Data Type of the array:", data_type)
print("____________________________________")

"""--------------------- First question finish ------------------"""

#submitted by shishir ranjan

Data Type of the array: int64

____________________________________

[23]: '--------------------- First question finish ------------------'

DATA ANALYTIC ASSIGNMENT 1ST QUESTION SUBMITTED BY SHISHIR RANJAN .

_____________________________________________________

1.
[24]: # Importing the pandas library
import pandas as pd

2
# Loading the dataset
df1 = pd.read_csv("/content/drive/MyDrive/AirQualityUCI.csv")

# 1--> Drop the null values from the dataset. Use (dropna()) method.
df1 = df1.dropna()
print("(1) Dataset after dropping null values:\n", df1)
print("____________________________________")
#submitted by shishir ranjan

(1) Dataset after dropping null values:

Date;Time;CO(GT);PT08.S1(CO);NMHC(GT);C6H6(GT);PT08.S2(NMHC);NOx(GT);PT08.S3(NOx
);NO2(GT);PT08.S4(NO2);PT08.S5(O3);T;RH;AH;;
10/03/2004;18.00.00;2 6;1360;150;11 9;1046;166;1056;113;1692;1268;13 6;48 9;0
7578;;
10/03/2004;20.00.00;2 2;1402;88;9 0;939;131;1140;114;1555;1074;11 9;54 0;0
7502;;
10/03/2004;21.00.00;2 2;1376;80;9 2;948;172;1092;122;1584;1203;11 0;60 0;0
7867;;
10/03/2004;22.00.00;1 6;1272;51;6 5;836;131;1205;116;1490;1110;11 2;59 6;0
7888;;
10/03/2004;23.00.00;1 2;1197;38;4 7;750;89;1337;96;1393;949;11 2;59 2;0
7848;;
…
…
04/04/2005;10.00.00;3 1;1314;-200;13 5;1101;472;539;190;1374;1729;21 9;29 3;0
7568;;
04/04/2005;11.00.00;2 4;1163;-200;11 4;1027;353;604;179;1264;1269;24 3;23 7;0
7119;;
04/04/2005;12.00.00;2 4;1142;-200;12 4;1063;293;603;175;1241;1092;26 9;18 3;0
6406;;
04/04/2005;13.00.00;2 1;1003;-200;9 5;961;235;702;156;1041;770;28 3;13 5;0
5139;;
04/04/2005;14.00.00;2 2;1071;-200;11 9;1047;265;654;168;1129;816;28 5;13 1;0
5028;;

[6915 rows x 1 columns]

____________________________________
2.
[25]: # 2--> Replace NULL values with the number 130.
df1 = df1.fillna(130)
print("\n(2) Dataset after replacing NULL values with 130:\n", df1)
print("____________________________________")
#submitted by shishir ranjan

3
(2) Dataset after replacing NULL values with 130:

[6915 rows x 1 columns]

____________________________________
3.
[26]: # 3--> Filter the value of SO2 > 500. Use data frame (df. loc) methods.
# filtered_df1 = df1.loc[df1['SO2'] > 500]
print("\n(3) Filtered values where SO2 > 500:\n", "So2 column does not exist in␣
↪the given dataset")

print("____________________________________")
#submitted by shishir ranjan

(3) Filtered values where SO2 > 500:

So2 column does not exist in the given dataset
____________________________________
4.
[27]: # 4--> Use drop_duplicates() method to drop duplicate values from the dataset.
df1 = df1.drop_duplicates()

4
print("\n(4) Dataset after dropping duplicate values:\n", df1)
print("____________________________________")
#submitted by shishir ranjan

(4) Dataset after dropping duplicate values:

Date;Time;CO(GT);PT08.S1(CO);NMHC(GT);C6H6(GT);PT08.S2(NMHC);NOx(GT);PT08.S3(NOx
);NO2(GT);PT08.S4(NO2);PT08.S5(O3);T;RH;AH;;
10/03/2004;18.00.00;2 6;1360;150;11 9;1046;166;1056;113;1692;1268;13 6;48 9;0
7578;;
10/03/2004;20.00.00;2 2;1402;88;9 0;939;131;1140;114;1555;1074;11 9;54 0;0
7502;;
10/03/2004;21.00.00;2 2;1376;80;9 2;948;172;1092;122;1584;1203;11 0;60 0;0
7867;;
10/03/2004;22.00.00;1 6;1272;51;6 5;836;131;1205;116;1490;1110;11 2;59 6;0
7888;;
10/03/2004;23.00.00;1 2;1197;38;4 7;750;89;1337;96;1393;949;11 2;59 2;0
7848;;
…
…
04/04/2005;02.00.00;0 5;912;-200;1 5;544;69;959;55;1002;573;12 1;56 3;0
7927;;
04/04/2005;05.00.00;0 5;888;-200;1 3;528;77;1077;53;987;578;10 4;59 9;0
7550;;
04/04/2005;06.00.00;1 1;1031;-200;4 4;730;182;760;93;1129;905;9 5;63 1;0
7531;;
04/04/2005;11.00.00;2 4;1163;-200;11 4;1027;353;604;179;1264;1269;24 3;23 7;0
7119;;
04/04/2005;14.00.00;2 2;1071;-200;11 9;1047;265;654;168;1129;816;28 5;13 1;0
5028;;

[4941 rows x 1 columns]

____________________________________
5.
[28]: # 5--> Use the correlation method to show the relationship between columns (df.
↪corr).

correlation_matrix = df1.corr()
print("\n(5) Correlation matrix:\n", correlation_matrix)
print("____________________________________")
"""--------------------- Second question finish ------------------"""
#submitted by shishir ranjan

(5) Correlation matrix:

Empty DataFrame

5
Columns: []
Index: []
____________________________________
<ipython-input-28-4b6fa2a277c1>:2: FutureWarning: The default value of
numeric_only in DataFrame.corr is deprecated. In a future version, it will
default to False. Select only valid columns or specify the value of numeric_only
to silence this warning.
correlation_matrix = df1.corr()

[28]: '--------------------- Second question finish ------------------'

DATA ANALYTIC ASSIGNMENT 2ND QUESTION COMPLETED .

__________________________________________________________________
DATA ANALYTIC ASSIGMENT SUBMITTED BY SHSHIR RANJAN .
__________________________________________________________________

Python Array and Pandas Data Tasks
No ratings yet
Python Array and Pandas Data Tasks
6 pages
Exp 03 Record
No ratings yet
Exp 03 Record
10 pages
Data Cleaning with Pandas & NumPy
No ratings yet
Data Cleaning with Pandas & NumPy
20 pages
Part A Assignment 6
No ratings yet
Part A Assignment 6
28 pages
Acknowledgement
No ratings yet
Acknowledgement
25 pages
Python Libraries for Data Analysis
No ratings yet
Python Libraries for Data Analysis
4 pages
DMV - 4 - Jupyter Notebook
No ratings yet
DMV - 4 - Jupyter Notebook
8 pages
Cs Sem V Dav Upc 32347507 Sl. No. Qp. 4432 Dec '23
No ratings yet
Cs Sem V Dav Upc 32347507 Sl. No. Qp. 4432 Dec '23
16 pages
Assignment 1
No ratings yet
Assignment 1
2 pages
UNIT-4 Important Q-A
No ratings yet
UNIT-4 Important Q-A
28 pages
Pandas Library
No ratings yet
Pandas Library
6 pages
What Is A Series and How Is It Different From A 1-D Array, A List, and A Dictionary
No ratings yet
What Is A Series and How Is It Different From A 1-D Array, A List, and A Dictionary
3 pages
Applied Tech Lesson 45: 1 Lesson 45: Pie Chart & Bell Curve
No ratings yet
Applied Tech Lesson 45: 1 Lesson 45: Pie Chart & Bell Curve
25 pages
Chapter 1
No ratings yet
Chapter 1
7 pages
Set-D CT2 Answerkey
No ratings yet
Set-D CT2 Answerkey
11 pages
Pandas Module (Part-I)
No ratings yet
Pandas Module (Part-I)
36 pages
PW2 DataCleaning
No ratings yet
PW2 DataCleaning
6 pages
Tutorial 4
No ratings yet
Tutorial 4
8 pages
22cs701-Spm Unit 4
No ratings yet
22cs701-Spm Unit 4
2 pages
3rd Week Report
No ratings yet
3rd Week Report
7 pages
Data Science Practicals - Ipynb
No ratings yet
Data Science Practicals - Ipynb
54 pages
Series 1
No ratings yet
Series 1
408 pages
Dav Pyq 2023
No ratings yet
Dav Pyq 2023
15 pages
DHP Unit - 4 Part2
No ratings yet
DHP Unit - 4 Part2
16 pages
AD3301 DEV Lab Manual
No ratings yet
AD3301 DEV Lab Manual
26 pages
Python Data Handling with Pandas
No ratings yet
Python Data Handling with Pandas
12 pages
Essential Steps in Data Cleaning
No ratings yet
Essential Steps in Data Cleaning
17 pages
Data Cleaning
No ratings yet
Data Cleaning
13 pages
Ip Project
No ratings yet
Ip Project
21 pages
Question Bank 4
No ratings yet
Question Bank 4
4 pages
Exp3 Python
No ratings yet
Exp3 Python
15 pages
Numpy Boolean Indexing: Filter
No ratings yet
Numpy Boolean Indexing: Filter
39 pages
Data Analysis and Visualization Exam Paper
No ratings yet
Data Analysis and Visualization Exam Paper
12 pages
Dev Lab Manual Org
No ratings yet
Dev Lab Manual Org
28 pages
Python Data Science Cheat Sheet
97% (33)
Python Data Science Cheat Sheet
11 pages
Unit 5 Python
No ratings yet
Unit 5 Python
30 pages
Dev Lab Record
No ratings yet
Dev Lab Record
21 pages
AI & Data Science Lab Record
No ratings yet
AI & Data Science Lab Record
28 pages
PYQ Data Analysis and Visualisation Using Python GE May 2024
No ratings yet
PYQ Data Analysis and Visualisation Using Python GE May 2024
6 pages
Experiment No: 1 Title:: Creating Vectors and Data Frames and Implementing Data Summary Functions
No ratings yet
Experiment No: 1 Title:: Creating Vectors and Data Frames and Implementing Data Summary Functions
8 pages
DV Lab Manual Modified
No ratings yet
DV Lab Manual Modified
31 pages
Day 10 Pandasdatacleaning
No ratings yet
Day 10 Pandasdatacleaning
6 pages
CH 3 2
No ratings yet
CH 3 2
17 pages
Pandas: Import
100% (1)
Pandas: Import
13 pages
REYES WS 5 Cleaning Data in Python Nos. 1 6 PDF
100% (1)
REYES WS 5 Cleaning Data in Python Nos. 1 6 PDF
4 pages
Pandas Data Handling Lab Session
No ratings yet
Pandas Data Handling Lab Session
23 pages
DATAFRAME
No ratings yet
DATAFRAME
11 pages
Unit2 Part2 Da
No ratings yet
Unit2 Part2 Da
45 pages
Python MCQs
No ratings yet
Python MCQs
21 pages
Python Data Structures and Libraries Guide
No ratings yet
Python Data Structures and Libraries Guide
7 pages
Ge - Computer Science Data Analysis
No ratings yet
Ge - Computer Science Data Analysis
16 pages
Pandas Cheat Sheet for Data Manipulation
No ratings yet
Pandas Cheat Sheet for Data Manipulation
1 page
Ip Study
No ratings yet
Ip Study
18 pages
Programs For Practical
No ratings yet
Programs For Practical
3 pages
12 Ip Pa2 2024-25
No ratings yet
12 Ip Pa2 2024-25
7 pages
Ip (Hy-Qp)
No ratings yet
Ip (Hy-Qp)
8 pages
AccuTerm 2K2 Programmers Guide
No ratings yet
AccuTerm 2K2 Programmers Guide
320 pages
Semaphore Basics for Developers
No ratings yet
Semaphore Basics for Developers
10 pages
Database Design
No ratings yet
Database Design
31 pages
Unit 1 ADBMS
No ratings yet
Unit 1 ADBMS
36 pages
C Programming Language Syllabus
No ratings yet
C Programming Language Syllabus
5 pages
Yahoo Chat Spy
No ratings yet
Yahoo Chat Spy
9 pages
AZ-104T00A Azure Virtual Machines
100% (1)
AZ-104T00A Azure Virtual Machines
36 pages
Sap Net Weaver Business Intelligence Overview
No ratings yet
Sap Net Weaver Business Intelligence Overview
54 pages
Chapter 5 Structure in C++ Programming
No ratings yet
Chapter 5 Structure in C++ Programming
27 pages
HTTP Proxy Service - Zentyal 5
No ratings yet
HTTP Proxy Service - Zentyal 5
13 pages
Questions: 1: Answer: C, E
No ratings yet
Questions: 1: Answer: C, E
5 pages
10g Segment Space Management Guide
No ratings yet
10g Segment Space Management Guide
8 pages
Winbond Data Sheet
No ratings yet
Winbond Data Sheet
75 pages
List of Programs For Practical File - XII: Visit To Website
No ratings yet
List of Programs For Practical File - XII: Visit To Website
2 pages
Zabbix Gammu SMSD Installation Guide Z-Afshar - Zabbix-Gammu-Smsd Wiki GitHub PDF
No ratings yet
Zabbix Gammu SMSD Installation Guide Z-Afshar - Zabbix-Gammu-Smsd Wiki GitHub PDF
1 page
WebLogic Server Configuration Backup
No ratings yet
WebLogic Server Configuration Backup
3 pages
LEcture 14 Assembly Process and Modular Programming
No ratings yet
LEcture 14 Assembly Process and Modular Programming
23 pages
Megabank Access and Security Training
No ratings yet
Megabank Access and Security Training
52 pages
Data Engineer Resume for Gilang Evandyano
No ratings yet
Data Engineer Resume for Gilang Evandyano
5 pages
IIT 1312 Database Management Assignment
No ratings yet
IIT 1312 Database Management Assignment
4 pages
8 PLC Basics
0% (1)
8 PLC Basics
140 pages
Dot Net Framework PDF
100% (2)
Dot Net Framework PDF
241 pages
How To Compress Zip Files in Java
No ratings yet
How To Compress Zip Files in Java
16 pages
NetApp E2600 - 1051FG000044
No ratings yet
NetApp E2600 - 1051FG000044
1 page
Free Fire Max Cheat Guide
No ratings yet
Free Fire Max Cheat Guide
2 pages
APC Switched PDU - User Manual
100% (1)
APC Switched PDU - User Manual
125 pages
SQL Server Interview Questions 50
No ratings yet
SQL Server Interview Questions 50
2 pages
C Programming Practical Exercises
No ratings yet
C Programming Practical Exercises
8 pages
Introduction To AWS DynamoDB
No ratings yet
Introduction To AWS DynamoDB
8 pages
B-Symantec System Recovery 21178625 DS - En-Us
No ratings yet
B-Symantec System Recovery 21178625 DS - En-Us
4 pages

Data Analytic Assignment

Uploaded by

Data Analytic Assignment

Uploaded by

shishir-data-analytic-assignment

November 14, 2023

link text Assignment-2 for Batch A ____________________________________________

[18]: from google.colab import drive

Drive already mounted at /content/drive; to attempt to forcibly remount, call

[19]: # Importing the numpy library

# 1st--> Create an array of five elements and print its value.

Sum of 3rd and 4th elements: 7

[23]: # 5th--> Get the data type of the array.

"""--------------------- First question finish ------------------"""

Data Type of the array: int64

[23]: '--------------------- First question finish ------------------'

DATA ANALYTIC ASSIGNMENT 1ST QUESTION SUBMITTED BY SHISHIR RANJAN .

(1) Dataset after dropping null values:

[6915 rows x 1 columns]

[6915 rows x 1 columns]

(3) Filtered values where SO2 > 500:

(4) Dataset after dropping duplicate values:

[4941 rows x 1 columns]

(5) Correlation matrix:

[28]: '--------------------- Second question finish ------------------'

DATA ANALYTIC ASSIGNMENT 2ND QUESTION COMPLETED .

You might also like