Lab Manual: Foundation of Data Science
Authors: J. Daphney Joann, M. Balasubramaniam
Experiment 1: Introduction to Python for Data Science
Aim:
To understand and implement basic Python programming constructs used in data science.
Algorithm:
1. Start Python IDE or Jupyter Notebook.
2. Create a Python program with basic syntax: input, output, loops.
3. Define variables and perform operations.
4. Run the program and observe the output.
Code:
# Basic Python program for input, loop and output
name = input("Enter your name: ")
print("Welcome", name)
print("Looping from 0 to 4:")
for i in range(5):
    print("Iteration", i)
Output:
Enter your name: Daphney
Welcome Daphney
Looping from 0 to 4:
Iteration 0
Iteration 1
Iteration 2
Iteration 3
Iteration 4
Result:
The basic Python constructs such as input, loops, and print statements were successfully
executed.
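Step 3 of the algorithm calls for defining variables and performing operations, which the program above does not show; a minimal sketch (the names x and y are illustrative):

```python
# variables and basic arithmetic operations
x = 7
y = 3
print("Sum:", x + y)          # 10
print("Quotient:", x / y)     # true division always yields a float
print("Floor div:", x // y)   # 2
```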
Experiment 2: Data Structures in Python (List, Tuple, Dictionary)
Aim:
To learn and apply data structures like List, Tuple, and Dictionary in Python.
Algorithm:
1. Initialize a list, tuple, and dictionary with sample values.
2. Perform operations like accessing, slicing, and updating.
3. Print the results to understand behavior.
Code:
# List operations
fruits = ["apple", "banana", "cherry"]
print("Fruits list:", fruits)
fruits.append("orange")
print("Updated list:", fruits)
# Tuple operations
days = ("Mon", "Tue", "Wed")
print("Days tuple:", days)
# Dictionary operations
student = {"name": "John", "age": 21, "course": "Data Science"}
print("Student dictionary:", student)
print("Student Name:", student["name"])
Output:
Fruits list: ['apple', 'banana', 'cherry']
Updated list: ['apple', 'banana', 'cherry', 'orange']
Days tuple: ('Mon', 'Tue', 'Wed')
Student dictionary: {'name': 'John', 'age': 21, 'course': 'Data Science'}
Student Name: John
Result:
List, Tuple, and Dictionary were implemented successfully and their properties were
demonstrated.
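The algorithm above also mentions accessing and slicing, which the code does not show; a short sketch reusing the same sample values:

```python
fruits = ["apple", "banana", "cherry", "orange"]
print(fruits[1:3])     # slicing returns a new list
days = ("Mon", "Tue", "Wed")
print(days[0])         # tuples support indexing but are immutable
student = {"name": "John", "age": 21}
student["age"] = 22    # dictionaries support in-place updates
print(student["age"])
```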
Experiment 3: NumPy Basics: Arrays and Vectorized Computations
Aim:
To learn how to use NumPy for array operations and vectorized computations.
Algorithm:
1. Import numpy as np.
2. Create arrays using numpy.
3. Perform basic arithmetic and vectorized operations.
4. Print and interpret results.
Code:
import numpy as np
a = np.array([1, 2, 3])
b = np.array([4, 5, 6])
print("Array a:", a)
print("Array b:", b)
print("Sum:", a + b)
print("Product:", a * b)
Output:
Array a: [1 2 3]
Array b: [4 5 6]
Sum: [5 7 9]
Product: [ 4 10 18]
Result:
NumPy arrays and basic vectorized operations were demonstrated successfully.
Experiment 4: Data Manipulation using Pandas
Aim:
To perform data manipulation using pandas DataFrame.
Algorithm:
1. Import pandas as pd.
2. Create a DataFrame.
3. Perform operations like adding, updating, and deleting data.
4. Display the results.
Code:
import pandas as pd
data = {'Name': ['Alice', 'Bob'], 'Age': [24, 27]}
df = pd.DataFrame(data)
print("Original DataFrame:")
print(df)
df['Age'] = df['Age'] + 1
print("Updated DataFrame:")
print(df)
Output:
Original DataFrame:
Name Age
0 Alice 24
1 Bob 27
Updated DataFrame:
Name Age
0 Alice 25
1 Bob 28
Result:
Data was successfully manipulated using pandas DataFrame.
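The algorithm also lists adding and deleting data, which the code above does not show; a minimal sketch (the City column is illustrative):

```python
import pandas as pd

df = pd.DataFrame({'Name': ['Alice', 'Bob'], 'Age': [24, 27]})
df['City'] = ['Paris', 'London']   # add a new column
print(df)
df = df.drop(columns=['City'])     # delete the column
df = df.drop(index=1)              # delete a row by index label
print(df)
```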
Experiment 5: Data Visualization using Matplotlib and Seaborn
Aim:
To visualize data using matplotlib and seaborn libraries.
Algorithm:
1. Import required libraries.
2. Prepare data for plotting.
3. Use matplotlib and seaborn to create graphs.
4. Display the plots.
Code:
import matplotlib.pyplot as plt
import seaborn as sns
data = [5, 10, 15, 20]
# Line plot with matplotlib
plt.plot(data)
plt.title("Line Plot")
plt.show()
# Bar plot with seaborn
sns.barplot(x=list(range(len(data))), y=data)
plt.title("Bar Plot")
plt.show()
Output:
A matplotlib line plot and a seaborn bar plot are displayed, showing the trend and magnitudes of the values.
Result:
Data visualization using matplotlib and seaborn was successfully implemented.
Experiment 6: Descriptive Statistics and Data Summary
Aim:
To compute summary statistics of a dataset.
Algorithm:
1. Import pandas.
2. Create a DataFrame with numerical values.
3. Use describe() to generate summary.
4. Print the result.
Code:
import pandas as pd
data = {'Score': [88, 92, 79, 93, 85]}
df = pd.DataFrame(data)
print(df.describe())
Output:
Score
count 5.000000
mean 87.400000
std 5.683309
min 79.000000
25% 85.000000
50% 88.000000
75% 92.000000
max 93.000000
Result:
Descriptive statistics were calculated successfully using pandas.
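The describe() summary can be cross-checked by computing individual statistics directly; a short sketch using the same scores:

```python
import pandas as pd

scores = pd.Series([88, 92, 79, 93, 85])
print("Mean:", scores.mean())      # 87.4
print("Median:", scores.median())  # 88.0
print("Std:", scores.std())        # sample standard deviation (ddof=1)
```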
Experiment 7: Handling Missing Data and Data Cleaning
Aim:
To handle and clean missing data using pandas.
Algorithm:
1. Import pandas.
2. Create a DataFrame with missing values.
3. Use functions like fillna() and dropna().
4. Observe changes.
Code:
import pandas as pd
data = {'Name': ['Alice', 'Bob', None], 'Age': [25, None, 30]}
df = pd.DataFrame(data)
print("Original Data:")
print(df)
df_clean = df.fillna({'Name': 'Unknown', 'Age': df['Age'].mean()})
print("Cleaned Data:")
print(df_clean)
Output:
Original Data:
Name Age
0 Alice 25.0
1 Bob NaN
2 None 30.0
Cleaned Data:
Name Age
0 Alice 25.0
1 Bob 27.5
2 Unknown 30.0
Result:
Missing data was handled using fillna and replaced with default values.
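Step 3 of the algorithm also names dropna(), which the code above does not use; a minimal sketch on the same data:

```python
import pandas as pd

data = {'Name': ['Alice', 'Bob', None], 'Age': [25, None, 30]}
df = pd.DataFrame(data)
# dropna() removes every row that contains any missing value
print(df.dropna())
# subset= limits the missing-value check to chosen columns
print(df.dropna(subset=['Age']))
```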
Experiment 8: Grouping, Merging and Aggregation with Pandas
Aim:
To perform grouping and merging operations on data using pandas.
Algorithm:
1. Create two DataFrames.
2. Merge them using merge().
3. Group the merged data and aggregate.
4. Display results.
Code:
import pandas as pd
df1 = pd.DataFrame({'ID': [1, 2], 'Name': ['Alice', 'Bob']})
df2 = pd.DataFrame({'ID': [1, 2], 'Score': [85, 90]})
merged = pd.merge(df1, df2, on='ID')
print("Merged DataFrame:")
print(merged)
grouped = merged.groupby('Name').mean()
print("Grouped by Name:")
print(grouped)
Output:
Merged DataFrame:
ID Name Score
0 1 Alice 85
1 2 Bob 90
Grouped by Name:
ID Score
Name
Alice 1.0 85.0
Bob 2.0 90.0
Result:
Grouping and merging of data was demonstrated successfully.
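Beyond mean(), groupby supports several aggregations in one call via agg(); a brief sketch (the Dept column and its values are illustrative):

```python
import pandas as pd

df = pd.DataFrame({'Dept': ['CS', 'CS', 'EE'], 'Score': [85, 90, 78]})
# agg() applies multiple aggregation functions to each group at once
summary = df.groupby('Dept')['Score'].agg(['mean', 'max', 'count'])
print(summary)
```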
Experiment 9: Introduction to Data Preprocessing Techniques
Aim:
To apply preprocessing techniques like normalization and encoding.
Algorithm:
1. Create sample data.
2. Apply normalization using sklearn.
3. Apply encoding if necessary.
4. Print the results.
Code:
from sklearn.preprocessing import MinMaxScaler
import pandas as pd
data = {'Marks': [50, 80, 100]}
df = pd.DataFrame(data)
scaler = MinMaxScaler()
df[['Marks']] = scaler.fit_transform(df[['Marks']])
print(df)
Output:
Marks
0 0.00
1 0.75
2 1.00
Result:
Data normalization was applied successfully using sklearn.
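The algorithm also mentions encoding, which the code above does not show; a minimal sketch of one-hot encoding with pandas get_dummies (the Grade values are illustrative; sklearn's OneHotEncoder is an alternative):

```python
import pandas as pd

df = pd.DataFrame({'Grade': ['A', 'B', 'A']})
# get_dummies creates one indicator column per category
encoded = pd.get_dummies(df, columns=['Grade'])
print(encoded)
```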
Experiment 10: Mini Project / Case Study on Exploratory Data Analysis (EDA)
Aim:
To explore a dataset using EDA techniques and summarize insights.
Algorithm:
1. Load dataset using pandas.
2. Visualize data distributions.
3. Use describe(), info(), value_counts().
4. Document key insights.
Code:
import pandas as pd
df = pd.read_csv('sample.csv')
df.info()
print(df.describe())
print(df['Category'].value_counts())
Output:
Displays dataset info, summary statistics, and category distribution.
Result:
EDA was performed successfully and insights were derived.
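Since sample.csv is not provided with this manual, the steps above can be tried on an inline stand-in dataset; the Category and Value columns below are illustrative:

```python
import pandas as pd

# inline stand-in for sample.csv
df = pd.DataFrame({
    'Category': ['A', 'B', 'A', 'A', 'C'],
    'Value': [10, 25, 15, 30, 20],
})
df.info()                             # column dtypes and non-null counts
print(df.describe())                  # summary statistics for numeric columns
print(df['Category'].value_counts())  # distribution of categories
```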