0% found this document useful (0 votes)

40 views4 pages

FML Lab 1

This document introduces three fundamental Python libraries for data analysis: Pandas, NumPy, and Matplotlib. It provides an example lab using the Iris dataset to demonstrate basic operations with each library, including loading and viewing the data, calculating statistics, and creating visualizations to analyze relationships between variables. The lab contains tasks to generate a correlation matrix and scatter plot matrix to further understand variable correlations. Exercises are included to analyze the visualizations and load another dataset.

Uploaded by

Oumaima Ziat

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

40 views4 pages

FML Lab 1

Uploaded by

Oumaima Ziat

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Fundamentals of Machine Learning

Hakim Hafidi & Youness Moukafih

Lab 1: Introduction to Python Libraries -

Pandas, NumPy, and Matplotlib
Objective:
This lab aims to introduce you to three fundamental Python libraries, Pandas, NumPy,
and Matplotlib, used in data analysis. By the end of this lab, you should be able to load a
dataset, perform basic operations, and create visualizations to understand the
relationships between different variables in the dataset.

Prerequisites:
• Basic knowledge of Python programming language.
• Anaconda installed on your computer.

Step 1: Installing Anaconda

If you haven't installed Anaconda yet, please follow the instructions below:
1. Download Anaconda from Anaconda Individual Edition.
2. Follow the installation instructions for your operating system: Anaconda
Installation Guide.

Step 2: Setting Up Jupyter Notebook

1. Open Anaconda Navigator.
2. Launch Jupyter Notebook.
3. Create a new Python notebook.

1
Fundamentals of Machine Learning

Introduction to Pandas, NumPy, and Matplotlib

Pandas
Pandas is a powerful library for data analysis and manipulation.
# Importing Pandas Library
import pandas as pd

NumPy
NumPy supports large, multi-dimensional arrays and matrices and mathematical
functions to operate on these arrays.
# Importing NumPy Library
import numpy as np

Matplotlib
Matplotlib is a plotting library for creating static, animated, and interactive visualizations
in Python.
# Importing Matplotlib Library
import matplotlib.pyplot as plt

Lab Tasks:
Task 1: Load a Dataset
Load the 'Iris' dataset from the UCI Machine Learning Repository.
# Loading Iris Dataset
url = "https://archive.ics.uci.edu/ml/machine-learning-databases/iris/iris.dat
a"
column_names = ["sepal_length", "sepal_width", "petal_length", "petal_width",
"class"]
iris = pd.read_csv(url, names=column_names)

2
Fundamentals of Machine Learning

Task 2: View the Dataset

View the first 5 rows of the dataset to understand the data.
# Viewing first 5 rows of Iris Dataset
iris.head()

Task 3: Basic Operations

Calculate the average, median, and standard deviation of the 'sepal_length' column.
# Calculating the average of 'sepal_length'
average_sepal_length = iris['sepal_length'].mean()
print(f"Average Sepal Length: {average_sepal_length}")

# Calculating the median of 'sepal_length'

median_sepal_length = iris['sepal_length'].median()
print(f"Median Sepal Length: {median_sepal_length}")

# Calculating the standard deviation of 'sepal_length'

std_dev_sepal_length = iris['sepal_length'].std()
print(f"Standard Deviation of Sepal Length: {std_dev_sepal_length}")

Task 4: Data Visualization

Create scatter plots to visualize the relationships between 'sepal_length' and
'sepal_width', and between 'petal_length' and 'petal_width'.
# Creating Scatter Plot for 'sepal_length' and 'sepal_width'
plt.scatter(iris['sepal_length'], iris['sepal_width'])
plt.title('Sepal Length vs Sepal Width')
plt.xlabel('Sepal Length (cm)')
plt.ylabel('Sepal Width (cm)')
plt.show()

# Creating Scatter Plot for 'petal_length' and 'petal_width'

plt.scatter(iris['petal_length'], iris['petal_width'])
plt.title('Petal Length vs Petal Width')
plt.xlabel('Petal Length (cm)')
plt.ylabel('Petal Width (cm)')
plt.show()

3
Fundamentals of Machine Learning

Enhanced Visualization and Analysis Tasks:

Task 5: Correlation Matrix
Create a correlation matrix to understand the linear relationship between the different
variables in the dataset.
# Creating Correlation Matrix
correlation_matrix = iris.corr()
print(correlation_matrix)

Task 6: Scatter Plot Matrix

Create a scatter plot matrix to visualize the relationships between all pairs of variables.
# Creating Scatter Plot Matrix
pd.plotting.scatter_matrix(iris, alpha=0.8, figsize=(10, 10), diagonal='hist')
plt.show()

Exercises:
1. Exercise 1: Analyze the correlation matrix and scatter plot matrix. Answer the
following questions: a. Is there a relationship between 'sepal_length' and
'sepal_width'? b. Is the relationship between 'petal_length' and 'petal_width'
positive or negative? c. Which pair of variables has the strongest relationship?
2. Exercise 2: Create a scatter plot for 'petal_length' and 'petal_width'. Based on
the plot, hypothesize whether there is any association between the two variables
and whether the association is positive or negative.
3. Exercise 3: Load another dataset of your choice and perform similar operations
and visualizations to understand the relationships between the variables. Answer
questions about the relationships between the variables based on the
visualizations.

Submission:
Submit the Jupyter notebook containing all the executed cells along with the outputs and
your answers to the exercise questions.

PR Final File
No ratings yet
PR Final File
49 pages
ML Lab File
No ratings yet
ML Lab File
43 pages
Python For AIML2
No ratings yet
Python For AIML2
21 pages
3-Numpy Pandas
No ratings yet
3-Numpy Pandas
37 pages
Roadmap
No ratings yet
Roadmap
27 pages
ML with Python: Data Visualization Guide
No ratings yet
ML with Python: Data Visualization Guide
7 pages
AIML Short Term Internship Session 9 Summary-1719044709410
No ratings yet
AIML Short Term Internship Session 9 Summary-1719044709410
14 pages
Lab 02 - Introduction To Pandas
No ratings yet
Lab 02 - Introduction To Pandas
6 pages
An Example Machine Learning Notebook
No ratings yet
An Example Machine Learning Notebook
28 pages
DAP 5 Module
No ratings yet
DAP 5 Module
68 pages
Ads Exp 3
No ratings yet
Ads Exp 3
7 pages
Python Code for Central Tendency
No ratings yet
Python Code for Central Tendency
28 pages
Content From Jose Portilla's Udemy Course Learning Python For Data Analysis and Visualization Notes by Michael Brothers, Available On
No ratings yet
Content From Jose Portilla's Udemy Course Learning Python For Data Analysis and Visualization Notes by Michael Brothers, Available On
13 pages
Tutorial 1
No ratings yet
Tutorial 1
5 pages
Dsa Lab Record (Ai&Ds)
No ratings yet
Dsa Lab Record (Ai&Ds)
34 pages
ML Exp
No ratings yet
ML Exp
9 pages
Aids Lab
No ratings yet
Aids Lab
45 pages
Lec 19
No ratings yet
Lec 19
14 pages
Mat Plot Lib
No ratings yet
Mat Plot Lib
12 pages
ML3 Data Analysis
No ratings yet
ML3 Data Analysis
80 pages
ML Lab Manual Completed
No ratings yet
ML Lab Manual Completed
56 pages
Unit 5 PythonPackages (Matplotlib)
No ratings yet
Unit 5 PythonPackages (Matplotlib)
24 pages
DSF Lab
No ratings yet
DSF Lab
14 pages
ML Lab Manual With Statistical Formulas
No ratings yet
ML Lab Manual With Statistical Formulas
9 pages
Machine Learning Experiment
No ratings yet
Machine Learning Experiment
69 pages
One-Day Intensive Python Data Analysis and Visuali
No ratings yet
One-Day Intensive Python Data Analysis and Visuali
6 pages
PR Final File
No ratings yet
PR Final File
70 pages
Lab Manual ML R22
No ratings yet
Lab Manual ML R22
27 pages
Week 3
No ratings yet
Week 3
10 pages
Unit 3
No ratings yet
Unit 3
19 pages
Introduction To Python (Part III)
No ratings yet
Introduction To Python (Part III)
29 pages
Practical Labs Guide
No ratings yet
Practical Labs Guide
34 pages
DMKD External Exam Answers
No ratings yet
DMKD External Exam Answers
12 pages
Visualization in Python
No ratings yet
Visualization in Python
2 pages
Unit II Lecturer Notes
No ratings yet
Unit II Lecturer Notes
28 pages
Essential Python Data Visualization Libraries 1687141550
No ratings yet
Essential Python Data Visualization Libraries 1687141550
16 pages
Basic Plotting With Seaborn
No ratings yet
Basic Plotting With Seaborn
6 pages
NumPy and Pandas Basics in Python
No ratings yet
NumPy and Pandas Basics in Python
9 pages
Week6 Matplotlib
No ratings yet
Week6 Matplotlib
5 pages
AI Lab4
No ratings yet
AI Lab4
25 pages
Advanced Matplotlib for Data Visualization
No ratings yet
Advanced Matplotlib for Data Visualization
54 pages
Unit 4
No ratings yet
Unit 4
27 pages
Unit 5
No ratings yet
Unit 5
28 pages
DMV Unit-4-1 PDF
100% (1)
DMV Unit-4-1 PDF
10 pages
Unit 5 Python Notes HM
No ratings yet
Unit 5 Python Notes HM
59 pages
Ex1 - Plotting and Visualization Using Numpy and Pandas
No ratings yet
Ex1 - Plotting and Visualization Using Numpy and Pandas
14 pages
DXV Guidelines
No ratings yet
DXV Guidelines
3 pages
ML Lab Manual (Upto Cie-1)
No ratings yet
ML Lab Manual (Upto Cie-1)
33 pages
Matplotlib Guide for Engineers
No ratings yet
Matplotlib Guide for Engineers
4 pages
Data Science and Analtics Laboratory
No ratings yet
Data Science and Analtics Laboratory
21 pages
15octmatplotlib 2024
No ratings yet
15octmatplotlib 2024
4 pages
Jetlearn Practice - Dimitrina Grazhdani-JL9124415155
No ratings yet
Jetlearn Practice - Dimitrina Grazhdani-JL9124415155
62 pages
DA&V Module 6 (SAMI)
No ratings yet
DA&V Module 6 (SAMI)
10 pages
Comprehensive Data Visualization With Matplotlib and Seaborn
No ratings yet
Comprehensive Data Visualization With Matplotlib and Seaborn
40 pages
EXP1-siddhant Gupta (23 - SE - 148)
No ratings yet
EXP1-siddhant Gupta (23 - SE - 148)
17 pages
ML Manual
No ratings yet
ML Manual
21 pages
RRL 1
No ratings yet
RRL 1
39 pages
Methods of Joints & Sections
No ratings yet
Methods of Joints & Sections
18 pages
6 Wind Loads
100% (1)
6 Wind Loads
3 pages
M.Tech Lecture: EFI System Benefits
No ratings yet
M.Tech Lecture: EFI System Benefits
4 pages
Online FK Sellers data-JAN-2024
No ratings yet
Online FK Sellers data-JAN-2024
6 pages
Leadership Styles Explained
No ratings yet
Leadership Styles Explained
10 pages
Diffusion Models in Psychology
No ratings yet
Diffusion Models in Psychology
22 pages
Mandarin Lesson 3
No ratings yet
Mandarin Lesson 3
18 pages
Basics of CAD/CAM Explained
No ratings yet
Basics of CAD/CAM Explained
4 pages
Shares and Debentures
No ratings yet
Shares and Debentures
22 pages
Halperin - Russia in The Mongol Empire in Comparative Perspective
No ratings yet
Halperin - Russia in The Mongol Empire in Comparative Perspective
24 pages
Human-Computer Interaction: Dr. Ibrar Hussain Week: 01
No ratings yet
Human-Computer Interaction: Dr. Ibrar Hussain Week: 01
16 pages
Top Admits Fall 25
No ratings yet
Top Admits Fall 25
4 pages
Unit 1
No ratings yet
Unit 1
53 pages
Appendix A-1
No ratings yet
Appendix A-1
1 page
Lesson 5 Measures of Dispersion (Rhea)
No ratings yet
Lesson 5 Measures of Dispersion (Rhea)
25 pages
CTET 6to8 June 2011
No ratings yet
CTET 6to8 June 2011
29 pages
PICTOGRAM
No ratings yet
PICTOGRAM
1 page
Set Items
No ratings yet
Set Items
6 pages
Swarovski Beads Colors & Effects
No ratings yet
Swarovski Beads Colors & Effects
5 pages
Beam Deflection Analysis Techniques
No ratings yet
Beam Deflection Analysis Techniques
5 pages
Mobile-IP Seminar Report
No ratings yet
Mobile-IP Seminar Report
33 pages
POCSO Act: Police & Doctors' Roles
No ratings yet
POCSO Act: Police & Doctors' Roles
2 pages
Skva 1350 A1 Test Sizing Report
No ratings yet
Skva 1350 A1 Test Sizing Report
2 pages
AARS Practice Questions
No ratings yet
AARS Practice Questions
211 pages
Children's Storm Adventure Quiz
No ratings yet
Children's Storm Adventure Quiz
2 pages
B1+ Intermediate Units 9-10 Format B
No ratings yet
B1+ Intermediate Units 9-10 Format B
3 pages
Liu Et Al., - 2024 - When Citizens Support AI Policies
No ratings yet
Liu Et Al., - 2024 - When Citizens Support AI Policies
19 pages
NPCIL Tube Fitting Specifications
100% (1)
NPCIL Tube Fitting Specifications
22 pages
NPKFULLTEXT
No ratings yet
NPKFULLTEXT
34 pages

FML Lab 1

Uploaded by

FML Lab 1

Uploaded by

Fundamentals of Machine Learning

Hakim Hafidi & Youness Moukafih

Lab 1: Introduction to Python Libraries -

Step 1: Installing Anaconda

Step 2: Setting Up Jupyter Notebook

Introduction to Pandas, NumPy, and Matplotlib

Task 2: View the Dataset

Task 3: Basic Operations

# Calculating the median of 'sepal_length'

# Calculating the standard deviation of 'sepal_length'

Task 4: Data Visualization

# Creating Scatter Plot for 'petal_length' and 'petal_width'

Enhanced Visualization and Analysis Tasks:

Task 6: Scatter Plot Matrix

You might also like