9914 ML Lab2

The document outlines a data analysis workflow using Python, focusing on two datasets: handwritten digit recognition and Titanic survival predictions. It includes data loading, preprocessing, model training with Linear Regression, and evaluation using metrics like mean squared error and confusion matrix. Key steps involve splitting data into training and testing sets, fitting models, and visualizing results.


In [ ]: # importing modules and packages

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error, mean_absolute_error
from sklearn import preprocessing

In [ ]: from google.colab import drive


drive.mount('/content/drive')

Mounted at /content/drive

In [ ]: df = pd.read_csv('/content/drive/MyDrive/data/train.csv')
# df.drop('No', inplace=True, axis=1)

print(df.head())

print(df.columns)

   label  pixel0  pixel1  pixel2  ...  pixel780  pixel781  pixel782  pixel783
0      1       0       0       0  ...         0         0         0         0
1      0       0       0       0  ...         0         0         0         0
2      1       0       0       0  ...         0         0         0         0
3      4       0       0       0  ...         0         0         0         0
4      0       0       0       0  ...         0         0         0         0

[5 rows x 785 columns]


Index(['label', 'pixel0', 'pixel1', 'pixel2', 'pixel3', 'pixel4', 'pixel5',
'pixel6', 'pixel7', 'pixel8',
...
'pixel774', 'pixel775', 'pixel776', 'pixel777', 'pixel778', 'pixel779',
'pixel780', 'pixel781', 'pixel782', 'pixel783'],
dtype='object', length=785)

In [ ]: sns.scatterplot(x='pixel0', y='label', data=df)

print(df.shape)
# creating feature variables
y = df['label']
X = df.drop('label', axis=1)

# print(X)
# print(y)

(42000, 785)
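
Each row is one handwritten digit: a label plus 784 pixel intensities, i.e. a flattened 28x28 image. As a quick sanity check, one row can be reshaped and displayed. This is a sketch, not part of the original notebook; it assumes the X and y created in the cell above, and the name first_digit is illustrative.

# Reshape the 784 pixel columns of the first row into a 28x28 image
# and display it with matplotlib (assumes X and y from the cell above).
first_digit = X.iloc[0].to_numpy().reshape(28, 28)
plt.imshow(first_digit, cmap='gray')
plt.title(f"label = {y.iloc[0]}")
plt.show()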

In [ ]: # creating train and test sets


X_train, X_test, y_train, y_test = train_test_split(
X, y, test_size=0.3, random_state=101)

# creating a regression model


model = LinearRegression()

# fitting the model


model.fit(X_train, y_train)
# making predictions
predictions = model.predict(X_test)

# model evaluation
print('mean_squared_error : ', mean_squared_error(y_test, predictions))
print('mean_absolute_error : ', mean_absolute_error(y_test, predictions))

mean_squared_error : 1.1646355758578547e+18
mean_absolute_error : 15509186.729803031
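
The enormous errors are expected here: ordinary least squares on 784 pixel columns, many of which are constant or nearly collinear, is badly ill-conditioned, so the fitted coefficients and the held-out predictions blow up. One way to stabilize the fit is to drop zero-variance pixels and add L2 regularization. The following is a sketch on the same train/test split, not part of the original lab; the names ridge_model and ridge_predictions are illustrative.

from sklearn.feature_selection import VarianceThreshold
from sklearn.linear_model import Ridge
from sklearn.pipeline import make_pipeline

# Drop constant pixel columns, then fit a ridge-regularized regression
# on the same X_train/X_test split used above.
ridge_model = make_pipeline(VarianceThreshold(threshold=0.0), Ridge(alpha=1.0))
ridge_model.fit(X_train, y_train)
ridge_predictions = ridge_model.predict(X_test)
print('ridge mean_squared_error : ', mean_squared_error(y_test, ridge_predictions))
print('ridge mean_absolute_error : ', mean_absolute_error(y_test, ridge_predictions))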

Titanic Predictions
In [ ]: df = pd.read_csv('/content/drive/MyDrive/data/titanic.csv')
df.drop(['passengerid', 'name', 'fare', 'embarked', 'cabin', 'ticket'], inplace=True, axis=1)

print(df.head())

print(df.columns)

   pclass  survived     sex      age  sibsp  parch
0       1         1  female  29.0000      0      0
1       1         1    male   0.9167      1      2
2       1         0  female   2.0000      1      2
3       1         0    male  30.0000      1      2
4       1         0  female  25.0000      1      2
Index(['pclass', 'survived', 'sex', 'age', 'sibsp', 'parch'], dtype='object')

In [ ]: df = df.fillna(0)
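
Filling the missing ages with 0 pulls the age distribution toward zero; a common alternative is to impute the column median instead. The following is a sketch of that alternative, not what the lab does; it would replace the fillna(0) cell above rather than follow it.

# Impute missing ages with the median age rather than 0
# (alternative to the fillna(0) call above; run instead of that cell).
df['age'] = df['age'].fillna(df['age'].median())
df = df.fillna(0)  # fill any remaining missing values in other columns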

In [ ]: df = df.replace({'sex': {'male': 0, 'female': 1}})
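
The preprocessing module imported at the top of the notebook is otherwise unused; it offers an equivalent way to encode the sex column. The following is a sketch of that alternative, to be run instead of the replace() cell above, not after it; note that LabelEncoder assigns codes alphabetically, so 'female' becomes 0 and 'male' becomes 1, the reverse of the manual mapping, and the name le is illustrative.

# Encode 'sex' with LabelEncoder from the sklearn preprocessing module.
le = preprocessing.LabelEncoder()
df['sex'] = le.fit_transform(df['sex'])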

In [ ]: y = df['survived']
X = df.drop('survived', axis=1)
# creating train and test sets
X_train, X_test, y_train, y_test = train_test_split(
X, y, test_size=0.3, random_state=101)

# creating a regression model


model = LinearRegression()

# fitting the model


model.fit(X_train, y_train)

# making predictions
predictions = model.predict(X_test)

# model evaluation
print('mean_squared_error : ', mean_squared_error(y_test, np.round(predictions)))
print('mean_absolute_error : ', mean_absolute_error(y_test, np.round(predictions)))

mean_squared_error : 0.22137404580152673
mean_absolute_error : 0.22137404580152673
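
Because the predictions are rounded to 0/1 before scoring, the squared and absolute errors coincide and both equal the misclassification rate, so accuracy is roughly 1 - 0.2214 ≈ 0.78. A quick check, assuming the same y_test and predictions as above:

from sklearn.metrics import accuracy_score

# Accuracy of the rounded regression outputs treated as class labels.
print('accuracy : ', accuracy_score(y_test, np.round(predictions)))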

In [ ]: def plot_confusion_matrix(cm, classes, normalize=False,
                                  title='Confusion matrix', cmap=plt.cm.Blues):
            """
            This function prints and plots the confusion matrix.
            Normalization can be applied by setting `normalize=True`.
            """
            # Normalize before plotting so the image and the cell text agree.
            if normalize:
                cm = cm.astype('float') / cm.sum(axis=1)[:, np.newaxis]
                print("Normalized confusion matrix")
            else:
                print('Confusion matrix, without normalization')

            # print(cm)

            plt.imshow(cm, interpolation='nearest', cmap=cmap)
            plt.title(title)
            plt.colorbar()
            tick_marks = np.arange(len(classes))
            plt.xticks(tick_marks, classes, rotation=45)
            plt.yticks(tick_marks, classes)

            # Write each count into its cell, switching text colour for contrast.
            thresh = cm.max() / 2.
            for i, j in itertools.product(range(cm.shape[0]), range(cm.shape[1])):
                plt.text(j, i, cm[i, j],
                         horizontalalignment="center",
                         color="white" if cm[i, j] > thresh else "black")

            plt.tight_layout()
            plt.ylabel('True label')
            plt.xlabel('Predicted label')

In [ ]: predictions=np.round(predictions)
# print(y_test)

In [ ]: from sklearn.metrics import confusion_matrix


import itertools
cm = confusion_matrix(y_true=y_test, y_pred=predictions)
cm_plot_labels = ["Met Rose","Met Jack"]
plot_confusion_matrix(cm=cm, classes=cm_plot_labels, title='Confusion Matrix')

Confusion matrix, without normalization
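
For this binary problem, confusion_matrix puts true labels on the rows and predicted labels on the columns, so the four counts can be unpacked directly. A short follow-up sketch, assuming the same cm computed above:

# Unpack the 2x2 confusion matrix: rows are true labels, columns are predictions.
tn, fp, fn, tp = cm.ravel()
print('accuracy :', (tp + tn) / cm.sum())
print('precision:', tp / (tp + fp))
print('recall   :', tp / (tp + fn))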
