Import As Import As From Import From Import From Import From Import

The document contains code for implementing and evaluating various classification models using Logistic Regression and Support Vector Machine (SVM) on synthetic datasets. It includes metrics such as precision, recall, F1-score, and confusion matrices, along with visualizations of precision-recall curves. Additionally, it explores the impact of class weighting and optimizing decision thresholds on model performance.

Uploaded by

Catherine Shendre

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views6 pages

Import As Import As From Import From Import From Import From Import

Uploaded by

Catherine Shendre

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

You are on page 1/ 6

QUESTION 1:

import numpy as np
import matplotlib.pyplot as plt
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import precision_score, recall_score, f1_score,
classification_report, confusion_matrix
X, y = make_classification(n_samples=5000, n_features=20, n_classes=2,
weights=[0.95, 0.05], flip_y=0.01, random_state=42)

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,

random_state=42)
model = LogisticRegression()
model.fit(X_train, y_train)
y_pred = model.predict(X_test)
precision = precision_score(y_test, y_pred)
recall = recall_score(y_test, y_pred)
f1 = f1_score(y_test, y_pred)
print("Precision:", precision)
print("Recall:", recall)
print("F1-score:", f1)
print("\nClassification Report:\n", classification_report(y_test, y_pred))

Precision: 0.72
Recall: 0.32142857142857145
F1-score: 0.4444444444444444

Classification Report:
precision recall f1-score support

0 0.96 0.99 0.98 944

1 0.72 0.32 0.44 56

accuracy 0.95 1000

macro avg 0.84 0.66 0.71 1000
weighted avg 0.95 0.95 0.95 1000

# Confusion Matrix
cm = confusion_matrix(y_test, y_pred)
print("\nConfusion Matrix:\n", cm)
import seaborn as sns
plt.figure(figsize=(6,4))
sns.heatmap(cm, annot=True, fmt='d', cmap='Blues', xticklabels=['Not Fraud',
'Fraud'], yticklabels=['Not Fraud',
'Fraud'])
plt.xlabel("Predicted Label")
plt.ylabel("True Label")
plt.title("Confusion Matrix")
plt.show()

Confusion Matrix:
[[937 7]
[ 38 18]]
QUESTION 2:
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC
from sklearn.metrics import precision_recall_curve, classification_report

X, y = make_classification(n_samples=5000, n_features=20, n_classes=2,

weights=[0.9, 0.1], flip_y=0.01, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,
random_state=42)
svm_model = SVC(kernel='linear', probability=True)
svm_model.fit(X_train, y_train)
y_scores = svm_model.predict_proba(X_test)[:, 1]
precision, recall, thresholds = precision_recall_curve(y_test, y_scores)
plt.figure(figsize=(8,6))
plt.plot(recall, precision, marker='.', label="Precision-Recall Curve")
plt.xlabel("Recall")
plt.ylabel("Precision")
plt.title("Precision-Recall Curve for SVM")
plt.legend()
plt.grid()
plt.show()
threshold = 0.5
y_pred = (y_scores >= threshold).astype(int)
print(f"\nClassification Report at threshold={threshold}:\n")
print(classification_report(y_test, y_pred))

plt.figure(figsize=(8,6))
plt.plot(thresholds, precision[:-1], label="Precision")
plt.plot(thresholds, recall[:-1], label="Recall")
plt.xlabel("Decision Threshold")
plt.ylabel("Score")
plt.title("Precision and Recall vs Threshold")
plt.legend()
plt.grid()
plt.show()
Classification Report at threshold=0.5:
precision recall f1-score support
0 0.95 0.98 0.96 908
1 0.66 0.46 0.54 92
accuracy 0.93 1000
macro avg 0.80 0.72 0.75 1000
weighted avg 0.92 0.93 0.92 1000
QUESTION 3:
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import precision_recall_curve, classification_report,
accuracy_score

X, y = make_classification(n_samples=5000, n_features=20, n_classes=2,

weights=[0.9, 0.1], flip_y=0.01, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,
random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,
random_state=42)
logreg_baseline = LogisticRegression(random_state=42)
logreg_baseline.fit(X_train, y_train)
y_pred_baseline = logreg_baseline.predict(X_test)
y_prob_baseline = logreg_baseline.predict_proba(X_test)[:, 1]
precision, recall, _ = precision_recall_curve(y_test, y_prob_baseline)
print("BASELINE MODEL (Logistic Regression):")
print(f"Accuracy: {accuracy_score(y_test, y_pred_baseline):.4f}")
print(classification_report(y_test, y_pred_baseline))
plt.figure(figsize=(8, 6))
plt.plot(recall, precision, marker='.', label="Baseline Model")
plt.xlabel("Recall")
plt.ylabel("Precision")
plt.title("Precision-Recall Curve for Logistic Regression")
plt.legend()
plt.grid()
plt.show()

BASELINE MODEL (Logistic Regression):

Accuracy: 0.9280
precision recall f1-score support
0 0.95 0.97 0.96 908
1 0.64 0.49 0.56 92
accuracy 0.93 1000
macro avg 0.80 0.73 0.76 1000
weighted avg 0.92 0.93 0.92 1000
# CLASS-WEIGHTED MODEL
logreg_weighted = LogisticRegression(class_weight='balanced',
random_state=42)
logreg_weighted.fit(X_train, y_train)
y_pred_weighted = logreg_weighted.predict(X_test)
y_prob_weighted = logreg_weighted.predict_proba(X_test)[:, 1]
# Compute Precision-Recall Curve
precision_w, recall_w, _ = precision_recall_curve(y_test, y_prob_weighted)
# Evaluate Weighted Model
print("MODEL WITH CLASS WEIGHTING:")
print(f"Accuracy: {accuracy_score(y_test, y_pred_weighted):.4f}")
print(classification_report(y_test, y_pred_weighted))

MODEL WITH CLASS WEIGHTING:

Accuracy: 0.8410
precision recall f1-score support
0 0.98 0.84 0.91 908
1 0.35 0.85 0.50 92
accuracy 0.84 1000
macro avg 0.67 0.84 0.70 1000
weighted avg 0.92 0.84 0.87 1000

plt.figure(figsize=(8, 6))
plt.plot(recall, precision, marker='.', label="Baseline Model")
plt.plot(recall_w, precision_w, marker='.', linestyle='dashed', label="Class
Weighted Model")
plt.xlabel("Recall")
plt.ylabel("Precision")
plt.title("Comparison of Precision-Recall Curves")
plt.legend()
plt.grid()
plt.show()

QUESTION 4:
import numpy as np
import matplotlib.pyplot as plt
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import precision_recall_curve, classification_report,
accuracy_score
X, y = make_classification(n_samples=5000, n_features=20, n_classes=2,
weights=[0.9, 0.1], flip_y=0.01, random_state=42)
# Split data into training and test sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,
random_state=42)

model = LogisticRegression(random_state=42)
model.fit(X_train, y_train)
y_scores = model.predict_proba(X_test)[:, 1]

precision, recall, thresholds = precision_recall_curve(y_test, y_scores)

best_idx = np.argmax(precision) # Index of maximum precision
best_threshold = thresholds[best_idx]
print(f"Optimal Decision Threshold for Maximum Precision:
{best_threshold:.4f}")
y_pred_optimized = (y_scores >= best_threshold).astype(int)

print("Model Evaluation with Optimized Threshold:")

print(classification_report(y_test, y_pred_optimized))

Optimal Decision Threshold for Maximum Precision: 0.9439

Model Evaluation with Optimized Threshold:
precision recall f1-score support
0 0.92 1.00 0.96 908
1 1.00 0.09 0.16 92
accuracy 0.92 1000
macro avg 0.96 0.54 0.56 1000
weighted avg 0.92 0.92 0.88 1000

plt.figure(figsize=(8, 6))
plt.plot(recall, precision, marker='.', label="Precision-Recall Curve")
plt.scatter(recall[best_idx], precision[best_idx], marker='o', color='red',
label="Optimal Point")
plt.xlabel("Recall")
plt.ylabel("Precision")
plt.title("Precision-Recall Curve with Optimal Threshold")
plt.legend()
plt.grid()
plt.show()

Import As Import As From Import From Import From Import From Import
No ratings yet
Import As Import As From Import From Import From Import From Import
4 pages
Ann Experiential Learning
No ratings yet
Ann Experiential Learning
43 pages
Classification Techniques in Python
No ratings yet
Classification Techniques in Python
30 pages
22se02cs039 DS P-11
No ratings yet
22se02cs039 DS P-11
10 pages
Titanic Data Analysis with Python
No ratings yet
Titanic Data Analysis with Python
20 pages
Logistic Regression vs SVM Analysis
No ratings yet
Logistic Regression vs SVM Analysis
7 pages
ML Lab Manual
No ratings yet
ML Lab Manual
17 pages
Da 012307
No ratings yet
Da 012307
8 pages
Deep Learningexp4
No ratings yet
Deep Learningexp4
4 pages
23BCE7092 ML Lab Assignment
No ratings yet
23BCE7092 ML Lab Assignment
14 pages
Binary Classifier Evaluation Guide
No ratings yet
Binary Classifier Evaluation Guide
12 pages
ADS - Phase 3
No ratings yet
ADS - Phase 3
34 pages
Ai Lab PRGM
No ratings yet
Ai Lab PRGM
10 pages
Code Examples for ML Models
No ratings yet
Code Examples for ML Models
6 pages
Import As Import As Import As: "Default - CSV"
No ratings yet
Import As Import As Import As: "Default - CSV"
9 pages
Assignment 3
No ratings yet
Assignment 3
8 pages
Professional Machine Learning
No ratings yet
Professional Machine Learning
67 pages
AIML Week7 Week8 Week9
No ratings yet
AIML Week7 Week8 Week9
6 pages
IRis
No ratings yet
IRis
19 pages
Machine Learning Lab: Raheel Aslam (74-FET/BSEE/F16)
No ratings yet
Machine Learning Lab: Raheel Aslam (74-FET/BSEE/F16)
3 pages
Classification
No ratings yet
Classification
3 pages
MD Asaduzzaman - 213002257
No ratings yet
MD Asaduzzaman - 213002257
3 pages
Classification Review
No ratings yet
Classification Review
8 pages
Unit 2 Supervised Learning
No ratings yet
Unit 2 Supervised Learning
20 pages
ML Manual With Outputs
No ratings yet
ML Manual With Outputs
30 pages
Detect Fake Social Media Profiles with SVM
No ratings yet
Detect Fake Social Media Profiles with SVM
8 pages
St. John College of Engineering and Management, Palghar - Maharashtra
No ratings yet
St. John College of Engineering and Management, Palghar - Maharashtra
11 pages
AIML Project
No ratings yet
AIML Project
4 pages
MD - Sajedul Islam - Assaignment - 02
No ratings yet
MD - Sajedul Islam - Assaignment - 02
11 pages
23BCE7199 ML Lab Assignment
No ratings yet
23BCE7199 ML Lab Assignment
15 pages
ML Internal Answers
No ratings yet
ML Internal Answers
9 pages
TP - Ipynb - Colab
No ratings yet
TP - Ipynb - Colab
6 pages
05 E RandomForest LoanData
No ratings yet
05 E RandomForest LoanData
8 pages
ML Yogesh
No ratings yet
ML Yogesh
23 pages
Rain in Australia Logistic Regression Classifier
No ratings yet
Rain in Australia Logistic Regression Classifier
10 pages
Lab Exam ... Roll No 24cs4103
No ratings yet
Lab Exam ... Roll No 24cs4103
4 pages
CCD - Ipynb - Colab
No ratings yet
CCD - Ipynb - Colab
6 pages
Practicalpgm ML
No ratings yet
Practicalpgm ML
33 pages
Aiml Ex 4-7
No ratings yet
Aiml Ex 4-7
8 pages
ADS Expt5 BE9 29
No ratings yet
ADS Expt5 BE9 29
3 pages
Data Analytics
No ratings yet
Data Analytics
10 pages
Karmbir's Python ML Project Solutions
No ratings yet
Karmbir's Python ML Project Solutions
20 pages
Btech1007022 Lab5.1
No ratings yet
Btech1007022 Lab5.1
9 pages
I Avaliação Parcial - 25.0 PTS - Gabarito
No ratings yet
I Avaliação Parcial - 25.0 PTS - Gabarito
9 pages
Data Analytcs 2
No ratings yet
Data Analytcs 2
2 pages
Shobit Sharma (2124399) ML Lab File PDF
No ratings yet
Shobit Sharma (2124399) ML Lab File PDF
19 pages
Telecom Churn Proj
No ratings yet
Telecom Churn Proj
4 pages
ML File
No ratings yet
ML File
10 pages
Text Classification with ML Algorithms
No ratings yet
Text Classification with ML Algorithms
5 pages
Machine Learning Assignment
No ratings yet
Machine Learning Assignment
7 pages
Da Lab Mannual
No ratings yet
Da Lab Mannual
25 pages
Print Out ML - Finallllllllllllllll
No ratings yet
Print Out ML - Finallllllllllllllll
11 pages
SanatKulkarni - AP22110010183 - Assignment5
No ratings yet
SanatKulkarni - AP22110010183 - Assignment5
8 pages
Machine Learning Strategies
No ratings yet
Machine Learning Strategies
59 pages
ML Functions
No ratings yet
ML Functions
12 pages
Btech1007022 Lab5
No ratings yet
Btech1007022 Lab5
14 pages
Home Work
No ratings yet
Home Work
12 pages
Unit 3
No ratings yet
Unit 3
13 pages
TK/KW/15 - 6235 Third Semester Master of Science (M. SC.) Examination
No ratings yet
TK/KW/15 - 6235 Third Semester Master of Science (M. SC.) Examination
3 pages
Revised OGs UFS 2017-22
No ratings yet
Revised OGs UFS 2017-22
9 pages
Master of Science (M.SC.) Third Semester (Statistics) (CBCS) Examination Decision Theory and Non Parametric Methods Compulsory Paper-1 Paper-I
No ratings yet
Master of Science (M.SC.) Third Semester (Statistics) (CBCS) Examination Decision Theory and Non Parametric Methods Compulsory Paper-1 Paper-I
12 pages
Survey Methodology and Estimation Procedure
No ratings yet
Survey Methodology and Estimation Procedure
13 pages
Soda Sales
No ratings yet
Soda Sales
2 pages
Principal Component Analysis: #Question 1
No ratings yet
Principal Component Analysis: #Question 1
6 pages
Understanding Z-Scores for Students
No ratings yet
Understanding Z-Scores for Students
2 pages
Random Forest
No ratings yet
Random Forest
5 pages
Graphic Designer Job PAN India
No ratings yet
Graphic Designer Job PAN India
2 pages
Profile
No ratings yet
Profile
2 pages
CAT TraCking Sheet
100% (1)
CAT TraCking Sheet
15 pages
Auto Repair: Career & Skills Guide
100% (1)
Auto Repair: Career & Skills Guide
10 pages
Methods of Acquiring Knowledge
No ratings yet
Methods of Acquiring Knowledge
37 pages
Unit 12
No ratings yet
Unit 12
10 pages
Clothing Word Search Puzzle
No ratings yet
Clothing Word Search Puzzle
1 page
Activity-Proposal - For Research Capacity 2017
No ratings yet
Activity-Proposal - For Research Capacity 2017
5 pages
Best Text To 3d Model Service
No ratings yet
Best Text To 3d Model Service
3 pages
Resumen MIS
No ratings yet
Resumen MIS
79 pages
Supporting Needy Individuals
No ratings yet
Supporting Needy Individuals
3 pages
UPSC NCERT Foundation 2hr Daily
No ratings yet
UPSC NCERT Foundation 2hr Daily
2 pages
ACE Star Model
50% (2)
ACE Star Model
4 pages
2023 DLL Pre-Calculus 11 Final W 10
100% (1)
2023 DLL Pre-Calculus 11 Final W 10
5 pages
2nd Year and Above Second Semester Class Schedule For Regular Undergraduate
No ratings yet
2nd Year and Above Second Semester Class Schedule For Regular Undergraduate
11 pages
Knowledge Ceha
No ratings yet
Knowledge Ceha
4 pages
Documenting Agricultural Indigenous Knowledge and Provision of Access Through Online Database Platform
No ratings yet
Documenting Agricultural Indigenous Knowledge and Provision of Access Through Online Database Platform
15 pages
Popa 2021 Operationalizing Historical Consciousness A Review and Synthesis of The Literature On Meaning Making in
No ratings yet
Popa 2021 Operationalizing Historical Consciousness A Review and Synthesis of The Literature On Meaning Making in
38 pages
Stative vs Dynamic Verbs in Tamil
No ratings yet
Stative vs Dynamic Verbs in Tamil
8 pages
Army Public School Application Format
No ratings yet
Army Public School Application Format
4 pages
Density of Matter Fast Track GRASP Math Packet V1.5 02.27.2019
No ratings yet
Density of Matter Fast Track GRASP Math Packet V1.5 02.27.2019
88 pages
Understanding "Deserve" in English
No ratings yet
Understanding "Deserve" in English
2 pages
Kaizen Costing: Principles and Benefits
100% (1)
Kaizen Costing: Principles and Benefits
18 pages
The Power of First Impressions Creating Impact at First Glance
No ratings yet
The Power of First Impressions Creating Impact at First Glance
13 pages
Biography of
No ratings yet
Biography of
2 pages
Corporal Punishment
No ratings yet
Corporal Punishment
9 pages
Week 3
No ratings yet
Week 3
61 pages
Text Books Free Tests Emergencies PDF
No ratings yet
Text Books Free Tests Emergencies PDF
2 pages
9699 Sociology Example Candidate Responses 2014 PDF
100% (3)
9699 Sociology Example Candidate Responses 2014 PDF
201 pages
Airlive Mfp-101u U
No ratings yet
Airlive Mfp-101u U
113 pages
Itep Score
No ratings yet
Itep Score
3 pages