Artificial Intelligence & Machine
Learning Comprehensive Assessment
(ML-ADVSA-400)
Duration: 17 hours
Part 1 (555 pts): Written Exam (Sections A, B, and C) – 7 hours
Part 2 (745 pts): Lab Exam – 8 hours
Part 3 (200 pts): Challenge and Defend – 2 hours
Part 2: Lab Exam (745 points)
Objective
Build, evaluate, and interpret a binary classification model for cancer detection using a neural
network-based approach. The dataset contains 30 numerical features per patient record.
Task 1: Data Preparation and Exploration (1 hour)
**Goals:** Understand the data distribution, ensure data quality, and prepare inputs for the ML model.
- Load the Breast Cancer Wisconsin dataset.
- Perform EDA: shape, nulls, outliers, class balance.
- Visualize key features, feature correlations, and target distribution.
- Normalize features using `StandardScaler`.
**Expanded Response:**
```python
from sklearn.datasets import load_breast_cancer
from sklearn.preprocessing import StandardScaler
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
# Load dataset
X, y = load_breast_cancer(return_X_y=True, as_frame=True)
df = X.copy()
df['target'] = y
# Check for missing values
print("Missing values:\n", df.isnull().sum())
# Visualize class distribution
sns.countplot(x='target', data=df)
plt.title('Class Distribution (0 = Malignant, 1 = Benign)')
plt.show()
# Correlation heatmap
plt.figure(figsize=(12, 10))
sns.heatmap(df.corr(), cmap='coolwarm')
plt.title('Feature Correlation Heatmap')
plt.show()
# Normalize features (note: fitting the scaler on the full dataset leaks
# statistics into the validation split created in Task 2; fitting on the
# training split alone is the safer pattern)
scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)
```
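The goals list an outlier check alongside shape and class balance, which the snippet above omits. A minimal sketch flags standardized rows beyond a z-score cutoff; the 3σ threshold is a common heuristic, not part of the task specification:
```python
import numpy as np

# Report the dataset shape and flag rows with any |z| above the cutoff.
# The threshold of 3 is illustrative; tune it to the tolerance of the task.
print("Dataset shape:", df.shape)
z_outliers = (np.abs(X_scaled) > 3).any(axis=1)
print(f"Rows with at least one |z| > 3: {z_outliers.sum()} of {len(X_scaled)}")
```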
Task 2: Model Architecture and Training (2 hours)
**Goals:** Build and train a simple neural network with appropriate configurations.
- Create 3-layer MLP (64-32-1 neurons) with dropout and batch normalization.
- Use `ReLU` for hidden layers and `Sigmoid` for output.
- Use `BCELoss` for binary classification.
- Implement training loop with early stopping and validation split.
**Expanded Response (PyTorch):**
```python
import torch
import torch.nn as nn
from sklearn.model_selection import train_test_split
# Stratified split keeps the class ratio consistent across train/validation
X_train, X_val, y_train, y_val = train_test_split(
    X_scaled, y, test_size=0.2, random_state=42, stratify=y
)
tensor_x = torch.tensor(X_train, dtype=torch.float32)
tensor_y = torch.tensor(y_train.values.reshape(-1, 1), dtype=torch.float32)
train_data = torch.utils.data.TensorDataset(tensor_x, tensor_y)
train_loader = torch.utils.data.DataLoader(train_data, batch_size=32, shuffle=True)
class CancerNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.model = nn.Sequential(
            nn.Linear(30, 64),    # 30 input features -> 64 hidden units
            nn.BatchNorm1d(64),
            nn.ReLU(),
            nn.Dropout(0.3),
            nn.Linear(64, 32),
            nn.BatchNorm1d(32),
            nn.ReLU(),
            nn.Dropout(0.3),
            nn.Linear(32, 1),
            nn.Sigmoid()          # probability output for BCELoss
        )

    def forward(self, x):
        return self.model(x)
model = CancerNet()
optimizer = torch.optim.Adam(model.parameters(), lr=0.001)
criterion = nn.BCELoss()
# Training loop (simplified; early stopping and validation are sketched below)
for epoch in range(100):
    model.train()
    for xb, yb in train_loader:
        optimizer.zero_grad()
        pred = model(xb)
        loss = criterion(pred, yb)
        loss.backward()
        optimizer.step()
```
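The goals require early stopping, which the simplified loop above omits. A fuller sketch, assuming a patience of 10 epochs (an illustrative value) and reusing the validation split created above:
```python
# Validation tensors from the split above
x_val_t = torch.tensor(X_val, dtype=torch.float32)
y_val_t = torch.tensor(y_val.values.reshape(-1, 1), dtype=torch.float32)

best_val_loss, patience, wait = float("inf"), 10, 0
for epoch in range(100):
    model.train()
    for xb, yb in train_loader:
        optimizer.zero_grad()
        loss = criterion(model(xb), yb)
        loss.backward()
        optimizer.step()

    # Check validation loss once per epoch; stop when it plateaus
    model.eval()
    with torch.no_grad():
        val_loss = criterion(model(x_val_t), y_val_t).item()
    if val_loss < best_val_loss:
        best_val_loss, wait = val_loss, 0
        torch.save(model.state_dict(), "best_model.pt")  # checkpoint the best weights
    else:
        wait += 1
        if wait >= patience:
            print(f"Early stopping at epoch {epoch}")
            break
```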
Task 3: Model Evaluation (1 hour)
**Goals:** Assess model performance using classification metrics and diagnostic plots.
- Predict on validation data and compute metrics.
- Plot confusion matrix and ROC curve.
- Discuss the model's balance between sensitivity (recall) and specificity.
**Expanded Response:**
```python
from sklearn.metrics import (classification_report, roc_auc_score,
                             confusion_matrix, ConfusionMatrixDisplay)
# Validation set inference
model.eval()
x_val_tensor = torch.tensor(X_val, dtype=torch.float32)
with torch.no_grad():
    y_pred = model(x_val_tensor).numpy().ravel()

# Metrics (the 0.5 threshold converts probabilities to class labels)
print(classification_report(y_val, y_pred > 0.5))
print("AUC Score:", roc_auc_score(y_val, y_pred))

# Confusion Matrix
cm = confusion_matrix(y_val, y_pred > 0.5)
ConfusionMatrixDisplay(cm).plot()
plt.title("Confusion Matrix")
plt.show()
```
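The goals also ask for a ROC curve, which the snippet above omits; a short addition using `RocCurveDisplay.from_predictions` (available in scikit-learn 1.0+):
```python
from sklearn.metrics import RocCurveDisplay

# ROC curve from the raw sigmoid scores (not the thresholded labels),
# so the curve reflects the full ranking behaviour of the model
RocCurveDisplay.from_predictions(y_val, y_pred)
plt.title("ROC Curve (Validation Set)")
plt.show()
```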
Task 4: Interpretability and Debugging (1 hour)
**Goals:** Interpret model behavior using SHAP to uncover key predictive features.
- Use SHAP to visualize local and global explanations.
- Identify features influencing predictions for a specific patient.
- Provide interpretability summary.
**Expanded Response:**
```python
import shap

# shap.Explainer expects a NumPy-in / NumPy-out callable, so wrap the model
# (shap.DeepExplainer is an alternative that accepts PyTorch models directly)
def predict_fn(x):
    model.eval()  # inference mode for dropout/batchnorm
    with torch.no_grad():
        return model(torch.tensor(x, dtype=torch.float32)).numpy().ravel()

explainer = shap.Explainer(predict_fn, X_scaled[:100], feature_names=list(X.columns))
shap_values = explainer(X_scaled[:100])
shap.plots.beeswarm(shap_values)       # global view
shap.plots.waterfall(shap_values[0])   # local explanation for one patient
```
**Analysis:**
Top features contributing to predictions include 'worst perimeter', 'mean concave points', and
'mean area'. SHAP confirms alignment with known clinical biomarkers.
Task 5: Error Analysis and Model Improvement (1 hour)
**Goals:** Identify weaknesses and propose enhancements.
- Examine incorrect predictions.
- Analyze decision boundaries.
- Suggest two improvement strategies with rationale.
**Expanded Analysis:**
- Many errors occur on ambiguous, borderline cases near the decision threshold.
- Misclassified malignant cases show feature overlap with the benign class; the sketch below pulls these rows out for inspection.
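A minimal sketch for extracting the misclassified validation rows, reusing `y_pred`, `X_val`, and `y_val` from Task 3 (the 0.5 threshold matches the metrics above):
```python
import numpy as np

# Identify validation rows where the thresholded prediction disagrees
# with the true label, then collect them for manual inspection
pred_labels = (y_pred > 0.5).astype(int)
wrong = np.flatnonzero(pred_labels != y_val.values)
print(f"Misclassified: {len(wrong)} of {len(y_val)}")

errors = pd.DataFrame(X_val[wrong], columns=X.columns)
errors["true_label"] = y_val.values[wrong]
errors["predicted_score"] = y_pred[wrong]
print(errors.head())
```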
**Suggestions:**
1. **SMOTE for class imbalance:** The minority class (malignant) can be synthetically oversampled so the model sees more malignant variation during training; a sketch follows this list.
2. **1D-convolutional hybrid layers:** If sequential or structural patterns emerge among the features, a hybrid model (e.g., MLP + 1D CNN) could increase capacity.
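A sketch of suggestion 1, assuming the third-party `imbalanced-learn` package is available; SMOTE is applied to the training split only so the validation metrics stay untouched:
```python
from imblearn.over_sampling import SMOTE

# Oversample the minority (malignant) class in the training split only;
# resampling before the split would leak synthetic points into validation
smote = SMOTE(random_state=42)
X_train_res, y_train_res = smote.fit_resample(X_train, y_train)
print("Resampled class counts:", pd.Series(y_train_res).value_counts().to_dict())
```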