Lecture 3: Unsupervised Learning & Neural Networks - MCQ Study Guide
Key Concepts Explained Simply
Unsupervised Learning
What is Unsupervised Learning? Unsupervised learning finds patterns in
data without labeled responses. It’s like sorting objects without being told what
categories to use.
Clustering Algorithms
K-means Clustering
• What it is: Groups similar data points into K clusters
• How it works:
1. Choose K (number of clusters)
2. Randomly place K centroids
3. Assign each point to the nearest centroid
4. Recalculate centroids as the average of all points in the cluster
5. Repeat steps 3-4 until convergence
• Objective function: Minimize the sum of squared distances from points
to their centroids
• Determining optimal K:
– Elbow method: Plot error vs. K and look for the “elbow”
– Silhouette score: Measures how similar points are to their own cluster
vs. other clusters
• Limitations:
– Sensitive to initial centroid positions
– Assumes spherical clusters
– Sensitive to outliers
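The loop described above can be sketched in a few lines of NumPy. This is a minimal illustration rather than a production implementation (it assumes no cluster ever ends up empty and uses a fixed seed for the random initial centroids):

```python
import numpy as np

def kmeans(X, k, n_iters=100, seed=0):
    rng = np.random.default_rng(seed)
    # Step 2: randomly choose k data points as the initial centroids
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iters):
        # Step 3: assign each point to its nearest centroid
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Step 4: recompute each centroid as the mean of its assigned points
        new_centroids = np.array([X[labels == j].mean(axis=0) for j in range(k)])
        # Step 5: stop once the centroids no longer move (convergence)
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return centroids, labels
```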
Hierarchical Clustering
• What it is: Builds a tree of clusters (dendrogram)
• Types:
– Agglomerative (bottom-up): Start with each point as a cluster and
merge
– Divisive (top-down): Start with one cluster and divide
• Linkage methods:
– Single linkage: Minimum distance between points in clusters
– Complete linkage: Maximum distance between points in clusters
– Average linkage: Average distance between all pairs of points
– Ward’s method: Minimizes variance within clusters
• Advantages:
– No need to specify number of clusters beforehand
– Creates a hierarchy that can be cut at different levels
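As an illustration, agglomerative clustering is available in SciPy (assuming `scipy` is installed): `linkage` builds the dendrogram and `fcluster` cuts it at a chosen level.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

X = np.random.rand(30, 2)                        # toy 2-D data
Z = linkage(X, method="ward")                    # bottom-up merges using Ward's method
labels = fcluster(Z, t=3, criterion="maxclust")  # cut the tree into 3 clusters
print(labels)
```

Changing `method` to "single", "complete", or "average" switches the linkage criterion listed above.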
DBSCAN (Density-Based Spatial Clustering of Applications with Noise)
• What it is: Groups dense regions of points, marking sparse regions as
noise
• Parameters:
– ε (epsilon): Maximum distance between two points to be considered neighbors
– MinPts: Minimum number of points required to form a dense region
• Point types:
– Core points: Have at least MinPts points within distance ε
– Border points: Within distance ε of a core point but have fewer than MinPts neighbors
– Noise points: Neither core nor border points
• Advantages:
– Can find arbitrarily shaped clusters
– Robust to outliers
– Doesn’t require specifying number of clusters
• Disadvantages:
– Sensitive to parameter selection
– Struggles with varying density clusters
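A quick usage sketch with scikit-learn (assumed available); the two parameters map directly to ε and MinPts, and points labelled -1 are noise:

```python
import numpy as np
from sklearn.cluster import DBSCAN

X = np.random.rand(100, 2)                               # toy 2-D data
labels = DBSCAN(eps=0.1, min_samples=5).fit_predict(X)   # eps = ε, min_samples = MinPts
n_clusters = len(set(labels)) - (1 if -1 in labels else 0)
print(n_clusters, "clusters;", (labels == -1).sum(), "noise points")
```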
Dimensionality Reduction
Principal Component Analysis (PCA)
• What it is: Reduces dimensions while preserving as much variance as
possible
• How it works:
1. Standardize the data
2. Compute covariance matrix
3. Calculate eigenvectors and eigenvalues
4. Sort eigenvectors by eigenvalues (highest to lowest)
5. Select top k eigenvectors to form new feature space
6. Transform original data to new space
• Applications:
– Data compression
– Visualization
– Noise reduction
– Feature extraction
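The six steps map almost one-to-one onto NumPy operations. A minimal sketch (assumes every feature has non-zero standard deviation):

```python
import numpy as np

def pca(X, k):
    # 1. standardize the data (zero mean, unit variance per feature)
    Xs = (X - X.mean(axis=0)) / X.std(axis=0)
    # 2. covariance matrix of the standardized features
    cov = np.cov(Xs, rowvar=False)
    # 3. eigenvalues / eigenvectors (eigh, since the covariance matrix is symmetric)
    eigvals, eigvecs = np.linalg.eigh(cov)
    # 4. sort eigenvectors by eigenvalue, highest to lowest
    order = np.argsort(eigvals)[::-1]
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    # 5. keep the top-k eigenvectors as the new feature space
    W = eigvecs[:, :k]
    # 6. project the standardized data onto that space
    return Xs @ W, eigvals
```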
t-SNE (t-Distributed Stochastic Neighbor Embedding)
• What it is: Nonlinear dimensionality reduction technique for visualization
• Key idea: Convert similarities between data points to joint probabilities
and minimize the KL divergence
• Advantages:
– Preserves local structure
– Effective for visualization
• Disadvantages:
– Computationally intensive
– Non-deterministic
– Not suitable for dimensionality reduction for modeling
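For reference, a typical scikit-learn call (assuming scikit-learn is installed; the perplexity value is just an illustrative default):

```python
import numpy as np
from sklearn.manifold import TSNE

X = np.random.rand(200, 50)        # toy high-dimensional data
X_2d = TSNE(n_components=2, perplexity=30, random_state=0).fit_transform(X)
print(X_2d.shape)                  # (200, 2), ready for a scatter plot
```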
Association Rule Learning
• What it is: Discovers interesting relations between variables in large
databases
• Key metrics:
– Support: Frequency of an itemset = (transactions containing itemset)
/ (total transactions)
– Confidence: Likelihood of Y given X = support(X ∪ Y) / support(X)
– Lift: Ratio of observed support to expected support if X and Y were
independent
• Apriori Algorithm:
1. Find all frequent itemsets with support ≥ minimum support
2. Generate rules with confidence ≥ minimum confidence
• Applications:
– Market basket analysis
– Product recommendation
– Cross-selling
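A small sketch of the three metrics computed from a list of transactions in plain Python (toy data, no library assumed):

```python
def support(itemset, transactions):
    # fraction of transactions that contain every item in the itemset
    return sum(itemset <= t for t in transactions) / len(transactions)

def confidence(X, Y, transactions):
    return support(X | Y, transactions) / support(X, transactions)

def lift(X, Y, transactions):
    return confidence(X, Y, transactions) / support(Y, transactions)

transactions = [{"bread", "butter"}, {"bread"}, {"milk", "butter"}, {"bread", "butter"}]
print(support({"bread", "butter"}, transactions))      # 0.5
print(confidence({"bread"}, {"butter"}, transactions))  # 0.666...
print(lift({"bread"}, {"butter"}, transactions))        # ~0.89
```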
Neural Networks
Basic Structure
• Input layer: Receives the features
• Hidden layer(s): Processes the information
• Output layer: Produces the prediction
• Neuron (Perceptron): Basic unit that:
– Receives inputs
– Applies weights
– Adds bias
– Applies activation function
– Produces output
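In code, those steps for a single neuron collapse to output = activation(w·x + b). A minimal sketch using a sigmoid activation:

```python
import numpy as np

def neuron(x, w, b, activation=lambda z: 1 / (1 + np.exp(-z))):
    # weighted sum of inputs plus bias, passed through an activation function
    return activation(np.dot(w, x) + b)

print(neuron(x=np.array([0.5, -1.0]), w=np.array([0.8, 0.2]), b=0.1))
```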
Activation Functions
• Sigmoid: f(x) = 1/(1+e^(-x))
– Range: (0, 1)
– Used in binary classification output layers
• Tanh: f(x) = (e^x - e^(-x))/(e^x + e^(-x))
– Range: (-1, 1)
– Zero-centered
• ReLU (Rectified Linear Unit): f(x) = max(0, x)
– Range: [0, ∞)
– Computationally efficient
– Helps mitigate vanishing gradient problem
• Leaky ReLU: f(x) = max(αx, x) where α is a small constant
– Addresses “dying ReLU” problem
• Softmax: f(x_i) = e^(x_i)/Σ(e^(x_j))
– Used for multi-class classification output layers
– Outputs sum to 1 (probability distribution)
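The same functions written out in NumPy (the softmax subtracts the maximum before exponentiating for numerical stability, which does not change the result):

```python
import numpy as np

sigmoid    = lambda x: 1 / (1 + np.exp(-x))
tanh       = lambda x: np.tanh(x)
relu       = lambda x: np.maximum(0, x)
leaky_relu = lambda x, alpha=0.01: np.maximum(alpha * x, x)

def softmax(x):
    e = np.exp(x - np.max(x))   # subtract max for numerical stability
    return e / e.sum()

print(softmax(np.array([2.0, 1.0, 0.1])))   # outputs sum to 1
```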
Training Neural Networks
• Forward Propagation: Compute outputs given inputs
• Loss Function:
– Mean Squared Error (regression)
– Binary Cross-Entropy (binary classification)
– Categorical Cross-Entropy (multi-class classification)
• Backpropagation: Calculate gradients of the loss with respect to weights
• Optimization Algorithms:
– Stochastic Gradient Descent (SGD)
– Adam
– RMSprop
– Adagrad
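A minimal sketch of the training cycle for a single linear neuron with an MSE loss and plain gradient descent updates; it is only meant to show the forward pass / gradient / update loop, not a full backpropagation implementation:

```python
import numpy as np

# toy data: one input feature, one target (y = 2x)
x, y = np.array([1.0, 2.0, 3.0]), np.array([2.0, 4.0, 6.0])
w, b, lr = 0.0, 0.0, 0.05                  # weight, bias, learning rate

for _ in range(2000):
    y_hat = w * x + b                      # forward propagation
    loss = np.mean((y_hat - y) ** 2)       # mean squared error
    grad_w = np.mean(2 * (y_hat - y) * x)  # dLoss/dw via the chain rule
    grad_b = np.mean(2 * (y_hat - y))      # dLoss/db
    w -= lr * grad_w                       # gradient descent update
    b -= lr * grad_b

print(round(w, 2), round(b, 2))            # approaches w ≈ 2, b ≈ 0
```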
Regularization Techniques
• Dropout: Randomly deactivate neurons during training
• L1/L2 Regularization: Add penalty terms to the loss function
• Batch Normalization: Normalize layer inputs for each mini-batch
• Early Stopping: Stop training when validation error starts increasing
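For orientation, this is roughly how those four techniques appear in a Keras model definition (assuming TensorFlow/Keras is available; layer sizes and rates are arbitrary):

```python
import tensorflow as tf

model = tf.keras.Sequential([
    tf.keras.layers.Dense(64, activation="relu",
                          kernel_regularizer=tf.keras.regularizers.l2(0.01)),  # L2 penalty
    tf.keras.layers.Dropout(0.5),          # randomly deactivate 50% of units during training
    tf.keras.layers.BatchNormalization(),  # normalize activations per mini-batch
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
early_stop = tf.keras.callbacks.EarlyStopping(monitor="val_loss", patience=3)
# model.fit(X_train, y_train, validation_split=0.2, callbacks=[early_stop])
```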
Convolutional Neural Networks (CNNs)
Key Components
• Convolutional layers: Apply filters to detect features
• Filters/Kernels: Small matrices that slide over the input
• Feature Maps: Output of applying filters to the input
• Pooling layers: Reduce spatial dimensions
– Max pooling: Take maximum value in each window
– Average pooling: Take average value in each window
• Fully connected layers: Final classification
CNN Architecture
• Input layer: Holds the raw pixel values
• Convolutional layer: Applies convolution operation
• Activation layer: Applies non-linearity (usually ReLU)
• Pooling layer: Reduces dimensions
• Fully connected layer: Connects to all neurons in previous layer
CNN Applications
• Image classification
• Object detection
• Face recognition
• Medical image analysis
CNN Calculations
• Output size after convolution: ((W-F+2P)/S)+1
– W: input size
– F: filter size
– P: padding
– S: stride
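A tiny helper for that formula (a hypothetical convenience function, handy for checking the MCQ arithmetic below):

```python
def conv_output_size(W, F, P=0, S=1):
    # ((input size - filter size + 2 * padding) / stride) + 1
    return (W - F + 2 * P) // S + 1

print(conv_output_size(28, 5))   # 24  (see Problem 1)
print(conv_output_size(6, 3))    # 4   (see Question 6)
```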
Recurrent Neural Networks (RNNs)
What are RNNs?
• Neural networks with loops to maintain information over time
• Designed for sequential data (time series, text, speech)
Types of RNNs
• Simple RNN: Basic recurrent structure
• LSTM (Long Short-Term Memory): Solves vanishing gradient problem with gates
• GRU (Gated Recurrent Unit): Simplified version of LSTM
Applications
• Natural language processing
• Speech recognition
• Time series prediction
• Machine translation
MCQ Practice Questions
Question 1
In DBSCAN clustering, what are points called that have at least
MinPts points within distance ε? - A) Border points - B) Core points - C)
Noise points - D) Centroid points
Answer: B) Core points
Explanation: In DBSCAN, core points are defined as points that have at least
MinPts points within distance ε, making them central to forming clusters.
Question 2
What is the primary purpose of Principal Component Analysis
(PCA)? - A) Classification - B) Clustering - C) Dimensionality reduction - D)
Association rule learning
Answer: C) Dimensionality reduction
Explanation: PCA is a technique used to reduce the dimensionality of a
dataset while preserving as much variance as possible.
Question 3
In a neural network with an input layer of 10 nodes, a hidden layer
of 20 nodes, and an output layer of 5 nodes, how many weights are
there in total (excluding biases)? - A) 35 - B) 300 - C) 200 - D) 250
Answer: B) 300
Explanation: The number of weights is calculated as: (input nodes × hidden
nodes) + (hidden nodes × output nodes) = (10 × 20) + (20 × 5) = 200 + 100
= 300.
Question 4
Which activation function is commonly used in the output layer for
multi-class classification problems? - A) ReLU - B) Sigmoid - C) Tanh -
D) Softmax
Answer: D) Softmax
Explanation: Softmax converts a vector of values into a probability distribution, making it ideal for multi-class classification where outputs need to sum to 1.
Question 5
In CNN architecture, what is the purpose of a pooling layer? - A) To
apply filters to the input - B) To reduce spatial dimensions - C) To fully connect
all neurons - D) To normalize the input
Answer: B) To reduce spatial dimensions
Explanation: Pooling layers reduce the spatial dimensions (width and height)
of the input volume, which helps reduce computation and control overfitting.
Question 6
For deep learning with CNN, what is the size of filter (Kernel) required to produce a 4x4 feature map from an image of 6x6 pixels, assuming the filter is applied with a sliding window of 1 pixel? - A) 3x3 - B) 4x4 - C) 2x2 - D) 6x6
Answer: A) 3x3
Explanation: Using the formula Output size = ((Input size - Filter size) / Stride) + 1:
4 = ((6 - Filter size) / 1) + 1, so 3 = 6 - Filter size, giving Filter size = 3.
Therefore, a 3x3 filter is needed.
Question 7
In association rule mining, if the marketing team specified the minimum support of 0.2, what is the maximum support that can be specified? - A) 0.2 - B) 1 - C) 0 - D) 0.5
Answer: B) 1
Explanation: Support is a probability measure ranging from 0 to 1. A support
of 1 means the itemset appears in 100% of transactions, which is the maximum
possible value.
Question 8
Which of the following is NOT a type of point in DBSCAN clustering?
- A) Core point - B) Border point - C) Noise point - D) Centroid point
Answer: D) Centroid point
Explanation: DBSCAN defines three types of points: core points, border
points, and noise points. Centroid points are a concept from K-means clustering,
not DBSCAN.
Question 9
What does the “vanishing gradient problem” refer to in neural networks? - A) When gradients become too large during backpropagation - B) When gradients become very small during backpropagation - C) When the learning rate is too high - D) When there are too many hidden layers
Answer: B) When gradients become very small during backpropagation
Explanation: The vanishing gradient problem occurs when gradients become
extremely small as they propagate backward through the network, making it
difficult for early layers to learn.
Calculation Problems
Problem 1: CNN Output Size
If you have an input image of size 28x28 and apply a 5x5 convolutional
filter with stride 1 and no padding, what will be the size of the output
feature map?
Solution: Using the formula Output size = ((Input size - Filter size) / Stride) + 1:
Output size = ((28 - 5) / 1) + 1 = 23 + 1 = 24.
Therefore, the output feature map will be 24x24.
Problem 2: Neural Network Weights
A neural network has 3 layers: an input layer with 8 nodes, a hidden
layer with 12 nodes, and an output layer with 4 nodes. How many
weights and biases does this network have in total?
Solution:
Weights: between input and hidden: 8 × 12 = 96; between hidden and output: 12 × 4 = 48; total weights: 96 + 48 = 144.
Biases: hidden layer: 12; output layer: 4; total biases: 16.
Total parameters: 144 + 16 = 160.
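The same arithmetic as a quick check in code (a hypothetical helper that takes the layer sizes as a list):

```python
def count_params(layer_sizes):
    # weights: fan_in × fan_out for each consecutive pair of layers
    weights = sum(a * b for a, b in zip(layer_sizes, layer_sizes[1:]))
    # biases: one per node in every layer after the input layer
    biases = sum(layer_sizes[1:])
    return weights, biases

w, b = count_params([8, 12, 4])
print(w, b, w + b)   # 144 16 160
```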
Problem 3: Association Rule Metrics
In a market basket analysis of 200 transactions, itemset {bread, butter} appears in 40 transactions, itemset {bread} appears in 100 transactions, and itemset {butter} appears in 80 transactions. Calculate the support, confidence, and lift for the rule “bread → butter”.
Solution:
Support({bread, butter}) = 40/200 = 0.2 or 20%
Support({bread}) = 100/200 = 0.5 or 50%
Support({butter}) = 80/200 = 0.4 or 40%
Confidence(bread → butter) = Support({bread, butter}) / Support({bread}) = 0.2 / 0.5 = 0.4 or 40%
Lift(bread → butter) = Confidence(bread → butter) / Support({butter}) = 0.4 / 0.4 = 1
Problem 4: PCA Variance Explained
After performing PCA on a dataset with 10 features, you find that
the first 3 principal components have eigenvalues of 4.2, 2.8, and
1.5, while the remaining components have eigenvalues summing to
1.5. What percentage of variance is explained by the first 3 principal
components?
Solution:
Total variance = 4.2 + 2.8 + 1.5 + 1.5 = 10
Variance explained by first 3 components = 4.2 + 2.8 + 1.5 = 8.5
Percentage of variance explained = (8.5 / 10) × 100 = 85%
Key Formulas to Remember
1. CNN Output Size: ((W-F+2P)/S)+1
• W: input size
• F: filter size
• P: padding
• S: stride
2. Support: (Transactions containing itemset) / (Total transactions)
3. Confidence: Support(X ∪ Y) / Support(X)
4. Lift: Confidence(X→Y) / Support(Y)
5. Sigmoid Function: f(x) = 1/(1+e^(-x))
6. ReLU Function: f(x) = max(0, x)
7. Softmax Function: f(x_i) = e^(x_i)/Σ(e^(x_j))
8. Binary Cross-Entropy: -[y·log(p) + (1-y)·log(1-p)]
9. Categorical Cross-Entropy: -Σ[y_i·log(p_i)]
Tips for MCQ Questions
1. Understand the algorithms: Know how each clustering and dimensionality reduction algorithm works.
2. Memorize CNN formulas: Be able to calculate output sizes after convolution and pooling.
3. Know activation functions: Understand which activation functions are
used for different purposes.
4. Practice calculations: Be comfortable with calculating support, confidence, and lift for association rules.
5. Understand neural network architecture: Know how to calculate
the number of parameters in a network.