Interview Prep:
Why learn ML? Because not every problem is deterministic, so we need to learn the randomness, and that is where machine learning comes in, e.g. in handwriting recognition not everybody writes a 2 in the same manner.
There are two parts to it, training and prediction. Training is the process of learning from historical data; it is basically a mapping from the inputs to the outputs we want to predict, and it returns a model to us.
In prediction we predict on the test data.
Basics:
1. Supervised Learning:-
Talking about types of supervision, there are two types: classification (in this our output Y is categorical) and regression (in which our output Y is numeric).
Classification -> Logistic Regression, Decision Tree, Random Forest, Support Vector Machines (SVM), Naive Bayes, etc.
Regression -> Linear Regression, Regression Trees, and Kernel Regression.
A loss function tells us how good or bad a model is on test data.
We don't want to fit to the noise of the data, we want to fit to the pattern of the data.
Both overfitting and underfitting are bad for our model, so how do we solve that? For that we take the help of regularization.
L1 regularization tries to push small and insignificant weight values to exactly zero.
L2 regularization does the same but it is not as aggressive as L1; it applies a more uniform reduction across all the coefficients involved.
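A minimal sketch of that difference using scikit-learn's Lasso (L1) and Ridge (L2); the synthetic data and alpha values here are just illustrative assumptions:

    import numpy as np
    from sklearn.linear_model import Lasso, Ridge

    rng = np.random.default_rng(0)
    X = rng.normal(size=(100, 10))
    y = X[:, 0] * 3.0 + rng.normal(scale=0.1, size=100)  # only feature 0 matters

    l1 = Lasso(alpha=0.1).fit(X, y)
    l2 = Ridge(alpha=0.1).fit(X, y)
    print(l1.coef_)  # most irrelevant coefficients driven exactly to 0
    print(l2.coef_)  # all coefficients shrunk, but rarely exactly 0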
But the question comes, how do we quantify a fit, whether a particular fit is a good fit or a bad fit?
There comes the bias-variance trade-off to measure this.
Overfitting is the case where we fit a model with higher complexity than required by the data, and in underfitting we fit a model with lower complexity than is required by the data.
But in practice how would you determine whether a particular fit is a good fit or a bad fit?
Given any data set we split it into a train sample and a test sample, train on the train data and test on the test data, and from that we try to be somewhere in the middle: pick the model for which the error on the test sample is lowest.
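A minimal sketch of that model-selection loop with scikit-learn; the dataset and the candidate complexities (here, regularization strengths) are just placeholder assumptions:

    from sklearn.datasets import load_diabetes
    from sklearn.linear_model import Ridge
    from sklearn.metrics import mean_squared_error
    from sklearn.model_selection import train_test_split

    X, y = load_diabetes(return_X_y=True)
    X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

    # try a few candidate models and keep the one with the lowest test error
    best = min(
        (mean_squared_error(y_test, Ridge(alpha=a).fit(X_train, y_train).predict(X_test)), a)
        for a in [0.01, 0.1, 1.0, 10.0]
    )
    print("best test MSE, alpha:", best)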
Ridge regression is the same as linear regression, only in this we add some regularization to the diagonal elements (of X^T X in the closed-form solution).
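A minimal NumPy sketch of that closed form, w = (X^T X + lambda*I)^(-1) X^T y; the data and lambda value are made up for illustration:

    import numpy as np

    rng = np.random.default_rng(0)
    X = rng.normal(size=(50, 5))
    y = X @ np.array([1.0, -2.0, 0.5, 0.0, 3.0]) + rng.normal(scale=0.1, size=50)

    lam = 0.5
    # ordinary least squares plus lambda on the diagonal
    w = np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ y)
    print(w)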
The likelihood is the product of all the probabilities; in theory that works, but in practice it can lead to numerical underflow issues, so we take the sum of the logs of the probabilities instead, which is equivalent, and this is called the conditional log-likelihood. The problem is that we cannot solve this type of regression analytically, that is, we cannot minimize the function in closed form, so we use iterative solvers.
The trick we use is that we start with some initial value of W and then move in the direction in which the value of J goes down: we compute the gradient and move in its opposite direction, where eta is the learning rate.
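A minimal gradient-descent sketch for logistic regression's negative conditional log-likelihood; the synthetic data, eta, and iteration count are illustrative assumptions:

    import numpy as np

    rng = np.random.default_rng(0)
    X = rng.normal(size=(200, 3))
    y = (X @ np.array([2.0, -1.0, 0.5]) > 0).astype(float)

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    w = np.zeros(3)    # initial value of W
    eta = 0.1          # learning rate
    for _ in range(500):
        p = sigmoid(X @ w)
        grad = X.T @ (p - y) / len(y)   # gradient of the negative log-likelihood
        w -= eta * grad                 # move opposite to the gradient
    print(w)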
When the size of the data is very large, like millions of samples, computing the full gradient is too expensive, so we ask whether we can get an estimate of the direction in which we need to move. Why not plug in one sample at a time: this is stochastic gradient descent. But the problem is too much variance, so we do something in between and go for mini-batch gradient descent.
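A minimal mini-batch variant of the same logistic-regression loop (kept standalone so it runs on its own); the batch size and step count are arbitrary assumptions:

    import numpy as np

    rng = np.random.default_rng(0)
    X = rng.normal(size=(200, 3))
    y = (X @ np.array([2.0, -1.0, 0.5]) > 0).astype(float)

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    w, eta, batch_size = np.zeros(3), 0.1, 32
    for _ in range(2000):
        idx = rng.choice(len(y), size=batch_size, replace=False)   # random mini-batch
        p = sigmoid(X[idx] @ w)
        grad = X[idx].T @ (p - y[idx]) / batch_size   # noisy but cheap gradient estimate
        w -= eta * grad
    print(w)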
Now we will move on to the next topic, tree models.
A decision tree is more like an if-else statement.
The number of leaves corresponds to the number of different regions the feature space is split into.
Let's talk about how we go about building a decision tree. It is basically a greedy divide-and-conquer algorithm: at the current node we check which feature gives the most satisfying split, and we split the tree there based on that feature.
We try to choose the split which results in the greatest decrease in the impurity of the nodes.
Before talking about the criteria for measuring impurity, let's take an example.
We have used the Gini impurity value, so we see 0.34 < 0.49, so the second is the better split. Remember, high Gini impurity is bad, so we don't want that.
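A minimal sketch of computing the weighted Gini impurity of a split; the class counts below are made-up numbers, not the example above:

    import numpy as np

    def gini(counts):
        # Gini impurity = 1 - sum(p_k^2) over the class proportions p_k
        p = np.asarray(counts, dtype=float)
        p /= p.sum()
        return 1.0 - np.sum(p ** 2)

    def split_gini(left_counts, right_counts):
        # weighted average of the child-node impurities
        n_left, n_right = sum(left_counts), sum(right_counts)
        n = n_left + n_right
        return (n_left / n) * gini(left_counts) + (n_right / n) * gini(right_counts)

    print(split_gini([8, 2], [1, 9]))   # purer children -> lower impurity (better split)
    print(split_gini([5, 5], [6, 4]))   # mixed children -> higher impurity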
The deeper the tree is, the higher the chance of overfitting, so we need to stop growing it. But when to stop?
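One common answer is to pre-prune by capping the depth or the minimum samples per leaf; a minimal scikit-learn sketch (the limits 4 and 20 and the dataset are arbitrary assumptions):

    from sklearn.datasets import load_breast_cancer
    from sklearn.model_selection import train_test_split
    from sklearn.tree import DecisionTreeClassifier

    X, y = load_breast_cancer(return_X_y=True)
    X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

    # pre-pruning: stop splitting when the tree gets too deep or leaves get too small
    tree = DecisionTreeClassifier(max_depth=4, min_samples_leaf=20, random_state=0)
    tree.fit(X_train, y_train)
    print(tree.score(X_train, y_train), tree.score(X_test, y_test))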
Let's see how decision trees are used in real models: what we do is use an ensemble of trees.
We do ensemble learning through bagging and boosting.
In bagging we build multiple decision trees with low bias and high variance, then we predict with each and take the average, and this reduces the overall variance, giving low bias and low variance.
In boosting what we do is train one model and predict from it, and then for the next model we use the error of the first model in the learning of the second model; it is an iterative method.
Each resampled data matrix is called a bootstrap sample, and we just take the average of the predictions of the models.
Why it works is because the aggregation of high-variance models gives low variance.
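A minimal sketch of bagging by hand: bootstrap-resample the training set, fit one deep (low-bias, high-variance) tree per sample, and average the predictions; the dataset and number of trees are illustrative assumptions:

    import numpy as np
    from sklearn.datasets import load_breast_cancer
    from sklearn.model_selection import train_test_split
    from sklearn.tree import DecisionTreeClassifier

    X, y = load_breast_cancer(return_X_y=True)
    X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

    rng = np.random.default_rng(0)
    preds = []
    for _ in range(50):
        idx = rng.integers(0, len(X_train), size=len(X_train))    # bootstrap sample (with replacement)
        tree = DecisionTreeClassifier().fit(X_train[idx], y_train[idx])  # deep tree: low bias, high variance
        preds.append(tree.predict(X_test))

    bagged = (np.mean(preds, axis=0) >= 0.5).astype(int)   # average the predictions, then threshold
    print("bagged accuracy:", (bagged == y_test).mean())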
The idea in AdaBoost is to start from weak learners and then go on building up the model complexity; each weak learner is a very high-bias model, and the most famous boosting algorithm is AdaBoost.
In each iteration we get a weak learner and an error-based weight, and the final model is a linear combination of the weights and the learners.
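A minimal scikit-learn sketch; AdaBoostClassifier's default weak learner is a depth-1 decision stump, and the dataset and n_estimators here are just illustrative:

    from sklearn.datasets import load_breast_cancer
    from sklearn.ensemble import AdaBoostClassifier
    from sklearn.model_selection import train_test_split

    X, y = load_breast_cancer(return_X_y=True)
    X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

    # the default weak learner is a depth-1 stump (a very high-bias model);
    # the fitted ensemble is a weighted linear combination of such stumps
    ada = AdaBoostClassifier(n_estimators=100, random_state=0)
    ada.fit(X_train, y_train)
    print(ada.score(X_test, y_test))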
Gradient boosting is called "gradient" because of the way the loss is used: each new learner is fit to the negative gradient of the loss function of the current ensemble.
Balanced accuracy = (TPR + TNR) / 2.
Receiver Operating Characteristic (ROC): the curve of TPR against FPR as the decision threshold varies.
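A minimal sketch computing both metrics with scikit-learn; the labels and scores are made up:

    from sklearn.metrics import balanced_accuracy_score, roc_auc_score

    y_true  = [0, 0, 0, 1, 1, 1, 1, 0]
    y_score = [0.1, 0.4, 0.35, 0.8, 0.65, 0.9, 0.3, 0.2]   # predicted probabilities
    y_pred  = [1 if s >= 0.5 else 0 for s in y_score]

    print(balanced_accuracy_score(y_true, y_pred))  # (TPR + TNR) / 2
    print(roc_auc_score(y_true, y_score))           # area under the ROC curve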
Naive Bayes assumes the independence of the features from one another (given the class).
There is not much training; it just estimates the conditional probabilities (and class priors) from the data, so it is very fast.
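A minimal scikit-learn sketch with GaussianNB; the dataset choice is just illustrative:

    from sklearn.datasets import load_iris
    from sklearn.model_selection import train_test_split
    from sklearn.naive_bayes import GaussianNB

    X, y = load_iris(return_X_y=True)
    X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

    # fitting just estimates per-class feature means/variances and class priors
    nb = GaussianNB().fit(X_train, y_train)
    print(nb.score(X_test, y_test))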
k-nearest neighbours suffers from the curse of dimensionality: as we go to higher dimensions, the number of samples falling inside a fixed cubic region goes on reducing.
As the dimension increases, more and more samples end up at nearly the same distance, where nearest-neighbour search would fail, so it is better to work in low dimensions.
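A small numerical sketch of that distance-concentration effect; the dimensions and sample count are arbitrary:

    import numpy as np

    rng = np.random.default_rng(0)
    for d in [2, 10, 100, 1000]:
        X = rng.uniform(size=(500, d))
        dists = np.linalg.norm(X - X[0], axis=1)[1:]   # distances from one point to all others
        # as d grows, the nearest and farthest distances become nearly equal,
        # so "nearest" neighbour loses its meaning
        print(d, dists.min() / dists.max())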
In the SVM picture, the dotted lines mark the margin; the points lying on them are the support vectors.
If in the current dimension the data is not separable, we use a kernel to transform the data and separate it in that higher dimension.
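A minimal sketch: data that is not linearly separable in 2-D (concentric circles) handled by an RBF-kernel SVM; the dataset and kernel choice are illustrative assumptions:

    from sklearn.datasets import make_circles
    from sklearn.model_selection import train_test_split
    from sklearn.svm import SVC

    X, y = make_circles(n_samples=400, factor=0.3, noise=0.05, random_state=0)
    X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

    linear = SVC(kernel="linear").fit(X_train, y_train)  # struggles: no separating line exists in 2-D
    rbf = SVC(kernel="rbf").fit(X_train, y_train)        # kernel implicitly maps to a higher dimension
    print(linear.score(X_test, y_test), rbf.score(X_test, y_test))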
Max pooling helps prevent overfitting and makes the model invariant to small features or shifts.
Inductive biases are the assumptions we have taken into consideration in training our model; for sequence data we basically feed the data sequentially to the RNN.
An RNN has memory to remember the past.
One-hot encoding: each word/category is represented as a vector with a 1 at its index and 0s elsewhere.
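A tiny sketch of one-hot encoding; the vocabulary here is made up:

    import numpy as np

    vocab = ["cat", "dog", "bird"]                 # made-up vocabulary
    index = {w: i for i, w in enumerate(vocab)}

    def one_hot(word):
        v = np.zeros(len(vocab))
        v[index[word]] = 1.0                       # 1 at the word's position, 0 elsewhere
        return v

    print(one_hot("dog"))   # [0. 1. 0.]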
RNNs suffer from forgetfulness, that is, they lose the information that came very far back in time.
LSTMs come with more parameters and handle longer-range dependencies, but they are more expensive to train.
Bidirectional RNNs say that not only the past is important but the future is also important.
In attention, all the hidden states are kept, unlike a simple network that keeps only the last one, and it is in some way like referring back to the whole input.
Without positional encoding, self-attention will not work, because self-attention by itself has no notion of word order.
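A minimal sketch of the sinusoidal positional encoding from the original Transformer; the sequence length and model dimension are arbitrary:

    import numpy as np

    def positional_encoding(seq_len, d_model):
        # PE[pos, 2i]   = sin(pos / 10000^(2i / d_model))
        # PE[pos, 2i+1] = cos(pos / 10000^(2i / d_model))
        pos = np.arange(seq_len)[:, None]
        i = np.arange(d_model)[None, :]
        angle = pos / np.power(10000.0, (2 * (i // 2)) / d_model)
        return np.where(i % 2 == 0, np.sin(angle), np.cos(angle))

    print(positional_encoding(seq_len=6, d_model=8).shape)   # (6, 8), added to the token embeddings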
BERT works on masking: mask one or two words and then have the model predict the masked words. It is very successful, and we use BERT for transfer learning.
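A minimal sketch of this masked-word prediction with the Hugging Face transformers library, assuming it and the bert-base-uncased checkpoint are available:

    from transformers import pipeline

    # downloads the pretrained BERT checkpoint on first use
    unmasker = pipeline("fill-mask", model="bert-base-uncased")

    # BERT predicts the [MASK] token from its bidirectional context
    for guess in unmasker("Machine learning models learn patterns from [MASK]."):
        print(guess["token_str"], round(guess["score"], 3))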