
20+ Questions to Test your Skills on Logistic Regression

BEGINNER | CAREER | CLASSIFICATION | INTERVIEWS | MACHINE LEARNING | REGRESSION

This article was published as a part of the Data Science Blogathon

Introduction

Logistic Regression is a popular, easy-to-understand statistical model that is mainly used to estimate the probability of an outcome.

Therefore, it is essential for every aspiring Data Scientist and Machine Learning Engineer to have a good knowledge of Logistic Regression.

In this article, we will discuss the most important questions on Logistic Regression, covering everything from the fundamentals to more complex concepts. They will help you build a clear understanding of the technique and prepare for Data Science interviews.

Let’s get started,

1. What do you mean by Logistic Regression?

It’s a classification algorithm used when the target variable is categorical. The main objective of Logistic Regression is to determine the relationship between the features and the probability of a particular outcome.

For Example, suppose we need to predict whether a student passes or fails an exam, given the number of hours spent studying as a feature. Here the target variable takes two values: pass and fail.

Therefore, Logistic Regression lets us solve classification problems, which are a form of supervised machine learning.
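As an illustration, the pass/fail example can be sketched with scikit-learn; the study hours and labels below are made-up toy values:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Hours studied (feature) and pass/fail outcome (target); toy data
X = np.array([[1], [2], [3], [4], [5], [6], [7], [8]])
y = np.array([0, 0, 0, 0, 1, 1, 1, 1])

model = LogisticRegression()
model.fit(X, y)

# Predicted probability of passing after 4.5 hours of study
print(model.predict_proba([[4.5]])[0, 1])
```

The model outputs a probability, which is then thresholded (by default at 0.5) to produce the pass/fail class label.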

2. What are the different types of Logistic Regression?

Three different types of Logistic Regression are as follows:

1. Binary Logistic Regression: In this, the target variable has only two possible outcomes.

For Example, 0 and 1, pass and fail, or true and false.

2. Multinomial Logistic Regression: In this, the target variable can have three or more possible values
without any order.

For Example, Predicting preference of food i.e. Veg, Non-Veg, Vegan.


3. Ordinal Logistic Regression: In this, the target variable can have three or more values with ordering.

For Example, Movie rating from 1 to 5.

3. Explain the intuition behind Logistic Regression in detail.

Given:

From the training dataset, we have the independent variables (x) and the dependent variable (y). If we can determine the parameters w (the normal to the decision boundary) and b (the intercept), then we can find a decision boundary that almost separates both classes in a linear fashion.

Objective:

To train a Logistic Regression model, we just need w and b to find a line (in 2-D), plane (in 3-D), or hyperplane (in more than 3 dimensions) that separates the points of the two classes as well as possible, so that when the model encounters any new, unseen data point, it can easily determine which class that point belongs to.

For Example, let us consider that we have only two features, x1 and x2.

Let’s take any of the +ve class points (figure below) and find the shortest distance from that point to the plane. The shortest distance is computed as:

di = (w^T * xi) / ||w||

If the weight vector is a unit vector, i.e., ||w|| = 1, then:

di = w^T * xi

Since w and xi are on the same side of the decision boundary, this distance will be +ve. For a negative point, we compute dj = w^T * xj. For the point xj, the distance will be -ve, since this point lies on the opposite side of w.

Thus we can conclude that points in the same direction as w are considered +ve points, and points in the opposite direction of w are considered -ve points.

Now we can easily classify unseen data points as +ve or -ve: if w^T * xi > 0, then y = +1, and if w^T * xi < 0, then y = -1.


If yi = +1 and w^T * xi > 0, the classifier classifies it as a +ve point. This implies that if yi * w^T * xi > 0, it is a correctly classified point, because multiplying two +ve numbers always gives a result greater than 0.

If yi = -1 and w^T * xi < 0, the classifier classifies it as a -ve point. This implies that if yi * w^T * xi > 0, it is a correctly classified point, because multiplying two -ve numbers also always gives a result greater than zero. So, for both +ve and -ve points, the value of yi * w^T * xi is greater than 0, and the model classifies the point xi correctly.

If yi = +1 and w^T * xi < 0, i.e., yi is a +ve point but the classifier says it is -ve, then we get a -ve value. The point is classified as -ve while its actual class label is +ve, so it is a misclassified point.

If yi = -1 and w^T * xi > 0, the actual class label is -ve but the point is classified as +ve, so it is a misclassified point (yi * w^T * xi < 0).

From all the cases above, our objective is for the classifier to minimize the misclassification error, i.e., we want the values of yi * w^T * xi to be greater than 0.

In our problem, xi and yi are fixed because they come from the dataset.

As we change the values of the parameters w and b, the sum below changes, and we want to find the w and b that maximize it. To calculate the parameters w and b, we can use the Gradient Descent optimizer. Therefore, the optimization problem for this formulation is:

(w*, b*) = argmax over (w, b) of Σi yi * (w^T * xi + b)
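The sign-based classification rule described above can be sketched in a few lines of NumPy; the weight vector, points, and labels below are made-up values for illustration:

```python
import numpy as np

w = np.array([1.0, -2.0])             # assumed weight vector (normal to the boundary)
points = np.array([[2.0, 0.5],        # a +ve-side point
                   [-1.0, 1.5]])      # a -ve-side point
labels = np.array([+1, -1])           # true labels in {-1, +1}

scores = points @ w                   # w^T * xi for each point
preds = np.where(scores > 0, 1, -1)   # sign rule: +1 if w^T * xi > 0, else -1
margins = labels * scores             # yi * w^T * xi > 0 means correctly classified
print(preds, margins)
```

Both margins come out positive here, so both points are correctly classified under the sign rule.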

4. What are the odds?

Odds are defined as the ratio of the probability of an event occurring to the probability of the event not
occurring.

For Example, let’s assume that the probability of winning a game is 0.02. Then, the probability of not
winning is 1- 0.02 = 0.98.

The odds of winning the game= (Probability of winning)/(probability of not winning)


The odds of winning the game = 0.02/0.98 ≈ 1/49.
So the odds of winning the game are 1 to 49, and the odds of not winning the game are 49 to 1.
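A quick sanity check of this arithmetic in Python:

```python
# Odds = P(event) / P(not event)
p_win = 0.02
odds_win = p_win / (1 - p_win)
print(round(odds_win, 4))    # odds of winning, about 1/49
print(round(1 / odds_win))   # odds against winning: 49 to 1
```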

5. What factors contribute to the popularity of Logistic Regression?

Logistic Regression is popular because it maps the log of odds, which can range from -inf to +inf, to a probability between 0 and 1.

Since the logistic function outputs the probability of occurrence of an event, it can be applied to many real-life scenarios, which is why these models are so popular.
6. Is the decision boundary Linear or Non-linear in the case of a Logistic
Regression model?

The decision boundary is a line or a plane that separates the target classes; in general, it can be either linear or nonlinear. In the case of a Logistic Regression model, the decision boundary is a straight line.

The linear part of the Logistic Regression model is α + β1*X1 + β2*X2 + … + βk*Xk, which clearly represents a straight line (a hyperplane in higher dimensions).

It is suitable in cases where a straight line is able to separate the different classes. In cases where a straight line does not suffice, nonlinear algorithms are used to achieve better results.

7. What is the Impact of Outliers on Logistic Regression?

The estimates of Logistic Regression are sensitive to unusual observations such as outliers, high-leverage points, and influential observations. To reduce the impact of extreme values, the sigmoid function is used in Logistic Regression, since it squashes any real-valued input into the range (0, 1).

8. What is the difference between the outputs of the Logistic model and
the Logistic function?

The Logistic model outputs logits, i.e., log-odds, whereas the Logistic function outputs probabilities.

Logistic model: z = α + β1*X1 + β2*X2 + … + βk*Xk. Therefore, the output of the Logistic model is a logit.

Logistic function: f(z) = 1 / (1 + e^-(α + β1*X1 + β2*X2 + … + βk*Xk)). Therefore, the output of the Logistic function is a probability.
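A small sketch contrasting the two outputs; the coefficients alpha and beta below are arbitrary illustrative values:

```python
import numpy as np

def logit_model(x, alpha, beta):
    # Linear predictor: outputs the log-odds (logit), any real number
    return alpha + np.dot(beta, x)

def logistic_function(z):
    # Sigmoid: maps a logit to a probability in (0, 1)
    return 1.0 / (1.0 + np.exp(-z))

z = logit_model(np.array([2.0, 3.0]), alpha=0.5, beta=np.array([1.0, -0.5]))
print(z)                      # logit: unbounded
print(logistic_function(z))   # probability: between 0 and 1
```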

9. How do we handle categorical variables in Logistic Regression?

The inputs given to a Logistic Regression model need to be numeric. The algorithm cannot handle
categorical variables directly. So, we need to convert the categorical data into a numerical format that is
suitable for the algorithm to process.

Each level of the categorical variable is assigned its own indicator, also known as a dummy variable, which takes the value 0 or 1. These dummy variables are handled by the Logistic Regression model in the same manner as any other numeric value.
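For instance, dummy variables can be created with pandas; the column name and categories below are invented for illustration. Dropping the first level avoids the dummy-variable trap caused by perfect multicollinearity:

```python
import pandas as pd

df = pd.DataFrame({"color": ["red", "green", "blue", "green"]})

# One-hot encode; drop_first=True drops one level ("blue") to avoid
# perfectly collinear indicator columns
dummies = pd.get_dummies(df["color"], drop_first=True)
print(dummies)
```

The resulting 0/1 columns can be fed to a Logistic Regression model like any other numeric features.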

10. Which algorithm is better when outliers are present in the dataset, i.e., Logistic Regression or SVM?

SVM (Support Vector Machines) handles outliers better than Logistic Regression.

Logistic Regression: Logistic Regression will find a linear boundary if one exists, and it will shift that boundary to accommodate outliers.

SVM: SVM is insensitive to individual samples, so accommodating an outlier does not cause a major shift in the linear boundary. SVM also comes with built-in complexity controls that take care of overfitting, which is not true of plain Logistic Regression.

11. What are the assumptions made in Logistic Regression?

Some of the assumptions of Logistic Regression are as follows:

1. It assumes that there is minimal or no multicollinearity among the independent variables, i.e., the predictors are not correlated with each other.

2. There should be a linear relationship between the logit of the outcome and each predictor variable. The logit function is defined as logit(p) = log(p/(1-p)), where p is the probability of the target outcome.

3. It usually requires a large sample size to produce stable estimates.

4. Binary Logistic Regression (two classes) assumes that the target variable is binary, and ordinal Logistic Regression requires the target variable to be ordered.

For Example, Too Little, About Right, Too Much.

5. It assumes that the observations are independent of each other.
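Assumption 1 (minimal multicollinearity) can be checked, for example, by inspecting pairwise correlations between predictors. The synthetic data below deliberately makes x2 nearly collinear with x1:

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
x1 = rng.normal(size=200)
df = pd.DataFrame({
    "x1": x1,
    "x2": 0.95 * x1 + rng.normal(scale=0.1, size=200),  # nearly collinear with x1
    "x3": rng.normal(size=200),                         # independent predictor
})

# |correlation| close to 1 between two predictors flags multicollinearity
print(df.corr().round(2))
```

In practice, variance inflation factors (VIF) are also commonly used for this check; correlations are just the simplest first look.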

12. Can we solve multiclass classification problems using Logistic Regression? If yes, then how?

Yes. To deal with multiclass classification using Logistic Regression, the most common method is the one-vs-all (one-vs-rest) approach. In this approach, we train as many models as there are classes, and each model works in a specific way.

For Example, the first model classifies a data point depending on whether it belongs to class 1 or some other class (not class 1); the second model classifies the data point into class 2 or some other class (not class 2), and so on for all other classes.

So, in this manner, each data point can be checked over all the classes.
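The one-vs-rest scheme is available directly in scikit-learn; here is a minimal sketch on the three-class Iris dataset:

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.multiclass import OneVsRestClassifier

X, y = load_iris(return_X_y=True)   # 3 classes -> 3 binary models

ovr = OneVsRestClassifier(LogisticRegression(max_iter=1000))
ovr.fit(X, y)

# One binary Logistic Regression is trained per class
print(len(ovr.estimators_))
```

At prediction time, each data point is scored by all three binary models, and the class whose model gives the highest score wins.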

13. How can we express the probability of a Logistic Regression model as conditional probability?

We define the probability P(discrete value of target variable | X1, X2, X3, …, Xk) as the probability that the target variable takes a particular discrete value (either 0 or 1 in the case of binary classification problems) given the values of the independent variables.

For Example, the probability that an employee will attrite (target variable) given his attributes such as age, salary, etc.

14. Discuss the space complexity of Logistic Regression.

During training: We need to store four things in memory while training a Logistic Regression model: x, y, w, and b.

Storing b takes O(1) space, since b is a single constant.

x and y are two matrices of dimension (n x d) and (n x 1), respectively, so storing these two matrices takes O(nd + n) space.

Lastly, w is a vector of size d; storing it in memory takes O(d) space.

Therefore, the space complexity of Logistic Regression during training is O(nd + n + d).

During runtime or testing: After training the model, all we need to keep in memory is w (and b). We just need to compute w^T * xi to classify a point.

Hence, the space complexity during runtime is on the order of d, i.e., O(d).

15. Discuss the Test or Runtime complexity of Logistic Regression.

At the end of training, we test our model on unseen data and calculate its accuracy, and at that point, knowing the runtime complexity is very important. After training Logistic Regression, we obtain the parameters w and b.

To classify any new point, we just compute w^T * xi: if w^T * xi > 0, the point is +ve, and if w^T * xi < 0, the point is -ve. As w is a vector of size d, computing w^T * xi takes O(d) steps, as discussed earlier.

Therefore, the testing complexity of Logistic Regression is O(d).

Hence, Logistic Regression is very good for low-latency applications, i.e., applications where the dimension of the data is small.
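A sketch of the O(d) prediction step; the learned weights and the query point below are made-up values:

```python
import numpy as np

w = np.array([0.2, -0.1, 0.4, 0.0, 0.3])  # learned weights (assumed), d = 5
x = np.array([1.0, 2.0, 0.5, 3.0, -1.0])  # new data point to classify

score = w @ x                   # single dot product: O(d) work
label = 1 if score > 0 else -1  # sign rule from Question 3
print(score, label)
```

However many training points there were, classifying one new point costs only this one length-d dot product.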

16. Why is Logistic Regression termed Regression and not classification?

The major difference between regression and classification problem statements is that the target variable in regression is numerical (or continuous), whereas in classification it is categorical (or discrete).

Logistic Regression is basically a supervised classification algorithm. However, Logistic Regression builds a model just like linear regression in order to predict the probability that a given data point belongs to the category labeled “1”, and this underlying regression-style linear model is where the name comes from.

For Example, consider a binary classification problem; let ‘x’ be a feature and ‘y’ be the target outcome, which can be either 0 or 1.

The probability that the target outcome is 1 given the input can be represented as p(x) = P(y = 1 | x).

If we predicted this probability directly with linear regression, we would write p(x) = β0 + β1*x.

Such a linear model generates predictions that can be any number from -inf to +inf, while a probability must lie between 0 and 1. To fix this, the linear equation is put inside the sigmoid function, which is exactly what Logistic Regression does.

17. Discuss the Train complexity of Logistic Regression.

To train a Logistic Regression model, we just need w and b to find a line (in 2-D), plane (in 3-D), or hyperplane (in more than 3 dimensions) that separates the points of the two classes as well as possible, so that when the model encounters any new point, it can easily determine which class the unseen data point belongs to.

The values of w and b should be chosen to maximize the sum of yi * w^T * xi over the training points.

Now, let’s calculate its time complexity in terms of Big O notation:

Computing yi * w^T * xi takes O(d) steps, since w is a vector of size d.

Repeating this step over the n data points while accumulating the sum takes n iterations.

Therefore, the overall time complexity of Logistic Regression during training is n * O(d) = O(nd).

18. Why can’t we use Mean Square Error (MSE) as a cost function for
Logistic Regression?

In Logistic Regression, we use the sigmoid function, a non-linear transformation, to obtain probabilities. If we apply squared error on top of this non-linear transformation, the resulting cost function is non-convex, with local minima, and gradient descent is then not guaranteed to find the global minimum. As a result, MSE is not suitable for Logistic Regression.

So, in the Logistic Regression algorithm, we use cross-entropy, or log loss, as the cost function. The properties of this cost function are:

The confident wrong predictions are penalized heavily


The confident right predictions are rewarded less

By optimizing this cost function, convergence is achieved.
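The asymmetry described above can be seen with a small hand-rolled log loss (not the scikit-learn implementation), using made-up predictions:

```python
import numpy as np

def log_loss(y_true, p):
    # Cross-entropy: confident wrong predictions are penalized heavily
    eps = 1e-15
    p = np.clip(p, eps, 1 - eps)  # avoid log(0)
    return -np.mean(y_true * np.log(p) + (1 - y_true) * np.log(1 - p))

y = np.array([1, 1])
print(log_loss(y, np.array([0.9, 0.9])))   # confident and right: small loss
print(log_loss(y, np.array([0.1, 0.1])))   # confident and wrong: large loss
```

Unlike MSE applied after the sigmoid, this loss is convex in the model parameters, so gradient descent converges to the global minimum.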


19. Why can’t we use Linear Regression in place of Logistic Regression
for Binary classification?

Linear Regression cannot be used for binary classification due to the following reasons:

1. Distribution of error terms: Linear Regression assumes that the error terms are normally distributed, but this assumption does not hold true in the case of binary classification.

2. Model output: In Linear Regression, the output is continuous (numeric), while for binary classification an unbounded continuous output does not make sense. Linear Regression may predict values beyond the range of 0 to 1, but to interpret the output as probabilities of the two classes, its range must be restricted to between 0 and 1. Since the Logistic Regression model outputs probabilities via the logistic (sigmoid) function, it is preferred over Linear Regression.

3. The variance of residual errors: Linear Regression assumes that the variance of the random errors is constant. This assumption also does not hold for binary outcomes, whose variance depends on the predicted probability.
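Reason 2 can be seen directly by fitting plain Linear Regression to binary labels (toy data below) and predicting outside the training range:

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Toy binary data: feature x, labels in {0, 1}
X = np.array([[1], [2], [3], [4]])
y = np.array([0, 0, 1, 1])

lin = LinearRegression().fit(X, y)

# Predictions escape [0, 1], so they cannot be probabilities
print(lin.predict([[10]]))   # well above 1
print(lin.predict([[-5]]))   # well below 0
```

Logistic Regression avoids this by passing the same linear score through the sigmoid, which confines the output to (0, 1).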

20. What are the advantages of Logistic Regression?

The advantages of the logistic regression are as follows:

1. Logistic Regression is very easy to understand.

2. It requires little training time.

3. It performs well on simple datasets as well as when the data is linearly separable.

4. It doesn’t make any assumptions about the distributions of classes in feature space.

5. A Logistic Regression model is less likely to be over-fitted, although it can overfit on high-dimensional datasets. To avoid overfitting in these scenarios, one may consider regularization.

6. It is easy to implement and interpret, and very efficient to train.

21. What are the disadvantages of Logistic Regression?

The disadvantages of the logistic regression are as follows:

1. Sometimes a lot of feature engineering is required.

2. If the independent features are correlated with each other, it may affect the performance of the classifier.

3. It is quite sensitive to noise and overfitting.

4. Logistic Regression should not be used if the number of observations is less than the number of features; otherwise, it may lead to overfitting.

5. Non-linear problems can’t be solved with Logistic Regression because it has a linear decision surface, and in real-world scenarios linearly separable data is rarely found.

6. It is tough to capture complex relationships using Logistic Regression. More powerful and compact algorithms, such as neural networks, can easily outperform it.

7. In Linear Regression, the independent and dependent variables are linearly related, but in Logistic Regression, the independent variables are linearly related to the log odds, log(p/(1-p)).

End Notes

Thanks for reading!

I hope you enjoyed the questions and were able to test your knowledge about Logistic Regression.

If you liked this and want to know more, go visit my other articles on Data Science and Machine Learning
by clicking on the Link

Please feel free to contact me on Linkedin, Email.

Something not mentioned or want to share your thoughts? Feel free to comment below And I’ll get back to
you.

About the author

Chirag Goyal

Currently, I am pursuing my Bachelor of Technology (B.Tech) in Computer Science and Engineering from
the Indian Institute of Technology Jodhpur(IITJ). I am very enthusiastic about Machine learning, Deep
Learning, and Artificial Intelligence.

The media shown in this article are not owned by Analytics Vidhya and are used at the Author’s discretion.

Article Url - https://www.analyticsvidhya.com/blog/2021/05/20-questions-to-test-your-skills-on-logistic-regression/
