Machine Learning MCQ Guide
Machine Learning MCQ Guide
Answer: A
2) Bayesian classifiers is
Answer: A
3) Algorithm is
Answer: B
4) Bias is
Answer: B
Answer: A
6) Case-based learning is
Answer: C
7) Classification is
Answer: A
Answer: A
9) Classification accuracy is
Answer: B
Answer: B
11) Cluster is
Answer: A
Answer: C
A) Complete
B) Consistent
C) Constant
D) None of these
Answer: A
Answer: A
A) Complete
B) Consistent
C) Constant
D) None of these
Answer: B
Answer: B
Answer: C
Answer: C
19) Hybrid is
Answer: A
20) Discovery is
Answer: C
Answer: C
23) Enrichment is
Answer: A
Answer: A
Answer: B
26) Heuristic is
Answer: B
Answer: A
Answer: B
Answer: B
Answer: A
Answer: C
Answer: C
Answer: A
35) Learning is
Answer: C
Answer: C
Answer: A
Answer: C
39) Node is
A) A component of a network
B) In the context of KDD and data mining, this refers to random
errors in a database table.
C) One of the defining aspects of a data warehouse
D) None of these
Answer: A
Answer: B
Answer: C
Answer: A
Answer: B
Answer: B
45) Prediction is
Answer: A
Answer: C
Answer: B
(a) Regression
(b) Classification
(c) Clustering
(d) inference of associative rules
(e) All (a), (b), (c) and (d) above.
Answer: E
Explanation: Regression, Classification and Clustering are the
data mining tasks.
Answer: A
Explanation: In a data warehouse, if D1 and D2 are two
conformed dimensions, then D1 may be an exact replica of D2.
(a) Informatica
(b) Oracle warehouse builder
(c) Datastage
(d) Visual studio
(e) DT/studio.
Answer: D
Explanation: Visual Studio is not an ETL tool.
i) Data streams
ii) Sequence data
iii) Networked data
iv) Text data
v) Spatial data
A) Data Characterization
B) Data Classification
C) Data discrimination
D) Data selection
A) cost-sensitive
B) work-sensitive
C) time-sensitive
D) technical-sensitive
Answer: C) time-sensitive
A) Data Characterization
B) Data Classification
C) Data discrimination
D) Data selection
A) i, ii and iv only
B) ii, iii and iv only
C) i, ii and iii only
D) All i, ii, iii and iv
A) Knowledge Database
B) Knowledge Discovery Database
C) Knowledge Data House
D) Knowledge Data Definition
A) Data
B) Information
C) Query
D) Useful information
A) Data Mining
B) Data Warehousing
C) Document Mining
D) Text Mining
A) OLAP
B) OLTP
C) SMTP
D) FTP
Answer: B) OLTP
64) An .................. system is market-oriented and is used
for data analysis by knowledge workers, including managers,
executives, and analysts.
A) OLAP
B) OLTP
C) Both of the above
D) None of the above
Answer: A) OLAP
A) Star schema
B) Snowflake schema
C) Fact constellation
D) Star-snowflake schema
A) top-down view
B) data warehouse view
C) data source view
D) business query view
A) many to many
B) one to one
C) one to many
D) many to one
A) top-down view
B) data warehouse view
C) data source view
D) business query view
A) Metadata
B) Current detail data
C) Lightly summarized data
D) Component Key
A) Information processing
B) Analytical processing
C) Data mining
D) Transaction processing
A) DBMS
B) RDBMS
C) Sybase
D) SQL Server
Answer:B) RDBMS
A) Information processing
B) Analytical processing
C) Data mining
D) Transaction processing
A) Multidimensional cube
B) Dimensions cube
C) Data cube
D) Data model
A) Forecasting
B) Data Mining
C) Analysis of large volumes of product sales data
D) All of the above
A) normalized
B) informational
C) summary
D) denormalized
Answer: C) summary
A) Hardware
B) Software
C) End users
D) Middle ware
A) flexibility
B) quantify
C) qualify
D) ability
Answer: A) flexibility
A) Operational database
B) Relational database
C) Multidimensional database
D) Data repository
Answer: B
Explanation: Data access tools to be used when deciding on the
data structure of a data mart.
82) The process of removing the deficiencies and loopholes in
the data is called as
Answer: C
Explanation: The process of removing the deficiencies and
loopholes in the data is called as cleaning up of data.
(a) OLTP
(b) OLAP
(c) Spread sheet
(d) XML
(e) All (a), (b), (c) and (d) above.
Answer: B
Explanation: Online Analytical Processing (OLAP) manages both
current and historic transactions.
(a) Partitioning
(b) Grid
(c) Cluster
(d) Table
(e) Data source.
Answer: C
Explanation: Cluster is the collection of data objects that are
similar to one another within the same group.
Answer: A
Explanation: KDD Process includes data cleaning, data
integration, data selection, data transformation, data mining,
pattern evolution, and knowledge presentation.
Answer: B
Explanation: Dimensional models can be created at Architecture
models level.
(a) Verbose
(b) Descriptive
(c) Equally unavailable
(d) Complete
(e) Indexed.
Answer: C
Explanation: Equally unavailable is not related to dimension
table attributes.
89) Data warehouse bus matrix is a combination of
Answer: A
Explanation: Data warehouse bus matrix is a combination of
Dimensions and data marts.
Answer: E
Explanation: Ensure that the transaction edit flat is used for
analysis is not the managing issue in the modeling process.
Answer: A
Explanation: Data modeling technique used for data marts is
Dimensional modeling.
Answer: C
Explanation: An OLAP tool provides for Slicing and dicing.
Answer: C
Explanation: The synonym for data mining is Knowledge discovery
in Database.
Answer: D
Explanation: The fact table of a data warehouse is the main
store of all of the recorded transactions over time is the
correct statement.
Answer: A
Explanation: The Most common kind of queries in a data
warehouse is Inside-out queries.
Answer: B
Explanation: Concept description is the basis form of the
descriptive data mining.
(a) If a set cannot pass a test, all of its supersets will fail
the same test as well
(b) To improve the efficiency the level-wise generation of
frequent item sets
(c) If a set can pass a test, all of its supersets will fail
the same test as well
(d) To decrease the efficiency the level-wise generation of
frequent item sets
(e) All (a), (b), (c) and (d) above.
Answer: B
Explanation: The apriori property means to improve the
efficiency the level-wise generation of frequent item sets.
Answer: D
Explanation: Disposable Data Marts is the form the set of data
created to support a specific short lived business situation.
I. Administrative.
II. Business.
III. Operational.
Answer: E
Explanation: The different types of Meta data are
Administrative, Business and Operational.
Answer: D
Explanation: Multiple Regression means extension of linear
regression involving more than one predicator value.
Answer: B
Explanation: Rapid changing dimension policy should not be
considered for each dimension attribute.
Answer: A
Explanation: A business Intelligence system requires data from
Data warehouse
(a) Biomedical
(b) DNA data analysis
(c) Financial data analysis
(d) Retail industry and telecommunication industry
(e) All (a), (b), (c) and (d) above.
Answer: E
Explanation: Data mining application domains are Biomedical,
DNA data analysis, Financial data analysis and Retail industry
and telecommunication industry
Answer: A
Explanation: The generalization of multidimensional attributes
of a complex object class can be performed by examining each
attribute, generalizing each attribute to simple-value data and
constructing a multidimensional data cube is called as object
cube.
Answer: A
Explanation: High risk high reward project is a building a data
mart for a business process/department that is very critical for
your organization
Answer: A
Explanation: Business intelligence system will have OLAP, Data
mining and reporting tolls.
Solution: (B)
A) PCA
B) K-Means
Solution: (A)
A) TRUE
B) FALSE
Solution: (A)
Y=X2. Note that, they are not only associated, but one is a
function of the other and Pearson correlation between them is 0.
A) Only 1
B) Only 2
C) Only 3
D) 1 and 2
E) 2 and 3
F) 1,2 and 3
Solution: (A)
1. Number of Trees
2. Depth of Tree
3. Learning Rate
A) Only 1
B) Only 2
C) Only 3
D) 1 and 2
E) 2 and 3
F) 1,2 and 3
Solution: (B)
6) Imagine, you are working with “Analytics Vidhya” and you want
to develop a machine learning algorithm which predicts the
number of views on the articles.
Your analysis is based on features like author name, number of
articles written by the same author on Analytics Vidhya in past
and a few other features. Which of the following evaluation
metric would you choose in that case?
2. Accuracy
3. F1 Score
A) Only 1
B) Only 2
C) Only 3
D) 1 and 3
E) 2 and 3
F) 1 and 2
Solution:(A)
B)
C)
A) 1 is tanh, 2 is ReLU and 3 is SIGMOID activation functions.
Solution: (D)
[0,0,0,1,1,1,1,1]
Solution: (A)
So the answer is A.
D) Both A and B
E) None of these
Solution: (D)
Both are true, The OHE will fail to encode the categories which
is present in test but not in train so it could be one of the
main challenges while applying OHE. The challenge given in
option B is also true you need to more careful while applying
OHE if frequency distribution doesn’t same in train and test.
10) Skip gram model is one of the best models used in Word2vec
algorithm for words embedding. Which one of the following models
depict the skip gram model?
A) A
B) B
C) Both A and B
D) None of these
Solution: (B)
A) ReLU
B) tanh
C) SIGMOID
D) None of these
Solution: (B)
A) TRUE
B) FALSE
Solution: (B)
A) Only 1
B) Only 2
C) Only 3
D) 1 and 2
E) 1 and 3
F) 2 and 3
Solution: (E)
In statistical hypothesis testing, a type I error is the
incorrect rejection of a true null hypothesis (a “false
positive”), while a type II error is incorrectly retaining a
false null hypothesis (a “false negative”).
1. Stemming
3. Object Standardization
A) 1 and 2
B) 1 and 3
C) 2 and 3
D) 1,2 and 3
Solution: (D)
Stop words are those words which will have not relevant to the
context of the data for example is/am/are.
15) Suppose you want to project high dimensional data into lower
dimensions. The two most famous dimensionality reduction
algorithms used here are PCA and t-SNE. Let’s say you have
applied both algorithms respectively on data “X” and you got the
datasets “X_projected_PCA” , “X_projected_tSNE”.
Solution: (B)
Context: 16-17
Given below are three scatter plots for two features (Image 1, 2
& 3 from left to right).
A) Features in Image 1
B) Features in Image 2
C) Features in Image 3
Solution: (D)
A) Only 1
B)Only 2
C) Only 3
D) Either 1 or 3
E) Either 2 or 3
Solution: (E)
You cannot remove the both features because after removing the
both features you will lose all of the information so you
should either remove the only 1 feature or you can use the
regularization algorithm like L1 and L2.
18) Adding a non-important feature to a linear regression model
may result in.
1. Increase in R-square
2. Decrease in R-square
A) Only 1 is correct
B) Only 2 is correct
C) Either 1 or 2
D) None of these
Solution: (A)
E) D1 = C1, D2 = C2, D3 = C3
F) Cannot be determined
Solution: (E)
Correlation between the features won’t change if you add or
subtract a value in the features.
Your model has 99% accuracy after taking the predictions on test
data. Which of the following is true in such a case?
A) 1 and 3
B) 1 and 4
C) 2 and 3
D) 2 and 4
Solution: (A)
A) 1 and 2
B) 1 and 3
C) 2 and 3
D) Only 1
E) Only 2
Solution: (A)
A) 1 and 2
B) 2 and 3
C) 1 and 3
D) 1,2 and 3
Solution: (D)
D) Can’t estimate
Solution: (D)
A) 1000-1500 second
B) 1500-3000 Second
D) None of these
Solution: (D)
H TE VE
1 105 90
2 200 85
3 250 96
4 105 85
5 300 100
A) 1
B) 2
C) 3
D) 4
E) 5
Solution: (D)
26) What would you do in PCA to get the same projection as SVD?
C) Not possible
D) None of these
Solution: (A)
When the data has a zero mean vector PCA will have same
projections as SVD, otherwise you have to centre the data first
before taking SVD.
You can also think that this black box algorithm is same as 1-NN
(1-nearest neighbor).
B) FALSE
Solution: (A)
28) Instead of using 1-NN black box we want to use the j-NN
(j>1) algorithm as black box. Which of the following option is
correct for finding k-NN using j-NN?
2. J > k
3. Not possible
A) 1
B) 2
C) 3
Solution: (A)
29) Suppose you are given 7 Scatter plots 1-7 (left to right)
and you want to compare Pearson correlation coefficients between
variables of each scatterplot.
2. 1>2>3 > 4
3. 7<6<5<4
4. 7>6>5>4
A) 1 and 3
B) 2 and 3
C) 1 and 4
D) 2 and 4
Solution: (B)
1.
If a classifier is confident about an incorrect
classification, then log-loss will penalise it heavily.
A) 1 and 3
B) 2 and 3
C) 1 and 2
D) 1,2 and 3
Solution: (D)
Question 31-32
A) 0
D) 0.4
C) 0.8
D) 1
Solution: (C)
A) 1NN
B) 3NN
C) 4NN
Solution: (A)
33) Suppose you are given the below data and you want to apply a
logistic regression model for classifying it in two given
classes.
You are using logistic regression with L1 regularization.
Where C is the
regularization parameter and w1 & w2 are the coefficients of x1
and x2.
Solution: (B)
Note: All other hyper parameters are same and other factors are
not affected.
A) Only 1
B) Only 2
C) Both 1 and 2
Solution: (A)
A) 2 and 3
B) 1 and 3
C) 1 and 2
D) All of above
Solution: (D)
1. Accuracy is ~0.91
A) 1 and 3
B) 2 and 4
C) 1 and 4
D) 2 and 3
Solution: (C)
The true Positive Rate is how many times you are predicting
positive class correctly so true positive rate would be 100/105
= 0.95 also known as “Sensitivity” or “Recall”
2. Depth of tree
A)1 and 2
B) 2 and 3
C) 1 and 3
D) 1, 2 and 3
E) Can’t say
Solution: (E)
Context 38-39
Imagine, you have a 28 * 28 image and you run a 3 * 3
convolution neural network on it with the input depth of 3 and
output depth of 8.
38) What is the dimension of output feature map when you are
using the given parameters.
Solution: (A)
39) What is the dimensions of output feature map when you are
using following parameters.
Solution: (B)
Same as above
40) Suppose, we were plotting the visualization for different
values of C (Penalty parameter) in SVM algorithm. Due to some
reason, we forgot to tag the C values with visualizations. In
that case, which of the following option best explains the C
values for the images below (1,2,3 left to right, so C values
are C1 for image1, C2 for image2 and C3 for image3 ) in case of
rbf kernel.
A) C1 = C2 = C3
B) C1 > C2 > C3
C) C1 < C2 < C3
D) None of these
Solution: (C)
2. The most widely used metrics and tools to assess a classification model is:
A. Confusion matrix
B. Cost-sensitive accuracy
C. Area under the ROC curve
D. All of these
ANSWER: D
6. Statistical significance is
A. The science of collecting, ogranizing and applying numerical facts
B. Measure of the probability that a certain hypothesis is incorrect given certain
observations
C. One of the defining aspects of a data warehouse, which is specially built around
all the existing applicatons of the operational data
D. None of these
ANSWER: B
7. Which of the folllowing is an example of feature extraction?
A. Constructing bag of words vector from an email
B. Applying PCA projects to a large high-dimensional data
C. Removing stopwords in a sentence
D. All of these
ANSWER: D
8. How can you prevent a clustering algorithm from getting stuck in bad local optima?
A. Set the same seed value for each run
B. Use multiple random initializations
C. Both A and B
D. None of these
ANSWER: B
12. Classification is
A. Subdivision of a set of examples into a number of classes
B. Measure of the accuracy, of the classification of a concept that is given by a
certain theory
C. The task of assigning a classification to a set of examples
D. None of these
ANSWER: A
14. Cluster is
A. Group of similar objects that differ significantly from other objects
B. Operations on a database to transform or simplify data in order to prepare it for
a machine-learning algorithm
C. Symbolic representation of facts or ideas from which information can potentially
be extracted
D. None of these
ANSWER: A
15. Suppose you are given an EM algorithm that finds maximum likelihood estimates for
a model with latent variables. You are asked to modify the algorithm so that it finds MAP
estimates instead. Which step or steps do you need to modify?
A. Expectation
B. Maximization
C. No modification necessary
D. Both A & B
ANSWER: B
16. Compared to the variance of the Maximum Likelihood Estimate (MLE), the variance
of the Maximum A Posteriori (MAP) estimate is ________
A. Higher
B. Same
C. Lower
D. It could be any of the above
ANSWER: C
19. Predicting on whether will it rain or not tomorrow evening at a particular time
is a type of _________ problem.
A. Classification
B. Regression
C. Unsupervised learning
D. All o these
ANSWER: A
21. A feature F1 can take certain value: A, B, C, D, E, & F and represents grade of
students from a college.Which of the following statement is true in following case?
A. Feature F1 is an example of nominal variable.
B. Feature F1 is an example of ordinal variable.
C. It doesn’t belong to any of the above category.
D. Both of A & B
ANSWER: B
22. If your training loss increases with number of epochs, which of the following could
be a possible issue with the learning process?
A. Regularization is too low and model is overfitting
B. Regularization is too high and model is underfitting
C. Step size is too large
D. Step size is too small
ANSWER: C
23. Given a large dataset of medical records from patients suffering from heart disease,
try to learn whether there might be different clusters of such patients for which we might
tailor separate treatments. What kind of learning problem is this?
A. Supervised learning
B. Unsupervised learning
C. Both A and B
D. None of these
ANSWER: B
26. Classifying email as a spam, labeling webpages based on their content, voice
recognition are the example of _____.
A. Supervised learning
B. Unsupervised learning
C. Machine learning
D. Deep learning
ANSWER: A
27. Deep learning is a subfield of machine learning where concerned algorithms are
inspired by the structured and function of the brain called _____.
A. Machine learning
B. Artificial neural networks
C. Deep learning
D. Robotics
ANSWER: B
29. When the number of output classes is greater than one, there are main possibilities
to manage a classification problem:
A. One-vs-all, One-vs-one
B. One-vs-one, Many-vs-one
C. One-vs-many, Many-vs-one
D. None of these
ANSWER: A
30. For a neural network, which one of these structural assumptions is the one that
most affects the trade-off between underfitting (i.e. a high bias model) and overfitting
(i.e. a high variance model):
A. The learning rate
B. The number of hidden nodes
C. The initial choice of weights
D. The use of a constant-term unit input
ANSWER: B
31. ___________ refers to a model that can neither model the training data nor
generalize to new data.
A. Good fitting
B. Overfitting
C. Underfitting
D. All of the these
ANSWER: C
32. Given two Boolean random variables, A and B, where P(A) = 1/2, P(B) = 1/3, and P(A
| ¬B) = 1/4, what is P(A | B)?
A. 1/6
B. 1/4
C. 3/4
D. 1
ANSWER: D
33. Suppose your model is overfitting. Which of the following is NOT a valid way to
try and reduce the overfitting?
A. Increase the amount of training data
B. Improve the optimization algorithm being used for error minimization
C. Decrease the model complexity
D. Reduce the noise in the training data
ANSWER: B
34. Predicting on whether will it rain or not tomorrow evening at a particular time
is a type of _________ problem.
A. Classification
B. Regression
C. Unsupervised learning
D. All of these
ANSWER: A
35. Given a large dataset of medical records from patients suffering from heart disease,
try to learn whether there might be different clusters of such patients for which we might
tailor separate treatments. What kind of learning problem is this?
A. Supervised learning
B. Unsupervised learning
C. Both A and B
D. Neither A nor B
ANSWER: B
36. Given a large dataset of medical records from patients suffering from heart disease,
try to learn whether there might be different clusters of such patients for which we might
tailor separate treatments. What kind of learning problem is this?
A. Supervised learning
B. Unsupervised learning
C. Both A and B
D. Neither A nor B
ANSWER: B
46. Which of the following is wrong statement about the maximum likelihood approach?
A. This method doesn’t always involve probability calculations
B. It finds a tree that best accounts for the variation in a set of sequences
C. The method is similar to the maximum parsimony method
D. The analysis is performed on each column of a multiple sequence alignment
ANSWER: A
47. The main disadvantage of maximum likelihood methods is that they are _____
A. Mathematically less folded
B. Mathematically less complex
C. Computationally lucid
D. Computationally intense
ANSWER: B
A) TRUE
B) FALSE
Solution: (A)
Yes, Linear regression is a supervised learning algorithm because it uses true labels for
training. Supervised learning algorithm should have input variable (x) and an output variable
(Y) for each example.
A) TRUE
B) FALSE
Solution: (A)
A) TRUE
B) FALSE
Solution: (A)
4) Which of the following methods do we use to find the best fit line for data in Linear
Regression?
C2 General
A) Least Square Error
B) Maximum Likelihood
C) Logarithmic Loss
D) Both A and B
Solution: (A)
In linear regression, we try to minimize the least square errors of the model to identify the
line of best fit.
5) Which of the following evaluation metrics can be used to evaluate a model while
modeling a continuous output variable?
A) AUC-ROC
B) Accuracy
C) Logloss
D) Mean-Squared-Error
Solution: (D)
Since linear regression gives output as continuous values, so in such case we use mean
squared error metric to evaluate the model performance. Remaining options are use in case
of a classification problem.
A) TRUE
B) FALSE
Solution: (A)
True, In case of lasso regression we apply absolute penalty which makes some of the
coefficients zero.
A) Lower is better
B) Higher is better
C) A or B depend on the situation
D) None of these
Solution: (A)
C2 General
Residuals refer to the error values of the model. Therefore lower residuals are desired.
8) Suppose that we have N independent variables (X1,X2… Xn) and dependent variable is
Y. Now Imagine that you are applying linear regression by fitting the best fit line using least
square error on this data.
You found that correlation coefficient for one of it’s variable(Say X1) with Y is -0.95.
Solution: (B)
The absolute value of the correlation coefficient denotes the strength of the relationship.
Since absolute correlation is very high it means that the relationship is strong between X1
and Y.
9) Looking at above two characteristics, which of the following option is the correct
for Pearson correlation between V1 and V2?
If you are given the two variables V1 and V2 and they are following below two
characteristics.
Solution: (D)
C2 General
10) Suppose Pearson correlation between V1 and V2 is zero. In such case, is it right
to conclude that V1 and V2 do not have any relation between them?
A) TRUE
B) FALSE
Solution: (B)
Pearson correlation coefficient between 2 variables might be zero even when they have a
relationship between them. If the correlation coefficient is zero, it just means that that they
don’t move together. We can take examples like y=|x| or y=x^2.
11) Which of the following offsets, do we use in linear regression’s least square line
fit? Suppose horizontal axis is independent variable and vertical axis is dependent
variable.
A) Vertical offset
B) Perpendicular offset
C) Both, depending on the situation
D) None of above
Solution: (A)
12) True- False: Overfitting is more likely when you have huge amount of data to
train?
C2 General
A) TRUE
B) FALSE
Solution: (B)
With a small training dataset, it’s easier to find a hypothesis to fit the training data exactly
i.e. overfitting.
13) We can also compute the coefficient of linear regression with the help of an
analytical method called “Normal Equation”. Which of the following is/are true about
Normal Equation?
A) 1 and 2
B) 1 and 3
C) 2 and 3
D) 1,2 and 3
Solution: (D)
Instead of gradient descent, Normal Equation can also be used to find coefficients. Refer
this article for read more about normal equation.
14) Which of the following statement is true about sum of residuals of A and B?
Below graphs show two fitted regression lines (A & B) on randomly generated data. Now, I
want to find the sum of residuals in both cases A and B.
Note:
C2 General
A) A has higher sum of residuals than B
B) A has lower sum of residual than B
C) Both have same sum of residuals
D) None of these
Solution: (C)
Sum of residuals will always be zero, therefore both have same sum of residuals
Suppose you have fitted a complex regression model on a dataset. Now, you are using
Ridge regression with penality x.
Solution: (B)
If the penalty is very large it means model is less complex, therefore the bias would be high.
16) What will happen when you apply very large penalty?
C2 General
Solution: (B)
In lasso some of the coefficient value become zero, but in case of Ridge, the coefficients
become close to zero but not zero.
17) What will happen when you apply very large penalty in case of Lasso?
A) Some of the coefficient will become zero
B) Some of the coefficient will be approaching to zero but not absolute zero
C) Both A and B depending on the situation
D) None of these
Solution: (A)
As already discussed, lasso applies absolute penalty, so some of the coefficients will
become zero.
18) Which of the following statement is true about outliers in Linear regression?
Solution: (A)
The slope of the regression line will change due to outliers in most of the cases. So Linear
Regression is sensitive to outliers.
19) Suppose you plotted a scatter plot between the residuals and predicted values in
linear regression and you found that there is a relationship between them. Which of
the following conclusion do you make about this situation?
Solution: (A)
C2 General
There should not be any relationship between predicted values and residuals. If there exists
any relationship between them,it means that the model has not perfectly captured the
information in the data.
Suppose that you have a dataset D1 and you design a linear regression model of degree 3
polynomial and you found that the training and testing error is “0” or in another terms it
perfectly fits the data.
20) What will happen when you fit degree 4 polynomial in linear regression?
A) There are high chances that degree 4 polynomial will over fit the data
B) There are high chances that degree 4 polynomial will under fit the data
C) Can’t say
D) None of these
Solution: (A)
Since is more degree 4 will be more complex(overfit the data) than the degree 3 model so it
will again perfectly fit the data. In such case training error will be zero but test error may not
be zero.
21) What will happen when you fit degree 2 polynomial in linear regression?
A) It is high chances that degree 2 polynomial will over fit the data
B) It is high chances that degree 2 polynomial will under fit the data
C) Can’t say
D) None of these
Solution: (B)
If a degree 3 polynomial fits the data perfectly, it’s highly likely that a simpler model(degree
2 polynomial) might under fit the data.
22) In terms of bias and variance. Which of the following is true when you fit degree 2
polynomial?
C2 General
C) Bias will be high, variance will be low
D) Bias will be low, variance will be low
Solution: (C)
Since a degree 2 polynomial will be less complex as compared to degree 3, the bias will be
high and variance will be low.
Which of the following is true about below graphs(A,B, C left to right) between the cost
function and Number of iterations?
23) Suppose l1, l2 and l3 are the three learning rates for A,B,C respectively. Which of
the following is true about l1,l2 and l3?
A) l2 < l1 < l3
B) l1 > l2 > l3
C) l1 = l2 = l3
D) None of these
Solution: (A)
In case of high learning rate, step will be high, the objective function will decrease quickly
initially, but it will not find the global minima and objective function starts increasing after a
few iterations.
In case of low learning rate, the step will be small. So the objective function will decrease
slowly
C2 General
Question Context 24-25:
We have been given a dataset with n records in which we have input attribute as x and
output attribute as y. Suppose we use a linear regression method to model this data. To test
our linear regressor, we split the data in training set and test set randomly.
24) Now we increase the training set size gradually. As the training set size increases,
what do you expect will happen with the mean training error?
A) Increase
B) Decrease
C) Remain constant
D) Can’t Say
Solution: (D)
Training error may increase or decrease depending on the values that are used to fit the
model. If the values used to train contain more outliers gradually, then the error might just
increase.
25) What do you expect will happen with bias and variance as you increase the size
of training data?
Solution: (D)
As we increase the size of the training data, the bias would increase while the variance
would decrease.
Consider the following data where one input(X) and one output(Y) is given.
C2 General
26) What would be the root mean square training error for this data if you run a
Linear Regression model of the form (Y = A0+A1X)?
A) Less than 0
B) Greater than zero
C) Equal to 0
D) None of these
Solution: (C)
We can perfectly fit the line on the following data so mean error will be zero.
Suppose you have been given the following scenario for training and validation error for
Linear Regression.
Number of Validation
Scenario Learning Rate Training Error
iterations Error
1 0.1 1000 100 110
2 0.2 600 90 105
3 0.3 400 110 110
4 0.4 300 120 130
5 0.4 250 130 150
C2 General
27) Which of the following scenario would give you the right hyper parameter?
A) 1
B) 2
C) 3
D) 4
Solution: (B)
Option B would be the better option because it leads to less training as well as validation
error.
28) Suppose you got the tuned hyper parameters from the previous question. Now,
Imagine you want to add a variable in variable space such that this added feature is
important. Which of the following thing would you observe in such case?
Solution: (D)
If the added feature is important, the training and validation error would decrease.
Suppose, you got a situation where you find that your linear regression model is under
fitting the data.
29) In such situation which of the following options would you consider?
A) 1 and 2
B) 2 and 3
C) 1 and 3
D) 1, 2 and 3
C2 General
Solution: (A)
In case of under fitting, you need to induce more variables in variable space or you can add
some polynomial degree variables to make the model more complex to be able to fir the
data better.
A) L1
B) L2
C) Any
D) None of these
Solution: (D)
I won’t use any regularization methods because regularization is used in case of overfitting.
A) TRUE
B) FALSE
Solution: A
True, Logistic regression is a supervised learning algorithm because it uses true labels for
training. Supervised learning algorithm should have input variables (x) and an target
variable (Y) when you train the model .
A) TRUE
B) FALSE
Solution: B
Logistic regression is a classification algorithm, don’t confuse with the name regression.
C2 General
3) True-False: Is it possible to design a logistic regression algorithm using a Neural
Network Algorithm?
A) TRUE
B) FALSE
Solution: A
A) TRUE
B) FALSE
Solution: A
Yes, we can apply logistic regression on 3 classification problem, We can use One Vs all
method for 3 class classification in logistic regression.
5) Which of the following methods do we use to best fit the data in Logistic
Regression?
Solution: B
Logistic regression uses maximum likely hood estimate for training a logistic regression.
6) Which of the following evaluation metrics can not be applied in case of logistic
regression output to compare with target?
C2 General
A) AUC-ROC
B) Accuracy
C) Logloss
D) Mean-Squared-Error
Solution: D
Since, Logistic Regression is a classification algorithm so it’s output can not be real time
value so mean squared error can not use for evaluating it
7) One of the very good methods to analyze the performance of Logistic Regression
is AIC, which is similar to R-Squared in Linear Regression. Which of the following is
true about AIC?
Solution: A
We select the best model in logistic regression which can least AIC. For more information
refer this source: http://www4.ncsu.edu/~shu3/Presentation/AIC.pdf
A) TRUE
B) FALSE
Solution: B
Standardization isn’t required for logistic regression. The main goal of standardizing
features is to help convergence of the technique used for optimization.
A) LASSO
B) Ridge
C2 General
C) Both
D) None of these
Solution: A
Context: 10-11
Consider a following model for logistic regression: P (y =1|x, w)= g(w0 + w1x)
where g(z) is the logistic function.
In the above equation the P (y =1|x; w) , viewed as a function of x, that we can get by
changing the parameters w.
A) (0, inf)
B) (-inf, 0 )
C) (0, 1)
D) (-inf, inf)
Solution: C
For values of x in the range of real number from −∞ to +∞ Logistic function will give the
output between (0,1)
11) In above question what do you think which function would make p between (0,1)?
A) logistic function
B) Log likelihood function
C) Mixture of both
D) None of them
Solution: A
C2 General
Context: 12-13
Suppose you train a logistic regression classifier and your hypothesis function H is
12) Which of the following figure will represent the decision boundary as given by
above classifier?
A)
B)
C)
D)
C2 General
Solution: B
Option B would be the right answer. Since our line will be represented by y = g(-6+x2) which
is shown in the option A and option B. But option B is the right answer because when you
put the value x2 = 6 in the equation then y = g(0) you will get that means y= 0.5 will be on
the line, if you increase the value of x2 greater then 6 you will get negative values so output
will be the region y =0.
13) If you replace coefficient of x1 with x2 what would be the output figure?
A)
B)
C)
D)
C2 General
Solution: D
14) Suppose you have been given a fair coin and you want to find out the odds of
getting heads. Which of the following option is true for such a case?
A) odds will be 0
B) odds will be 0.5
C) odds will be 1
D) None of these
Solution: C
Odds are defined as the ratio of the probability of success and the probability of failure. So
in case of fair coin probability of success is 1/2 and the probability of failure is 1/2 so odd
would be 1
15) The logit function(given as l(x)) is the log of odds function. What could be the
range of logit function in the domain x=[0,1]?
A) (– ∞ , ∞)
B) (0,1)
C) (0, ∞)
D) (- ∞, 0)
Solution: A
For our purposes, the odds function has the advantage of transforming the probability
function, which has values from 0 to 1, into an equivalent function with values between 0
and ∞. When we take the natural log of the odds function, we get a range of values from -∞
to ∞.
C2 General
16) Which of the following option is true?
A) Linear Regression errors values has to be normally distributed but in case of Logistic
Regression it is not the case
B) Logistic Regression errors values has to be normally distributed but in case of Linear
Regression it is not the case
C) Both Linear Regression and Logistic Regression error values have to be normally
distributed
D) Both Linear Regression and Logistic Regression error values have not to be normally
distributed
Solution:A
Only A is true.
17) Which of the following is true regarding the logistic function for any value “x”?
Note:
Logistic(x): is a logistic function of any number “x”
A) Logistic(x) = Logit(x)
B) Logistic(x) = Logit_inv(x)
C) Logit_inv(x) = Logit(x)
D) None of these
Solution: B
C2 General
MCQ For UNIT 2
Solution: (B)
Ordinal variables are the variables which has some order in their categories. For example,
grade A should be consider as high grade than grade B.
A) PCA
B) K-Means
Solution: (A)
A deterministic algorithm is that in which output does not change on different runs. PCA
would give the same result if we run again, but not k-means.
3) [True or False] A Pearson correlation between two variables is zero but, still their
values can still be related to each other.
A) TRUE
B) FALSE
Solution: (A)
Y=X2. Note that, they are not only associated, but one is a function of the other and
Pearson correlation between them is 0.
4) Which of the following statement(s) is / are true for Gradient Decent (GD) and
Stochastic Gradient Decent (SGD)?
2. In SGD, you have to run through all the samples in your training set for a
single update of a parameter in each iteration.
3. In GD, you either use the entire data or a subset of training data to update a
parameter in each iteration.
A) Only 1
B) Only 2
C) Only 3
D) 1 and 2
E) 2 and 3
F) 1,2 and 3
Solution: (A)
In SGD for each iteration you choose the batch which is generally contain the random
sample of data But in case of GD each iteration contain the all of the training observations.
5) Which of the following hyper parameter(s), when increased may cause random
forest to over fit the data?
1. Number of Trees
2. Depth of Tree
3. Learning Rate
A) Only 1
B) Only 2
C) Only 3
D) 1 and 2
E) 2 and 3
F) 1,2 and 3
Solution: (B)
Usually, if we increase the depth of tree it will cause overfitting. Learning rate is not an
hyperparameter in random forest. Increase in the number of tree will cause under fitting.
6) Imagine, you are working with “Analytics Vidhya” and you want to develop a
machine learning algorithm which predicts the number of views on the articles.
Your analysis is based on features like author name, number of articles written by the
same author on Analytics Vidhya in past and a few other features. Which of the
following evaluation metric would you choose in that case?
2. Accuracy
3. F1 Score
A) Only 1
B) Only 2
C) Only 3
D) 1 and 3
E) 2 and 3
F) 1 and 2
Solution:(A)
You can think that the number of views of articles is the continuous target variable which fall
under the regression problem. So, mean squared error will be used as an evaluation
metrics.
7) Given below are three images (1,2,3). Which of the following option is correct for
these images?
A)
B)
C)
A) 1 is tanh, 2 is ReLU and 3 is SIGMOID activation functions.
Solution: (D)
8) Below are the 8 actual values of target variable in the train file.
[0,0,0,1,1,1,1,1]
Solution: (A)
So the answer is A.
9) Let’s say, you are working with categorical feature(s) and you have not looked at
the distribution of the categorical variable in the test data.
You want to apply one hot encoding (OHE) on the categorical feature(s). What
challenges you may face if you have applied OHE on a categorical variable of train
dataset?
A) All categories of categorical variable are not present in the test dataset.
D) Both A and B
E) None of these
Solution: (D)
Both are true, The OHE will fail to encode the categories which is present in test but not in
train so it could be one of the main challenges while applying OHE. The challenge given in
option B is also true you need to more careful while applying OHE if frequency distribution
doesn’t same in train and test.
10) Skip gram model is one of the best models used in Word2vec algorithm for words
embedding. Which one of the following models depict the skip gram model?
A) A
B) B
C) Both A and B
D) None of these
Solution: (B)
Both models (model1 and model2) are used in Word2vec algorithm. The model1 represent
a CBOW model where as Model2 represent the Skip gram model.
11) Let’s say, you are using activation function X in hidden layers of neural network.
At a particular neuron for any given input, you get the output as “-0.0001”. Which of
the following activation function could X represent?
A) ReLU
B) tanh
C) SIGMOID
D) None of these
Solution: (B)
The function is a tanh because the this function output range is between (-1,-1).
12) [True or False] LogLoss evaluation metric can have negative values.
A) TRUE
B) FALSE
Solution: (B)
13) Which of the following statements is/are true about “Type-1” and “Type-2” errors?
3. Type1 error occurs when we reject a null hypothesis when it is actually true.
A) Only 1
B) Only 2
C) Only 3
D) 1 and 2
E) 1 and 3
F) 2 and 3
Solution: (E)
In statistical hypothesis testing, a type I error is the incorrect rejection of a true null
hypothesis (a “false positive”), while a type II error is incorrectly retaining a false null
hypothesis (a “false negative”).
14) Which of the following is/are one of the important step(s) to pre-process the text
in NLP based projects?
1. Stemming
3. Object Standardization
A) 1 and 2
B) 1 and 3
C) 2 and 3
D) 1,2 and 3
Solution: (D)
Stemming is a rudimentary rule-based process of stripping the suffixes (“ing”, “ly”, “es”, “s”
etc) from a word.
Stop words are those words which will have not relevant to the context of the data for
example is/am/are.
Object Standardization is also one of the good way to pre-process the text.
15) Suppose you want to project high dimensional data into lower dimensions. The
two most famous dimensionality reduction algorithms used here are PCA and t-SNE.
Let’s say you have applied both algorithms respectively on data “X” and you got the
datasets “X_projected_PCA” , “X_projected_tSNE”.
t-SNE algorithm consider nearest neighbour points to reduce the dimensionality of the data.
So, after using t-SNE we can think that reduced dimensions will also have interpretation in
nearest neighbour space. But in case of PCA it is not the case.
Context: 16-17
Given below are three scatter plots for two features (Image 1, 2 & 3 from left to right).
16) In the above images, which of the following is/are example of multi-collinear
features?
A) Features in Image 1
B) Features in Image 2
C) Features in Image 3
Solution: (D)
In Image 1, features have high positive correlation where as in Image 2 has high negative
correlation between the features so in both images pair of features are the example of
multicollinear features.
17) In previous question, suppose you have identified multi-collinear features. Which
of the following action(s) would you perform next?
A) Only 1
B)Only 2
C) Only 3
D) Either 1 or 3
E) Either 2 or 3
Solution: (E)
You cannot remove the both features because after removing the both features you will
lose all of the information so you should either remove the only 1 feature or you can use the
regularization algorithm like L1 and L2.
18) Adding a non-important feature to a linear regression model may result in.
1. Increase in R-square
2. Decrease in R-square
A) Only 1 is correct
B) Only 2 is correct
C) Either 1 or 2
D) None of these
Solution: (A)
After adding a feature in feature space, whether that feature is important or unimportant
features the R-squared always increase.
19) Suppose, you are given three variables X, Y and Z. The Pearson correlation
coefficients for (X, Y), (Y, Z) and (X, Z) are C1, C2 & C3 respectively.
Now, you have added 2 in all values of X (i.enew values become X+2), subtracted 2
from all values of Y (i.e. new values are Y-2) and Z remains the same. The new
coefficients for (X,Y), (Y,Z) and (X,Z) are given by D1, D2 & D3 respectively. How do
the values of D1, D2 & D3 relate to C1, C2 & C3?
E) D1 = C1, D2 = C2, D3 = C3
F) Cannot be determined
Solution: (E)
Correlation between the features won’t change if you add or subtract a value in the
features.
20) Imagine, you are solving a classification problems with highly imbalanced class.
The majority class is observed 99% of times in the training data.
Your model has 99% accuracy after taking the predictions on test data. Which of the
following is true in such a case?
3. Precision and recall metrics are good for imbalanced class problems.
4. Precision and recall metrics aren’t good for imbalanced class problems.
A) 1 and 3
B) 1 and 4
C) 2 and 3
D) 2 and 4
Solution: (A)
21) In ensemble learning, you aggregate the predictions for weak learners, so that an
ensemble of these models will give a better prediction than prediction of individual
models.
Which of the following statements is / are true for weak learners used in ensemble
model?
2. They have high bias, so they cannot solve complex learning problems
A) 1 and 2
B) 1 and 3
C) 2 and 3
D) Only 1
E) Only 2
Solution: (A)
Weak learners are sure about particular part of a problem. So, they usually don’t overfit
which means that weak learners have low variance and high bias.
22) Which of the following options is/are true for K-fold cross-validation?
1. Increase in K will result in higher time required to cross validate the result.
3. If K=N, then it is called Leave one out cross validation, where N is the number
of observations.
A) 1 and 2
B) 2 and 3
C) 1 and 3
D) 1,2 and 3
Solution: (D)
Larger k value means less bias towards overestimating the true expected error (as training
folds will be closer to the total dataset) and higher running time (as you are getting closer to
the limit case: Leave-One-Out CV). We also need to consider the variance between the k
folds accuracy while selecting the k.
Time taken by an algorithm for training (on a model with max_depth 2) 4-fold is 10
seconds and for the prediction on remaining 1-fold is 2 seconds.
23) Which of the following option is true for overall execution time for 5-fold cross
validation with 10 different values of “max_depth”?
D) Can’t estimate
Solution: (D)
Each iteration for depth “2” in 5-fold cross validation will take 10 secs for training and 2
second for testing. So, 5 folds will take 12*5 = 60 seconds. Since we are searching over the
10 depth values so the algorithm would take 60*10 = 600 seconds. But training and testing
a model on depth greater than 2 will take more time than depth “2” so overall timing would
be greater than 600.
24) In previous question, if you train the same algorithm for tuning 2 hyper
parameters say “max_depth” and “learning_rate”.
You want to select the right value against “max_depth” (from given 10 depth values)
and learning rate (from given 5 different learning rates). In such cases, which of the
following will represent the overall time?
A) 1000-1500 second
B) 1500-3000 Second
D) None of these
Solution: (D)
25) Given below is a scenario for training error TE and Validation error VE for a
machine learning algorithm M1. You want to choose a hyperparameter (H) based on
TE and VE.
H TE VE
1 105 90
2 200 85
3 250 96
4 105 85
5 300 100
Which value of H will you choose based on the above table?
A) 1
B) 2
C) 3
D) 4
E) 5
Solution: (D)
26) What would you do in PCA to get the same projection as SVD?
C) Not possible
D) None of these
Solution: (A)
When the data has a zero mean vector PCA will have same projections as SVD, otherwise
you have to centre the data first before taking SVD.
Question Context 27-28
Assume there is a black box algorithm, which takes training data with multiple
observations (t1, t2, t3,…….. tn) and a new observation (q1). The black box outputs
the nearest neighbor of q1 (say ti) and its corresponding class label ci.
You can also think that this black box algorithm is same as 1-NN (1-nearest
neighbor).
27) It is possible to construct a k-NN classification algorithm based on this black box
alone.
A) TRUE
B) FALSE
Solution: (A)
In first step, you pass an observation (q1) in the black box algorithm so this algorithm would
return a nearest observation and its class.
In second step, you through it out nearest observation from train data and again input the
observation (q1). The black box algorithm will again return the a nearest observation and it’s
class.
28) Instead of using 1-NN black box we want to use the j-NN (j>1) algorithm as black
box. Which of the following option is correct for finding k-NN using j-NN?
2. J > k
3. Not possible
A) 1
B) 2
C) 3
Solution: (A)
29) Suppose you are given 7 Scatter plots 1-7 (left to right) and you want to compare
Pearson correlation coefficients between variables of each scatterplot.
1. 1<2<3<4
2. 1>2>3 > 4
3. 7<6<5<4
4. 7>6>5>4
A) 1 and 3
B) 2 and 3
C) 1 and 4
D) 2 and 4
Solution: (B)
from image 1to 4 correlation is decreasing (absolute value). But from image 4 to 7
correlation is increasing but values are negative (for example, 0, -0.3, -0.7, -0.99).
30) You can evaluate the performance of a binary class classification problem using
different metrics such as accuracy, log-loss, F-Score. Let’s say, you are using the
log-loss function as evaluation metric.
2. For a particular observation, the classifier assigns a very small probability for the
correct class then the corresponding contribution to the log-loss will be very large.
A) 1 and 3
B) 2 and 3
C) 1 and 2
D) 1,2 and 3
Solution: (D)
Question 31-32
Note: Visual distance between the points in the image represents the actual distance.
D) 0.4
C) 0.8
D) 1
Solution: (C)
In Leave-One-Out cross validation, we will select (n-1) observations for training and 1
observation of validation. Consider each point as a cross validation point and then find the 3
nearest point to this point. So if you repeat this procedure for all points you will get the
correct classification for all positive class given in the above figure but negative class will be
misclassified. Hence you will get 80% accuracy.
32) Which of the following value of K will have least leave-one-out cross validation
accuracy?
A) 1NN
B) 3NN
C) 4NN
Solution: (A)
Each point which will always be misclassified in 1-NN which means that you will get the 0%
accuracy.
33) Suppose you are given the below data and you want to apply a logistic regression
model for classifying it in two given classes.
You are using logistic regression with L1 regularization.
Which of the following option is correct when you increase the value of C from zero to a
very large value?
Solution: (B)
By looking at the image, we see that even on just using x2, we can efficiently perform
classification. So at first w1 will become 0. As regularization parameter increases more, w2
will come more and more closer to 0.
34) Suppose we have a dataset which can be trained with 100% accuracy with help of
a decision tree of depth 6. Now consider the points below and choose the option
based on these points.
Note: All other hyper parameters are same and other factors are not affected.
A) Only 1
B) Only 2
C) Both 1 and 2
Solution: (A)
If you fit decision tree of depth 4 in such data means it will more likely to underfit the data.
So, in case of underfitting you will have high bias and low variance.
35) Which of the following options can be used to get global minima in k-Means
Algorithm?
A) 2 and 3
B) 1 and 3
C) 1 and 2
D) All of above
Solution: (D)
36) Imagine you are working on a project which is a binary classification problem.
You trained a model on training dataset and get the below confusion matrix on
validation dataset.
Based on the above confusion matrix, choose which option(s) below will give you
correct predictions?
1. Accuracy is ~0.91
A) 1 and 3
B) 2 and 4
C) 1 and 4
D) 2 and 3
Solution: (C)
The true Positive Rate is how many times you are predicting positive class correctly so true
positive rate would be 100/105 = 0.95 also known as “Sensitivity” or “Recall”
37) For which of the following hyperparameters, higher value is better for decision
tree algorithm?
2. Depth of tree
A)1 and 2
B) 2 and 3
C) 1 and 3
D) 1, 2 and 3
E) Can’t say
Solution: (E)
For all three options A, B and C, it is not necessary that if you increase the value of
parameter the performance may increase. For example, if we have a very high value of
depth of tree, the resulting tree may overfit the data, and would not generalize well. On the
other hand, if we have a very low value, the tree may underfit the data. So, we can’t say for
sure that “higher is better”.
Context 38-39
Imagine, you have a 28 * 28 image and you run a 3 * 3 convolution neural network on
it with the input depth of 3 and output depth of 8.
Solution: (A)
39) What is the dimensions of output feature map when you are using following
parameters.
Solution: (B)
Same as above
40) Suppose, we were plotting the visualization for different values of C (Penalty
parameter) in SVM algorithm. Due to some reason, we forgot to tag the C values with
visualizations. In that case, which of the following option best explains the C values
for the images below (1,2,3 left to right, so C values are C1 for image1, C2 for image2
and C3 for image3 ) in case of rbf kernel.
A) C1 = C2 = C3
B) C1 > C2 > C3
C) C1 < C2 < C3
D) None of these
Solution: (C)
MCQ questions for unit 4: Naïve Bayes and Support Vector Machine
Solution: B
Generalisation error in statistics is generally the out-of-sample error which is the measure
of how accurately a model can predict values for previously unseen data.
13. The minimum time complexity for training an SVM is O(n2). According to this fact, what
sizes of datasets are not best suited for SVM’s?
A) Large datasets
B) Small datasets
C) Medium sized datasets
D) Size does not matter
Solution: A
Datasets which have a clear classification boundary will function best with SVM’s.
A) Selection of Kernel
B) Kernel Parameters
C) Soft Margin Parameter C
D) All of the above
Solution: D
The SVM effectiveness depends upon how you choose the basic 3 requirements mentioned
above in such a way that it maximises your efficiency, reduces error and overfitting.
Solution: C
When the data has noise and overlapping points, there is a problem in drawing a clear hyperplane
without misclassifying.
15. Suppose you are using RBF kernel in SVM with high Gamma value. What does this
signify?
A) The model would consider even far away points from hyperplane for modeling
B) The model would consider only the points close to the hyperplane for modeling
C) The model would not be affected by distance of points from hyperplane for modeling
D) None of the above
Solution: B
The gamma parameter in SVM tuning signifies the influence of points either near or far away
from the hyperplane.
For a low gamma, the model will be too constrained and include all points of the training dataset,
without really capturing the shape.
For a higher gamma, the model will capture the shape of the dataset well.
Solution: C
The cost parameter decides how much an SVM should be allowed to “bend” with the data. For a
low cost, you aim for a smooth decision surface and for a higher cost, you aim to classify more
points correctly. It is also simply referred to as the cost of misclassification.
17. Which of the following are real world applications of the SVM?
Solution: D
SVM’s are highly versatile models that can be used for practically all real world problems
ranging from regression to clustering and handwriting recognitions.
18. We usually use feature normalization before using the Gaussian kernel in SVM. What is
true about feature normalization?
A) 1
B) 1 and 2
C) 1 and 3
D) 2 and 3
Solution: B
Statements one and two are correct.
A) 1
B) 2
C) 1 and 2
D) None of these
Solution: C
1. Which of the following is a widely used and effective machine learning algorithm based on
the idea of bagging?
a. Decision Tree
b. Regression
c. Classification
d. Random Forest - answer
2. To find the minimum or the maximum of a function, we set the gradient to zero because:
a. The value of the gradient at extrema of a function is always zero - answer
b. Depends on the type of problem
c. Both A and B
d. None of the above
3. The most widely used metrics and tools to assess a classification model are:
a. Confusion matrix
b. Cost-sensitive accuracy
c. Area under the ROC curve
d. All of the above - answer
4. Which of the following is a good test dataset characteristic?
a. Large enough to yield meaningful results
b. Is representative of the dataset as a whole
c. Both A and B - answer
d. None of the above
5. Which of the following is a disadvantage of decision trees?
a. Factor analysis
b. Decision trees are robust to outliers
c. Decision trees are prone to be overfit - answer
d. None of the above
6. How do you handle missing or corrupted data in a dataset?
a. Drop missing rows or columns
b. Replace missing values with mean/median/mode
c. Assign a unique category to missing values
d. All of the above - answer
7. What is the purpose of performing cross-validation?
a. To assess the predictive performance of the models
b. To judge how the trained model performs outside the sample on test data
c. Both A and B - answer
8. Why is second order differencing in time series needed?
a. To remove stationarity
b. To find the maxima or minima at the local point
c. Both A and B - answer
d. None of the above
9. When performing regression or classification, which of the following is the correct way to
preprocess the data?
a. Normalize the data → PCA → training - answer
b. PCA → normalize PCA output → training
c. Normalize the data → PCA → normalize PCA output → training
d. None of the above
10. Which of the folllowing is an example of feature extraction?
a. Constructing bag of words vector from an email
b. Applying PCA projects to a large high-dimensional data
c. Removing stopwords in a sentence
d. All of the above - answer
11. What is pca.components_ in Sklearn?
a. Set of all eigen vectors for the projection space - answer
b. Matrix of principal components
c. Result of the multiplication matrix
d. None of the above options
12. Which of the following is true about Naive Bayes ?
a. Assumes that all the features in a dataset are equally important
b. Assumes that all the features in a dataset are independent
c. Both A and B - answer
d. None of the above options
13. Which of the following statements about regularization is not correct?
a. Using too large a value of lambda can cause your hypothesis to underfit the data.
b. Using too large a value of lambda can cause your hypothesis to overfit the data.
c. Using a very large value of lambda cannot hurt the performance of your hypothesis.
d. None of the above - answer
14. How can you prevent a clustering algorithm from getting stuck in bad local optima?
a. Set the same seed value for each run
b. Use multiple random initializations - answer
c. Both A and B
d. None of the above
15. Which of the following techniques can be used for normalization in text mining?
a. Stemming
b. Lemmatization
c. Stop Word Removal
d. Both A and B - answer
16. In which of the following cases will K-means clustering fail to give good results? 1) Data
points with outliers 2) Data points with different densities 3) Data points with nonconvex
shapes
a. 1 and 2
b. 2 and 3
c. 1, 2, and 3 - answer
d. 1 and 3
17. Which of the following is a reasonable way to select the number of principal components
"k"?
a. Choose k to be the smallest value so that at least 99% of the varinace is retained. -
answer
b. Choose k to be 99% of m (k = 0.99*m, rounded to the nearest integer).
c. Choose k to be the largest value so that 99% of the variance is retained.
d. Use the elbow method
18. You run gradient descent for 15 iterations with a=0.3 and compute J(theta) after each
iteration. You find that the value of J(Theta) decreases quickly and then levels off. Based on
this, which of the following conclusions seems most plausible?
a. Rather than using the current value of a, use a larger value of a (say a=1.0)
b. Rather than using the current value of a, use a smaller value of a (say a=0.1)
c. a=0.3 is an effective choice of learning rate- answer
d. None of the above
19. What is a sentence parser typically used for?
a. It is used to parse sentences to check if they are utf-8 compliant.
b. It is used to parse sentences to derive their most likely syntax tree structures. -
answer
c. It is used to parse sentences to assign POS tags to all tokens.
d. It is used to check if sentences can be parsed into meaningful tokens.
20. Suppose you have trained a logistic regression classifier and it outputs a new example x
with a prediction ho(x) = 0.2. This means
a. Our estimate for P(y=1 | x)
b. Our estimate for P(y=0 | x) - answer
c. Our estimate for P(y=1 | x)
d. Our estimate for P(y=0 | x)
1) If you remove the following any one red points from the data. Does the
decision boundary will change?
A) Yes
B) No
Solution: A
These three examples are positioned such that removing any one of them introduces slack
in the constraints. So the decision boundary would completely change.
21. [True or False] If you remove the non-red circled points from the data, the decision
boundary will change?
A) True
B) False
Solution: B
On the other hand, rest of the points in the data won’t affect the decision boundary much.
23. When the C parameter is set to infinite, which of the following holds true?
A) The optimal hyperplane if exists, will be the one that completely separates the data
B) The soft-margin classifier will separate the data
C) None of the above
Solution: A
At such a high level of misclassification penalty, soft margin will not hold existence as
there will be no room for error.
25. The minimum time complexity for training an SVM is O(n2). According to this fact, what
sizes of datasets are not best suited for SVM’s?
A) Large datasets
B) Small datasets
C) Medium sized datasets
D) Size does not matter
Solution: A
Datasets which have a clear classification boundary will function best with SVM’s.
29. Suppose you are using RBF kernel in SVM with high Gamma value. What does this
signify?
A) The model would consider even far away points from hyperplane for modeling
B) The model would consider only the points close to the hyperplane for modeling
C) The model would not be affected by distance of points from hyperplane for
modeling
D) None of the above
Solution: B
The gamma parameter in SVM tuning signifies the influence of points either near or far
away from the hyperplane
For a low gamma, the model will be too constrained and include all points of the training
dataset, without really capturing the shape.
For a higher gamma, the model will capture the shape of the dataset well.
31. 12)Suppose you are building a SVM model on data X. The data X can be error prone
which means that you should not trust any specific data point too much. Now think that
you want to build a SVM model which has quadratic kernel function of polynomial
degree 2 that uses Slack variable C as one of it’s hyper parameter. Based upon that give
the answer for following question.
What would happen when you use very large value of C(C->infinity)?
Note: For small C was also classifying all data points correctly
A) We can still classify data correctly for given setting of hyper parameter C
B) We can not classify data correctly for given setting of hyper parameter C
C) Can’t Say
D) None of these
Solution: A
For large values of C, the penalty for misclassifying points is very high, so the decision
boundary will perfectly separate the data if possible.
32. What would happen when you use very small C (C~0)?
A) Misclassification would happen
B) Data will be correctly classified
C) Can’t say
D) None of these
Solution: A
The classifier can maximize the margin between most of the points, while misclassifying
a few points, because the penalty is so low.
33. If I am using all features of my dataset and I achieve 100% accuracy on my training set,
but ~70% on validation set, what should I look out for?
A) Underfitting
B) Nothing, the model is perfect
C) Overfitting
Solution: C
If we’re achieving 100% training accuracy very easily, we need to check to verify if
we’re overfitting our data.
34. Which of the following are real world applications of the SVM?
A) Text and Hypertext Categorization
B) Image Classification
C) Clustering of News Articles
D) All of the above
Solution: D
SVM’s are highly versatile models that can be used for practically all real world problems
ranging from regression to clustering and handwriting recognitions.
Question Context: 16 – 18
Suppose you have trained an SVM with linear decision boundary after training SVM, you
correctly infer that your SVM model is under fitting.
35. Which of the following option would you more likely to consider iterating SVM next
time?
A) You want to increase your data points
B) You want to decrease your data points
C) You will try to calculate more variables
D) You will try to reduce the features
Solution: C
The best option here would be to create more features for the model.
36. Suppose you gave the correct answer in previous question. What do you think that is
actually happening?
1.We are lowering the bias
2. We are lowering the variance
3. We are increasing the bias
4. We are increasing the variance
A) 1 and 2
B) 2 and 3
C) 1 and 4
D) 2 and 4
Solution: C
Better model will lower the bias and increase the variance
37. In above question suppose you want to change one of it’s(SVM) hyperparameter so that
effect would be same as previous questions i.e model will not under fit?
A) We will increase the parameter C
B) We will decrease the parameter C
C) Changing in C don’t effect
D) None of these
Solution: A
Increasing C parameter would be the right thing to do here, as it will ensure regularized
model
38. We usually use feature normalization before using the Gaussian kernel in SVM. What is
true about feature normalization?
1.We do feature normalization so that new feature will dominate other
2. Some times, feature normalization is not feasible in case of categorical variables
3. Feature normalization always helps when we use Gaussian kernel in SVM
A) 1
B) 1 and 2
C) 1 and 3
D) 2 and 3
Solution: B
Statements one and two are correct.
40. Suppose you have same distribution of classes in the data. Now, say for training 1 time in
one vs all setting the SVM is taking 10 second. How many seconds would it require to
train one-vs-all method end to end?
A) 20
B) 40
C) 60
D) 80
Solution: B
It would take 10×4 = 40 seconds
41. Suppose your problem has changed now. Now, data has only 2 classes. What would you
think how many times we need to train SVM in such case?
A) 1
B) 2
C) 3
D) 4
Solution: A
Training the SVM only one time would give you appropriate results
1. Support Vector Machine works well with,
a) Linear Scenarios
b) Non-linear Scenarios
c) Both of these
d) None of these
3. Two classes separated by a margin with two boundaries are called as,
a) Linear Vectors
b) Support Vectors
c) Test Vectors
d) None of these
8. To find out a trade-off between precision and number of support vectors, scikit-learn provides
an implementation called as,
a) NuSVC
b) BuSVC
c) MuSVC
d) AuSVC
Answer: a) NuSVC
a)
b)
c)
d) None of these
Answer: a)
a)
b)
c)
d) None of these
Answer: b)
11. The sigmoid kernel is based on this function:
a)
b)
c)
d) None of these
Answer: c)
a) 1
b) 2
c) Both 1 and 2
d) None of these
Answer: a) Discriminative
Answer: b) Find the optimal separating hyperplane which maximizes the margin of training data.
a)
b)
c)
d) None of these
Answer: a)
Answer: a) SVM algorithms use a set of mathematical functions that are defined as the kernel
9. Probability provides a way of summarizing the ______ that comes from our laziness and
ignorances.
A. Belief
B. Uncertaintity
C. Joint probability distributions
D. Randomness
ANSWER: B
10. The entries in the full joint probability distribution can be calculated as
A. Using variables
B. Both Using variables & information
C. Using information
D. All of the above
ANSWER: C
12. Naïve Bayes algorithm is based on _______ and used for solving classification problems.
A. Bayes Theorem
B. Candidate elimination algorithm
C. EM algorithm
D. None of the above
ANSWER: A
19. Support vectors are the data points that lie closest to the decision surface.
A. TRUE
B. FALSE
ANSWER: A
22. Which of the following are real world applications of the SVM?
A. Text and Hypertext Categorization
B. Image Classification
C. Clustering of News Articles
D. All of the above
ANSWER:D
23. Gaussian naive Bayes is useful when working with continuous values whose probabilities
can be modeled using a Gaussian distribution
A. Bernoulli
B. multinomial
C. Gaussian
D. All of above
ANSWER: C
24. A multinomial distribution is useful to model feature vectors where each value
represents,the number of occurrences of a term or its relative frequency
A. Bernoulli
B. multinomial
C. Gaussian
D. All of above
ANSWER: B
26. The two classes are normally separated by a margin with two boundaries where a few
elements lie. Those elements are called
A. principal componants
B. support vectors
C. factors
D. None
ANSWER: B
27. What is/are true about kernel in SVM? 1. Kernel function map low dimensional data to
high dimensional space. 2.It’s a similarity function
A. 1
B. 2
C. 1 and 2
D. None of these
ANSWER: C
28. Support vector machine (SVM) is a _________ classifier
A. Descrinative
B. Generative
ANSWER: A
30. The training examples closest to the separating hyperplane are called as _______
A. Training vector
B. Testing Vector
C. Support margin
D. Support vector
ANSWER:D
33. When using R, which of the following package is used for SVM?
A. b1072
B. c1071
C. d2012
D. e1071
ANSWER:D
35. Which of the following might be valid reasons for preferring an SVM over a neural
network?
A. An SVM can automatically learn to apply a non-linear transformation on the input space;
a neural net cannot.
B. An SVM can effectively map the data to an infinite-dimensional space; a neural net
cannot.
C. An SVM should not get stuck in local minima, unlike a neural net.
D. The transformed (basis function) representation constructed by an SVM is usually
easier to visualise/interpret than for a neural net.
ANSWER: B,C
36. You are given a labeled binary classification data set with N data points and D features.
Suppose that N < D. In training an SVM on this data set, which of the following kernels
is likely to be most appropriate?
A. Linear kernel
B. Quadratic kernel
C. Higher-order polynomial kernel
D. RBF kernel
ANSWER: A
UNIT I
1. What is classification?
a) when the output variable is a category, such as “red” or “blue” or “disease” and “no
disease”.
b) when the output variable is a real value, such as “dollars” or “weight”.
Ans: Solution A
2. What is regression?
a) When the output variable is a category, such as “red” or “blue” or “disease” and “no
disease”.
b) When the output variable is a real value, such as “dollars” or “weight”.
Ans: Solution B
Ans: Solution B
Ans: Solution A
Ans: Solution D
6. What is Reinforcement learning?
a) All data is unlabelled and the algorithms learn to inherent structure from the input data
b) All data is labelled and the algorithms learn to predict the output from the input data
c) It is a framework for learning where an agent interacts with an environment and receives
a reward for each interaction
d) Some data is labelled but most of it is unlabelled and a mixture of supervised and
unsupervised techniques can be used.
Ans: Solution C
Regression,
Classification
Clustering
Reinforcement Learning
Options:
A. 1 Only
B. 1 and 2
C. 1 and 3
D. 1, 2 and 4
Ans : Solution D
Ans : Solution C
Ans : Solution B
11. Supervised learning and unsupervised clustering both require at least one
a) hidden attribute.
b) output attribute.
c) input attribute.
d) categorical attribute.
Ans : Solution A
12. Supervised learning differs from unsupervised clustering in that supervised learning requires
a) at least one input attribute.
b) input attributes to be categorical.
c) at least one output attribute.
d) output attributes to be categorical.
Ans : Solution B
13. A regression model in which more than one independent variable is used to predict the
dependent variable is called
a) a simple linear regression model
b) a multiple regression models
c) an independent model
d) none of the above
Ans : Solution C
14. A term used to describe the case when the independent variables in a multiple regression model
are correlated is
a) Regression
b) correlation
c) multicollinearity
d) none of the above
Ans : Solution C
15. A multiple regression model has the form: y = 2 + 3x1 + 4x2. As x1 increases by 1 unit (holding x2
constant), y will
a) increase by 3 units
b) decrease by 3 units
c) increase by 4 units
d) decrease by 4 units
Ans : Solution C
Ans : Solution B
17. A measure of goodness of fit for the estimated regression equation is the
a) multiple coefficient of determination
b) mean square due to error
c) mean square due to regression
d) none of the above
Ans : Solution C
Ans : Solution D
Ans : Solution C
20. For a multiple regression model, SST = 200 and SSE = 50. The multiple coefficient of
determination is
a) 0.25
b) 4.00
c) 0.75
d) none of the above
Ans : Solution B
Ans : Solution B
Ans : Solution B
Ans : Solution C
Ans : Solution D
26. Which statement is true about neural network and linear regression models?
a) Both models require input attributes to be numeric.
b) Both models require numeric attributes to range between 0 and 1.
c) The output of both models is a categorical attribute value.
d) Both techniques build models whose output is determined by a linear sum of weighted
input attribute values.
Ans : Solution A
Ans : Solution A
28. The average positive difference between computed and desired outcome values.
a) root mean squared error
b) mean squared error
c) mean absolute error
d) mean positive error
Ans : Solution D
29. Selecting data so as to assure that each class is properly represented in both the training and
test set.
a) cross validation
b) stratification
c) verification
d) bootstrapping
Ans : Solution B
30. The standard error is defined as the square root of this computation.
a) The sample variance divided by the total number of sample instances.
b) The population variance divided by the total number of sample instances.
c) The sample variance divided by the sample mean.
d) The population variance divided by the sample mean.
Ans : Solution A
31. Data used to optimize the parameter settings of a supervised learner model.
a) Training
b) Test
c) Verification
d) Validation
Ans : Solution D
Ans : Solution A
33. The correlation between the number of years an employee has worked for a company and the
salary of the employee is 0.75. What can be said about employee salary and years worked?
a) There is no relationship between salary and years worked.
b) Individuals that have worked for the company the longest have higher salaries.
c) Individuals that have worked for the company the longest have lower salaries.
d) The majority of employees have been with the company a long time.
e) The majority of employees have been with the company a short period of time.
Ans : Solution B
34. The correlation coefficient for two real-valued attributes is –0.85. What does this value tell you?
a) The attributes are not linearly related.
b) As the value of one attribute increases the value of the second attribute also increases.
c) As the value of one attribute decreases the value of the second attribute increases.
d) The attributes show a curvilinear relationship.
Ans : Solution C
35. The average squared difference between classifier predicted output and actual output.
a) mean squared error
b) root mean squared error
c) mean absolute error
d) mean relative error
Ans : Solution A
36. Simple regression assumes a __________ relationship between the input attribute and output
attribute.
a) Linear
b) Quadratic
c) reciprocal
d) inverse
Ans : Solution A
Ans : Solution B
Ans : Solution C
39. Logistic regression is a ________ regression technique that is used to model data having a
_____outcome.
a) linear, numeric
b) linear, binary
c) nonlinear, numeric
d) nonlinear, binary
Ans : Solution D
40. This technique associates a conditional probability value with each data instance.
a) linear regression
b) logistic regression
c) simple regression
d) multiple linear regression
Ans : Solution B
41. This supervised learning technique can process both numeric and categorical input attributes.
a) linear regression
b) Bayes classifier
c) logistic regression
d) backpropagation learning
Ans : Solution A
Ans : Solution B
43. This clustering algorithm merges and splits nodes to help modify nonoptimal partitions.
a) agglomerative clustering
b) expectation maximization
c) conceptual clustering
d) K-Means clustering
Ans : Solution D
44. This clustering algorithm initially assumes that each data instance represents a single cluster.
a) agglomerative clustering
b) conceptual clustering
c) K-Means clustering
d) expectation maximization
Ans : Solution C
45. This unsupervised clustering algorithm terminates when mean values computed for the current
iteration of the algorithm are identical to the computed mean values for the previous iteration.
a) agglomerative clustering
b) conceptual clustering
c) K-Means clustering
d) expectation maximization
Ans : Solution C
46. Machine learning techniques differ from statistical techniques in that machine learning methods
a) typically assume an underlying distribution for the data.
b) are better able to deal with missing and noisy data.
c) are not able to explain their behavior.
d) have trouble with large-sized datasets.
Ans : Solution B
UNIT –II
1.True- False: Over fitting is more likely when you have huge amount of data to train?
A) TRUE
B) FALSE
Ans Solution: (B)
With a small training dataset, it’s easier to find a hypothesis to fit the training data exactly i.e.
over fitting.
3.Which of the following techniques would perform better for reducing dimensions of a data
set?
A. Removing columns which have too many missing values
B. Removing columns which have high variance in data
C. Removing columns with dissimilar data trends
D. None of these
Ans Solution: (A)
If a columns have too many missing values, (say 99%) then we can remove such columns.
4.It is not necessary to have a target variable for applying dimensionality reduction
algorithms.
A. TRUE
B. FALSE
Ans Solution: (A)
LDA is an example of supervised dimensionality reduction algorithm.
5. PCA can be used for projecting and visualizing data in lower dimensions.
A. TRUE
B. FALSE
Ans Solution: (A)
Sometimes it is very useful to plot the data in lower dimensions. We can take the first 2 principal
components and then visualize the data using scatter plot.
6. The most popularly used dimensionality reduction algorithm is Principal Component Analysis
(PCA). Which of the following is/are true about PCA?
PCA is an unsupervised method
It searches for the directions that data have the largest variance
Maximum number of principal components <= number of features
All principal components are orthogonal to each other
A. 1 and 2
B. 1 and 3
C. 2 and 3
D. All of the above
Ans D
8. What happens when you get features in lower dimensions using PCA?
The features will still have interpretability
The features will lose interpretability
The features must carry all information present in data
The features may not carry all information present in data
A. 1 and 3
B. 1 and 4
C. 2 and 3
D. 2 and 4
Ans Solution: (D)
When you get the features in lower dimensions then you will lose some information of data
most of the times and you won’t be able to interpret the lower dimension data.
10. What is of the following statement is true about t-SNE in comparison to PCA?
A. When the data is huge (in size), t-SNE may fail to produce better results.
B. T-NSE always produces better result regardless of the size of the data
C. PCA always performs better than t-SNE for smaller size data.
D. None of these
Ans Solution: (A)
Option A is correct
11. [ True or False ] PCA can be used for projecting and visualizing data in lower dimensions.
A. TRUE
B. FALSE
Solution: (A)
Sometimes it is very useful to plot the data in lower dimensions. We can take the first 2 principal
components and then visualize the data using scatter plot.
12. A feature F1 can take certain value: A, B, C, D, E, & F and represents grade of students from
a college.
1) Which of the following statement is true in following case?
A) Feature F1 is an example of nominal variable.
B) Feature F1 is an example of ordinal variable.
C) It doesn’t belong to any of the above category.
D) Both of these
Solution: (B)
Ordinal variables are the variables which has some order in their categories. For example, grade
A should be consider as high grade than grade B.
1. Which of the following methods do we use to best fit the data in Logistic Regression?
A) Least Square Error
B) Maximum Likelihood
C) Jaccard distance
D) Both A and B
Ans Solution: B
2. Choose which of the following options is true regarding One-Vs-All method in Logistic
Regression.
A) We need to fit n models in n-class classification problem
B) We need to fit n-1 models to classify into n classes
C) We need to fit only 1 model to classify into n classes
D) None of these
Ans Solution: A
3. Suppose, You applied a Logistic Regression model on a given data and got a training accuracy
X and testing accuracy Y. Now, you want to add a few new features in the same data. Select the
option(s) which is/are correct in such a case.
Note: Consider remaining parameters are same.
A) Training accuracy increases
B) Training accuracy increases or remains the same
C) Testing accuracy decreases
D) Testing accuracy increases or remains the same
Ans Solution: A and D
Adding more features to model will increase the training accuracy because model has to
consider more data to fit the logistic regression. But testing accuracy increases if feature is
found to be significant
6. Which of the following methods do we use to find the best fit line for data in Linear
Regression?
A) Least Square Error
B) Maximum Likelihood
C) Logarithmic Loss
D) Both A and B
Ans Solution: (A)
In linear regression, we try to minimize the least square errors of the model to identify the line
of best fit.
8. Suppose you plotted a scatter plot between the residuals and predicted values in linear
regression and you found that there is a relationship between them. Which of the following
conclusion do you make about this situation?
9. Suppose you have fitted a complex regression model on a dataset. Now, you are using Ridge
regression with penalty x.
Choose the option which describes bias in best manner.
A) In case of very large x; bias is low
B) In case of very large x; bias is high
C) We can’t say about bias
D) None of these
Ans Solution: (B)
If the penalty is very large it means model is less complex, therefore the bias would be high.
11. Suppose you have trained a logistic regression classifier and it outputs a new example x with
a prediction ho(x) = 0.2. This means
Our estimate for P(y=1 | x)
Our estimate for P(y=0 | x)
Our estimate for P(y=1 | x)
Our estimate for P(y=0 | x)
Ans Solution: B
A) TRUE
B) FALSE
Solution: (A)
True. A Neural network can be used as a universal approximator, so it can definitely implement
a linear regression algorithm.
15. Which of the following methods do we use to find the best fit line for data in Linear
Regression?
A) Least Square Error
B) Maximum Likelihood
C) Logarithmic Loss
D) Both A and B
Solution: (A)
In linear regression, we try to minimize the least square errors of the model to identify the line
of best fit.
16. Which of the following evaluation metrics can be used to evaluate a model while modeling
a continuous output variable?
A) AUC-ROC
B) Accuracy
C) Logloss
D) Mean-Squared-Error
Solution: (D)
Since linear regression gives output as continuous values, so in such case we use mean squared
error metric to evaluate the model performance. Remaining options are use in case of a
classification problem.
17. True-False: Lasso Regularization can be used for variable selection in Linear Regression.
A) TRUE
B) FALSE
Solution: (A)
True, In case of lasso regression we apply absolute penalty which makes some of the coefficients
zero.
19. Suppose that we have N independent variables (X1,X2… Xn) and dependent variable is Y.
Now Imagine that you are applying linear regression by fitting the best fit line using least square
error on this data.
You found that correlation coefficient for one of it’s variable(Say X1) with Y is -0.95.
Which of the following is true for X1?
A) Relation between the X1 and Y is weak
B) Relation between the X1 and Y is strong
C) Relation between the X1 and Y is neutral
D) Correlation can’t judge the relationship
Solution: (B)
The absolute value of the correlation coefficient denotes the strength of the relationship.
Since absolute correlation is very high it means that the relationship is strong between X1 and
Y.
20. Looking at above two characteristics, which of the following option is the correct for
Pearson correlation between V1 and V2?
If you are given the two variables V1 and V2 and they are following below two characteristics.
1. If V1 increases then V2 also increases
2. If V1 decreases then V2 behavior is unknown
A) Pearson correlation will be close to 1
B) Pearson correlation will be close to -1
C) Pearson correlation will be close to 0
D) None of these
Solution: (D)
We cannot comment on the correlation coefficient by using only statement 1. We need to
consider the both of these two statements. Consider V1 as x and V2 as |x|. The correlation
coefficient would not be close to 1 in such a case.
21. Suppose Pearson correlation between V1 and V2 is zero. In such case, is it right to
conclude that V1 and V2 do not have any relation between them?
A) TRUE
B) FALSE
Solution: (B)
Pearson correlation coefficient between 2 variables might be zero even when they have a
relationship between them. If the correlation coefficient is zero, it just means that that they
don’t move together. We can take examples like y=|x| or y=x^2.
22. True- False: Overfitting is more likely when you have huge amount of data to train?
A) TRUE
B) FALSE
Solution: (B)
With a small training dataset, it’s easier to find a hypothesis to fit the training data exactly i.e.
overfitting.
23. We can also compute the coefficient of linear regression with the help of an analytical
method called “Normal Equation”. Which of the following is/are true about Normal Equation?
1. We don’t have to choose the learning rate
2. It becomes slow when number of features is very large
3. Thers is no need to iterate
A) 1 and 2
B) 1 and 3
C) 2 and 3
D) 1,2 and 3
Solution: (D)
Instead of gradient descent, Normal Equation can also be used to find coefficients.
25. What will happen when you apply very large penalty?
A) Some of the coefficient will become absolute zero
B) Some of the coefficient will approach zero but not absolute zero
C) Both A and B depending on the situation
D) None of these
Solution: (B)
In lasso some of the coefficient value become zero, but in case of Ridge, the coefficients become
close to zero but not zero.
26. What will happen when you apply very large penalty in case of Lasso?
A) Some of the coefficient will become zero
B) Some of the coefficient will be approaching to zero but not absolute zero
C) Both A and B depending on the situation
D) None of these
Solution: (A)
As already discussed, lasso applies absolute penalty, so some of the coefficients will become
zero.
27. Which of the following statement is true about outliers in Linear regression?
A) Linear regression is sensitive to outliers
B) Linear regression is not sensitive to outliers
C) Can’t say
D) None of these
Solution: (A)
The slope of the regression line will change due to outliers in most of the cases. So Linear
Regression is sensitive to outliers.
28. Suppose you plotted a scatter plot between the residuals and predicted values in linear
regression and you found that there is a relationship between them. Which of the following
conclusion do you make about this situation?
31. In terms of bias and variance. Which of the following is true when you fit degree 2
polynomial?
A) Increase
B) Decrease
C) Remain constant
D) Can’t Say
Solution: (D)
Training error may increase or decrease depending on the values that are used to fit the model.
If the values used to train contain more outliers gradually, then the error might just increase.
33. What do you expect will happen with bias and variance as you increase the size of training
data?
34. What would be the root mean square training error for this data if you run a Linear
Regression model of the form (Y = A0+A1X)?
A) Less than 0
B) Greater than zero
C) Equal to 0
D) None of these
Solution: (C)
We can perfectly fit the line on the following data so mean error will be zero.
35. Which of the following scenario would give you the right hyper parameter?
A) 1
B) 2
C) 3
D) 4
Solution: (B)
Option B would be the better option because it leads to less training as well as validation error.
36. Suppose you got the tuned hyper parameters from the previous question. Now, Imagine
you want to add a variable in variable space such that this added feature is important. Which
of the following thing would you observe in such case?
A) Training Error will decrease and Validation error will increase
B) Training Error will increase and Validation error will increase
C) Training Error will increase and Validation error will decrease
D) Training Error will decrease and Validation error will decrease
E) None of the above
Solution: (D)
If the added feature is important, the training and validation error would decrease.
A) L1
B) L2
C) Any
D) None of these
Solution: (D)
I won’t use any regularization methods because regularization is used in case of overfitting.
41. True-False: Is it possible to design a logistic regression algorithm using a Neural Network
Algorithm?
A) TRUE
B) FALSE
Solution: A
True, Neural network is a is a universal approximator so it can implement linear regression
algorithm.
43. Which of the following methods do we use to best fit the data in Logistic Regression?
A) Least Square Error
B) Maximum Likelihood
C) Jaccard distance
D) Both A and B
Solution: B
Logistic regression uses maximum likely hood estimate for training a logistic regression.
44. Which of the following evaluation metrics can not be applied in case of logistic regression
output to compare with target?
A) AUC-ROC
B) Accuracy
C) Logloss
D) Mean-Squared-Error
Solution: D
Since, Logistic Regression is a classification algorithm so it’s output can not be real time value so
mean squared error can not use for evaluating it
45. One of the very good methods to analyze the performance of Logistic Regression is AIC,
which is similar to R-Squared in Linear Regression. Which of the following is true about AIC?
A) We prefer a model with minimum AIC value
B) We prefer a model with maximum AIC value
C) Both but depend on the situation
D) None of these
Solution: A
We select the best model in logistic regression which can least AIC.
Solution: A
In case of lasso we apply a absolute penality, after increasing the penality in lasso some of the
coefficient of variables may become zero.
Context: 48-49
Consider a following model for logistic regression: P (y =1|x, w)= g(w0 + w1x)
where g(z) is the logistic function.
In the above equation the P (y =1|x; w) , viewed as a function of x, that we can get by changing the
parameters w.
A) (0, inf)
B) (-inf, 0 )
C) (0, 1)
D) (-inf, inf)
Solution: C
For values of x in the range of real number from −∞ to +∞ Logistic function will give the output
between (0,1)
49 In above question what do you think which function would make p between (0,1)?
A) logistic function
B) Log likelihood function
C) Mixture of both
D) None of them
Solution: A
50. Suppose you have been given a fair coin and you want to find out the odds of getting heads.
Which of the following option is true for such a case?
A) odds will be 0
B) odds will be 0.5
C) odds will be 1
D) None of these
Solution: C
Odds are defined as the ratio of the probability of success and the probability of failure. So in case of fair
coin probability of success is 1/2 and the probability of failure is 1/2 so odd would be 1
51. The logit function(given as l(x)) is the log of odds function. What could be the range of logit
function in the domain x=[0,1]?
A) (– ∞ , ∞)
B) (0,1)
C) (0, ∞)
D) (- ∞, 0)
Solution: A
For our purposes, the odds function has the advantage of transforming the probability function, which
has values from 0 to 1, into an equivalent function with values between 0 and ∞. When we take the
natural log of the odds function, we get a range of values from -∞ to ∞.
A) Linear Regression errors values has to be normally distributed but in case of Logistic Regression it is
not the case
B) Logistic Regression errors values has to be normally distributed but in case of Linear Regression it is
not the case
C) Both Linear Regression and Logistic Regression error values have to be normally distributed
D) Both Linear Regression and Logistic Regression error values have not to be normally distributed
Solution:A
53. Which of the following is true regarding the logistic function for any value “x”?
Note:
Logistic(x): is a logistic function of any number “x”
A) Logistic(x) = Logit(x)
B) Logistic(x) = Logit_inv(x)
C) Logit_inv(x) = Logit(x)
D) None of these
Solution: B
Suppose you have given the two scatter plot “a” and “b” for two classes( blue for positive and red for
negative class). In scatter plot “a”, you correctly classified all data points using logistic regression ( black
line is a decision boundary).
A) Bias will be high
B) Bias will be low
C) Can’t say
D) None of these
Solution: A
55. Suppose, You applied a Logistic Regression model on a given data and got a training accuracy X
and testing accuracy Y. Now, you want to add a few new features in the same data. Select the
option(s) which is/are correct in such a case.
Solution: A and D
Adding more features to model will increase the training accuracy because model has to consider more
data to fit the logistic regression. But testing accuracy increases if feature is found to be significant
56. Choose which of the following options is true regarding One-Vs-All method in Logistic Regression.
If there are n classes, then n separate logistic regression has to fit, where the probability of each
category is predicted over the rest of the categories combined.
57. Below are two different logistic models with different values for β0 and β1.
Which of the
following statement(s) is true about β0 and β1 values of two logistics models (Green, Black)?
Solution: B
Context 58-60
Below are the three scatter plot(A,B,C left to right) and hand drawn decision boundaries for logistic
regression.
58. Which of the following above figure shows that the decision boundary is overfitting the training
data?
A) A
B) B
C) C
D)None of these
Solution: C
Since in figure 3, Decision boundary is not smooth that means it will over-fitting the data.
1. The training error in first plot is maximum as compare to second and third plot.
2. The best model for this regression problem is the last (third) plot because it has minimum
training error (zero).
3. The second model is more robust than first and third because it will perform best on unseen
data.
5. All will perform same because we have not seen the testing data.
A) 1 and 3
B) 1 and 3
C) 1, 3 and 4
D) 5
Solution: C
The trend in the graphs looks like a quadratic trend over independent variable X. A higher degree(Right
graph) polynomial might have a very high accuracy on the train population but is expected to fail badly
on test dataset. But if you see in left graph we will have training error maximum because it underfits the
training data
60. Suppose, above decision boundaries were generated for the different value of regularization.
Which of the above decision boundary shows the maximum regularization?
A) A
B) B
C) C
D) All have equal regularization
Solution: A
Since, more regularization means more penality means less complex decision boundry that shows in first
figure A.
61. What would do if you want to train logistic regression on same data that will take less time as well
as give the comparatively similar accuracy(may not be same)?
Suppose you are using a Logistic Regression model on a huge dataset. One of the problem you may face
on such huge data is that Logistic regression will take very long time to train.
Solution: D
If you decrease the number of iteration while training it will take less time for surly but will not give the
same accuracy for getting the similar accuracy but not exact you need to increase the learning rate.
62. Which of the following image is showing the cost function for y =1.
Following is the loss function in logistic regression(Y-axis loss function and x axis log probability) for
two class classification problem.
Solution: A
A is the true answer as loss function decreases as the log probability increases
A) 1
B) 2
C) 3
D) 4
Solution: C
There are three local minima present in the graph
64. Can a Logistic Regression classifier do a perfect classification on the below data?
Note: You can use only X1 and X2 variables where X1 and X2 can take only two binary values(0,1).
A) TRUE
B) FALSE
C) Can’t say
D) None of these
Solution: B
No, logistic regression only forms linear decision surface, but the examples in the figure are not linearly
separable.
UNIT IV
Ans Solution: C
When the data has noise and overlapping points, there is a problem in drawing a clear hyperplane
without misclassifying.
Ans Solution: C
The cost parameter decides how much an SVM should be allowed to “bend” with the data. For a low
cost, you aim for a smooth decision surface and for a higher cost, you aim to classify more points
correctly. It is also simply referred to as the cost of misclassification.
Ans Solution: D
SVM’s are highly versatile models that can be used for practically all real world problems ranging from
regression to clustering and handwriting recognitions.
Ans Solution: B
Generalisation error in statistics is generally the out-of-sample error which is the measure of how
accurately a model can predict values for previously unseen data.
Ans Solution: C
When the data has noise and overlapping points, there is a problem in drawing a clear hyperplane
without misclassifying.
A) 1
B) 2
C) 1 and 2
D) None of these
Ans Solution: C
Question Context:8– 9
Suppose you are using a Linear SVM classifier with 2 class classification problem. Now you have been
given the following data in which some points are circled red that are representing support vectors.
8. If you remove the following any one red points from the data. Does the decision boundary will
change?
A) Yes
B) No
Solution: A
These three examples are positioned such that removing any one of them introduces slack in the
constraints. So the decision boundary would completely change.
9. [True or False] If you remove the non-red circled points from the data, the decision boundary will
change?
A) True
B) False
Solution: B
On the other hand, rest of the points in the data won’t affect the decision boundary much.
Solution: B
Generalization error in statistics is generally the out-of-sample error which is the measure of how
accurately a model can predict values for previously unseen data.
11. When the C parameter is set to infinite, which of the following holds true?
A) The optimal hyperplane if exists, will be the one that completely separates the data
B) The soft-margin classifier will separate the data
C) None of the above
Solution: A
At such a high level of misclassification penalty, soft margin will not hold existence as there will be no
room for error.
Solution: A
A hard margin means that an SVM is very rigid in classification and tries to work extremely well in the
training set, causing overfitting.
13. The minimum time complexity for training an SVM is O(n2). According to this fact, what sizes of
datasets are not best suited for SVM’s?
A) Large datasets
B) Small datasets
C) Medium sized datasets
D) Size does not matter
Solution: A
Datasets which have a clear classification boundary will function best with SVM’s.
A) Selection of Kernel
B) Kernel Parameters
C) Soft Margin Parameter C
D) All of the above
Solution: D
The SVM effectiveness depends upon how you choose the basic 3 requirements mentioned above in
such a way that it maximises your efficiency, reduces error and overfitting.
15. upport vectors are the data points that lie closest to the decision surface.
A) TRUE
B) FALSE
Solution: A
They are the points closest to the hyperplane and the hardest ones to classify. They also have a direct
bearing on the location of the decision surface.
Solution: C
When the data has noise and overlapping points, there is a problem in drawing a clear hyperplane
without misclassifying.
17. Suppose you are using RBF kernel in SVM with high Gamma value. What does this signify?
A) The model would consider even far away points from hyperplane for modeling
B) The model would consider only the points close to the hyperplane for modeling
C) The model would not be affected by distance of points from hyperplane for modeling
D) None of the above
Solution: B
The gamma parameter in SVM tuning signifies the influence of points either near or far away from the
hyperplane.
For a low gamma, the model will be too constrained and include all points of the training dataset,
without really capturing the shape.
For a higher gamma, the model will capture the shape of the dataset well.
Solution: C
The cost parameter decides how much an SVM should be allowed to “bend” with the data. For a low
cost, you aim for a smooth decision surface and for a higher cost, you aim to classify more points
correctly. It is also simply referred to as the cost of misclassification.
19. Suppose you are building a SVM model on data X. The data X can be error prone which means that
you should not trust any specific data point too much. Now think that you want to build a SVM model
which has quadratic kernel function of polynomial degree 2 that uses Slack variable C as one of it’s hyper
parameter. Based upon that give the answer for following question.
What would happen when you use very large value of C(C->infinity)?
Note: For small C was also classifying all data points correctly
A) We can still classify data correctly for given setting of hyper parameter C
B) We can not classify data correctly for given setting of hyper parameter C
C) Can’t Say
D) None of these
Solution: A
For large values of C, the penalty for misclassifying points is very high, so the decision boundary will
perfectly separate the data if possible.
20. What would happen when you use very small C (C~0)?
Solution: A
The classifier can maximize the margin between most of the points, while misclassifying a few points,
because the penalty is so low.
21. If I am using all features of my dataset and I achieve 100% accuracy on my training set, but ~70% on
validation set, what should I look out for?
A) Underfitting
B) Nothing, the model is perfect
C) Overfitting
Solution: C
If we’re achieving 100% training accuracy very easily, we need to check to verify if we’re overfitting our
data.
22. Which of the following are real world applications of the SVM?
Solution: D
SVM’s are highly versatile models that can be used for practically all real world problems ranging from
regression to clustering and handwriting recognitions.
Question Context: 23 – 25
Suppose you have trained an SVM with linear decision boundary after training SVM, you correctly infer
that your SVM model is under fitting.
23. Which of the following option would you more likely to consider iterating SVM next time?
Solution: C
The best option here would be to create more features for the model.
24. Suppose you gave the correct answer in previous question. What do you think that is actually
happening?
A) 1 and 2
B) 2 and 3
C) 1 and 4
D) 2 and 4
Solution: C
Better model will lower the bias and increase the variance
25. In above question suppose you want to change one of it’s(SVM) hyperparameter so that effect
would be same as previous questions i.e model will not under fit?
Solution: A
Increasing C parameter would be the right thing to do here, as it will ensure regularized model
26. We usually use feature normalization before using the Gaussian kernel in SVM. What is true about
feature normalization?
A) 1
B) 1 and 2
C) 1 and 3
D) 2 and 3
Solution: B
Suppose you are dealing with 4 class classification problem and you want to train a SVM model on the
data for that you are using One-vs-all method. Now answer the below questions?
27. How many times we need to train our SVM model in such case?
A) 1
B) 2
C) 3
D) 4
Solution: D
For a 4 class problem, you would have to train the SVM at least 4 times if you are using a one-vs-all
method.
28. Suppose you have same distribution of classes in the data. Now, say for training 1 time in one vs all
setting the SVM is taking 10 second. How many seconds would it require to train one-vs-all method end
to end?
A) 20
B) 40
C) 60
D) 80
Solution: B
29 Suppose your problem has changed now. Now, data has only 2 classes. What would you think how
many times we need to train SVM in such case?
A) 1
B) 2
C) 3
D) 4
Solution: A
Training the SVM only one time would give you appropriate results
Suppose you are using SVM with linear kernel of polynomial degree 2, Now think that you have applied
this on data and found that it perfectly fit the data that means, Training and testing accuracy is 100%.
30. Now, think that you increase the complexity (or degree of polynomial of this kernel). What would
you think will happen?
Solution: A
Increasing the complexity of the data would make the algorithm overfit the data.
31. In the previous question after increasing the complexity you found that training accuracy was still
100%. According to you what is the reason behind that?
1. Since data is fixed and we are fitting more polynomial term or parameters so the algorithm starts
memorizing everything in the data
2. Since data is fixed and SVM doesn’t need to search in big hypothesis space
A) 1
B) 2
C) 1 and 2
D) None of these
Solution: C
A) 1
B) 2
C) 1 and 2
D) None of these
Solution: C
UNIT V
1. Which of the following is a widely used and effective machine learning algorithm based on the
idea of bagging?
a) Decision Tree
b) Regression
c) Classification
d) Random Forest
Ans D
a) Factor analysis
b) Decision trees are robust to outliers
c) Decision trees are prone to be overfit
d) None of the above
Ans C
a. True
b. False
Decision trees can also be used to for clusters in the data but clustering often generates natural
clusters and is not dependent on any objective function.
Regression
Classification
Clustering
Reinforcement Learning
Options:
a. 1 Only
b. 1 and 2
c. 1 and 3
d. 1, 2 and 4
Ans D
6 Which of the following is the most appropriate strategy for data cleaning before performing
clustering analysis, given less than desirable number of data points:
Removal of outliers
Options:
a. 1 only
b. 2 only
c. 1 and 2
d. None of the above
Ans A
A) 1
B) 2
C) 1 and 2
D) None of these
Ans Solution: C
Both options are true. In Bagging, each individual trees are independent of each other because they
consider different subset of features and samples.
A) 1
B) 2
C) 1 and 2
D) None of these
Ans Solution: B
In boosting tree individual weak learners are not independent of each other because each tree correct
the results of previous tree. Bagging and boosting both can be consider as improving the base learners
results.
9. In Random forest you can generate hundreds of trees (say T1, T2 …..Tn) and then aggregate
the results of these tree. Which of the following is true about individual (Tk) tree in Random Forest?
1. Individual tree is built on a subset of the features
A) 1 and 3
B) 1 and 4
C) 2 and 3
D) 2 and 4
Ans Solution: A
Random forest is based on bagging concept, that consider faction of sample and faction of feature for
building the individual trees.
10. Suppose you are using a bagging based algorithm say a RandomForest in model building.
Which of the following can be true?
A) 1
B) 2
C) 1 and 2
D) None of these
Ans Solution: A
Since Random Forest aggregate the result of different weak learners, If It is possible we would want
more number of trees in model building. Random Forest is a black box model you will lose
interpretability after using it.
11. Which of the following is/are true about Random Forest and Gradient Boosting ensemble
methods?
2. Random Forest is use for classification whereas Gradient Boosting is use for regression task
3. Random Forest is use for regression whereas Gradient Boosting is use for Classification task
Solution: E
12. In Random forest you can generate hundreds of trees (say T1, T2 …..Tn) and then aggregate the
results of these tree. Which of the following is true about individual(Tk) tree in Random Forest?
A) 1 and 3
B) 1 and 4
C) 2 and 3
D) 2 and 4
Solution: A
Random forest is based on bagging concept, that consider faction of sample and faction of feature for
building the individual trees.
13. Which of the following algorithm doesn’t uses learning Rate as of one of its hyperparameter?
1. Gradient Boosting
2. Extra Trees
3. AdaBoost
4. Random Forest
A) 1 and 3
B) 1 and 4
C) 2 and 3
D) 2 and 4
Solution: D
Random Forest and Extra Trees don’t have learning rate as a hyperparameter.
14. Which of the following algorithm are not an example of ensemble learning algorithm?
A) Random Forest
B) Adaboost
C) Extra Trees
D) Gradient Boosting
E) Decision Trees
Solution: E
Decision trees doesn’t aggregate the results of multiple trees so it is not an ensemble algorithm.
15. Suppose you are using a bagging based algorithm say a RandomForest in model building. Which of
the following can be true?
A) 1
B) 2
C) 1 and 2
D) None of these
Solution: A
Since Random Forest aggregate the result of different weak learners, If It is possible we would want
more number of trees in model building. Random Forest is a black box model you will lose
interpretability after using it.
16. True-False: The bagging is suitable for high variance low bias models?
A) TRUE
B) FALSE
Solution: A
The bagging is suitable for high variance low bias models or you can say for complex models.
17. To apply bagging to regression trees which of the following is/are true in such case?
Solution: D
Solution: B
We always consider the validation results to compare with the test result.
19. In which of the following scenario a gain ratio is preferred over Information Gain?
Solution: A
When high cardinality problems, gain ratio is preferred over Information Gain technique.
20. Suppose you have given the following scenario for training and validation error for Gradient
Boosting. Which of the following hyper parameter would you choose in such case?
1 2 100 110
2 4 90 105
3 6 50 100
4 8 45 105
5 10 30 150
A) 1
B) 2
C) 3
D) 4
Solution: B
Scenario 2 and 4 has same validation accuracies but we would select 2 because depth is lower is better
hyper parameter.
21. Which of the following is/are not true about DBSCAN clustering algorithm:
1. For data points to be in a cluster, they must be in a distance threshold to a core point
5. It is robust to outliers
Options:
A. 1 only
B. 2 only
C. 4 only
D. 2 and 3
Solution: D
DBSCAN can form a cluster of any arbitrary shape and does not have strong assumptions for the
distribution of data points in the data space.
Answer: d
Explanation: K-means clustering follows partitioning approach.
Answer: c
Explanation: k-nearest neighbour has nothing to do with k-means.
Answer: a
Explanation: K-means requires a number of clusters.
Answer: a
Explanation: K-means clustering produces the final estimate of cluster centroids.
27.
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
S.r No Question a b c d Correct Image
Answer
Write down question Option a Option b Option c Option d a/b/c/d img.jpg
1 In reinforcement learning if feedback is Penalty Overlearning Reward None of above A
negative one it is defined as____.
2 According to____ , it�s a key success Claude Shannon's theory Gini Index Darwin�s theory None of above C
factor for the survival and evolution of all
species.
3 How can you avoid overfitting ? By using a lot of data By using inductive machine By using validation only None of above A
learning
4 What are the popular algorithms of Decision Trees and Neural Probabilistic networks and Support vector machines All D
Machine Learning? Networks (back Nearest Neighbor
propagation)
5 What is �Training set�? Training set is used to test A set of data is used to Both A & B None of above B
the accuracy of the discover the potentially
hypotheses generated by the predictive relationship.
learner.
6 Common deep learning applications Image classification, Autonomous car driving, All above D
include____ Real-time visual tracking Logistic optimization Bioinformatics,
Speech recognition
7 what is the function of �Supervised Classifications, Predict time Speech recognition, Both A & B None of above C
Learning�? series, Annotate strings Regression
8 Commons unsupervised applications Object segmentation Similarity detection Automatic labeling All above D
include
9 Reinforcement learning is particularly the environment is not it's often very dynamic it's impossible to have a All above D
efficient when______________. completely deterministic precise error measure
10 if there is only a discrete number of Regression Classification. Modelfree Categories B
possible outcomes (called categories),
the process becomes a______.
11 Which of the following are supervised Spam detection, Image classification, Autonomous car driving, A
learning applications Pattern detection, Real-time visual tracking Logistic optimization Bioinformatics,
Natural Language Speech recognition
Processing
12 During the last few years, many ______ Logical Classical Classification None of above D
algorithms have been applied to deep
neural networks to learn the best policy
for playing Atari video games and to teach
an agent how to associate the right action
with an input representing the state.
13 Which of the following sentence is Machine learning relates Data mining can be defined Both A & B None of the above C
correct? with the study, design and as the process in which the
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
development of the unstructured data tries to
algorithms that give extract knowledge or
computers the capability to unknown interesting
learn without being explicitly patterns.
programmed.
14 What is �Overfitting� in Machine when a statistical model Robots are programed so While involving the process a set of data is used to A
learning? describes random error or that they can perform the of learning �overfitting� discover the potentially
noise instead of underlying task based on data they occurs. predictive relationship
relationship �overfitting� gather from sensors.
occurs.
15 What is �Test set�? Test set is used to test the It is a set of data is used to Both A & B None of above A
accuracy of the hypotheses discover the potentially
generated by the learner. predictive relationship.
16 ________is much more difficult because it's Removing the whole line Creating sub-model to Using an automatic All above B
necessary to determine a supervised predict those features strategy to input them
strategy to train a model for each feature according to the other
and, finally, to predict their value known values
17 How it's possible to use a different regression classification random_state missing_values D
placeholder through the
parameter_______.
18 If you need a more powerful scaling RobustScaler DictVectorizer LabelBinarizer FeatureHasher A
feature, with a superior control on outliers
and the possibility to select a quantile
range, there's also the class________.
19 scikit-learn also provides a class for per- max, l0 and l1 norms max, l1 and l2 norms max, l2 and l3 norms max, l3 and l4 norms B
sample normalization, Normalizer. It can
apply________to each element of a dataset
20 There are also many univariate methods F-tests and p-values chi-square ANOVA All above A
that can be used in order to select the
best features according to specific criteria
based on________.
21 Which of the following selects only a SelectPercentile FeatureHasher SelectKBest All above A
subset of features belonging to a certain
percentile
22 ________performs a PCA with non-linearly SparsePCA KernelPCA SVD None of the Mentioned B
separable data sets.
23 A feature F1 can take certain value: A, B, Feature F1 is an example of Feature F1 is an example of It doesn�t belong to any Both of these B
C, D, E, & F and represents grade of nominal variable. ordinal variable. of the above category.
students from a college.
Which of the following statement is true in
following case?
24 What would you do in PCA to get the Transform data to zero mean Transform data to zero Not possible None of these A
same projection as SVD? median
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
25 What is PCA, KPCA and ICA used for? Principal Components Kernel based Principal Independent Component All above D
Analysis Component Analysis Analysis
26 Can a model trained for item based YES NO A
similarity also choose from a given set of
items?
27 What are common feature selection correlation coefficient Greedy algorithms All above None of these C
methods in regression task?
28 The parameter______ allows specifying test_size training_size All above None of these C
the percentage of elements to put into the
test/training set
29 In many classification problems, the random_state dataset test_size All above B
target ______ is made up of categorical
labels which cannot immediately be
processed by any algorithm.
30 _______adopts a dictionary-oriented LabelEncoder class LabelBinarizer class DictVectorizer FeatureHasher A
approach, associating to each category
label a progressive integer number.
31 If Linear regression model perfectly first a) Test error is also always b) Test error is non zero c) Couldn�t comment on d) Test error is equal to Train c
i.e., train error is zero, then zero Test error error
_____________________
32 Which of the following metrics can be a) ii and iv b) i and ii c) ii, iii and iv d) i, ii, iii and iv d
used for evaluating regression models?i)
R Squaredii) Adjusted R Squarediii) F
Statisticsiv) RMSE / MSE / MAE
33 How many coefficients do you need to a) 1 b) 2 c) 3 d) 4 b
estimate in a simple linear regression
model (One independent variable)?
34 In a simple linear regression model (One a) by 1 b) no change c) by intercept d) by its slope d
independent variable), If we change the
input variable by 1 unit. How much output
variable will change?
35 �Function used for linear regression in R a) lm(formula, data) b) lr(formula, data) c) lrm(formula, data) d) regression.linear(formula, a
is __________ data)
36 In syntax of linear model a) Matrix b) Vector c) Array d) List b
lm(formula,data,..), data refers to ______
37 In the mathematical Equation of Linear a) (X-intercept, Slope) b) (Slope, X-Intercept) c) (Y-Intercept, Slope) d) (slope, Y-Intercept) c
Regression Y?=??1 + ?2X + ?, (?1, ?2)
refers to __________
38 Linear Regression is a supervised A) TRUE B) FALSE a
machine learning algorithm.
39 It is possible to design a Linear regression A) TRUE B) FALSE a
algorithm using a neural network?
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
40 Which of the following methods do we A)�Least Square Error B)�Maximum Likelihood C) Logarithmic Loss D) Both A and B a
use to find the best fit line for data in
Linear Regression?
41 Which of the following evaluation metrics A)�AUC-ROC B)�Accuracy C)�Logloss D)�Mean-Squared-Error d
can be used to evaluate a model while
modeling a continuous output variable?
42 Which of the following is true about A) Lower is better B)�Higher is better C)�A or B depend on the D)�None of these a
Residuals ? situation
43 Overfitting is more likely when you have A) TRUE B) FALSE b
huge amount of data to train?
44 Which of the following statement is true A)�Linear regression is B)�Linear regression is C)�Can�t say D)�None of these a
about outliers in Linear regression? sensitive to outliers not sensitive to outliers
45 Suppose you plotted a scatter plot A)�Since the there is a B)�Since the there is a C)�Can�t say D)�None of these a
between the residuals and predicted relationship means our relationship means our
values in linear regression and you found model is not good model is good
that there is a relationship between them.
Which of the following conclusion do you
make about this situation?
46 Naive Bayes classifiers are a collection Classification Clustering Regression All a
------------------of algorithms�
47 Naive Bayes classifiers is _______________ Supervised Unsupervised Both None a
Learning
48 Features being classified is independent False TRUE b
of each other in Na�ve Bayes Classifier
49 Features being classified is __________ of Independent Dependent Partial Dependent None a
each other in Na�ve Bayes Classifier
50 Bayes Theorem is given by where 1. P(H) True FALSE a bayes.jpg
is the probability of hypothesis H being
true.
2. P(E) is the probability of the
evidence(regardless of the hypothesis).
3. P(E|H) is the probability of the evidence
given that hypothesis is true.
4. P(H|E) is the probability of the
hypothesis given that the evidence is
there.
51 In given image, P(H|E) Posterior Prior a bayes.jpg
is__________probability.
52 In given image, P(H) Posterior Prior b bayes.jpg
is__________probability.
53 Conditional probability is a measure of the True FALSE a
probability of an event given that another
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
event has already occurred.
54 Bayes� theorem describes the True FALSE a
probability of an event, based on prior
knowledge of conditions that might be
related to the event.
55 Bernoulli Na�ve Bayes Classifier is Continuous Discrete Binary c
___________distribution
56 Multinomial Na�ve Bayes Classifier is Continuous Discrete Binary b
___________distribution
57 Gaussian Na�ve Bayes Classifier is Continuous Discrete Binary a
___________distribution
58 Binarize parameter in BernoulliNB scikit True FALSE a
sets threshold for binarizing of sample
features.
59 Gaussian distribution when plotted, gives Mean Variance Discrete Random a
a bell shaped curve which is symmetric
about the _______ of the feature values.
60 SVMs directly give us the posterior True FALSE b
probabilities P(y = 1jx) and P(y = ??1jx)
61 Any linear combination of the True FALSE a
components of a multivariate Gaussian is
a univariate Gaussian.
62 Solving a non linear separation problem True FALSE a
with a hard margin Kernelized SVM
(Gaussian RBF Kernel) might lead to
overfitting
63 SVM is a ------------------ algorithm� Classification Clustering Regression All a
64 SVM is a ------------------ learning Supervised Unsupervised Both None a
65 The linear�SVM�classifier works by True FALSE a
drawing a straight line between two
classes
66 Which of the following function provides cl_forecastB cl_nowcastC cl_precastD None of the Mentioned D --
unsupervised prediction ?
67 Which of the following is characteristic of fast accuracy scalable All above D --
best machine learning method ?
68 What are the different Algorithm Supervised Learning and Unsupervised Learning and Both A & B None of the Mentioned C --
techniques in Machine Learning? Semi-supervised Learning Transduction
69 What is the standard approach to split the set of example into group the set of example a set of observed learns programs from data A --
supervised learning? the training set and the test into the training set and the instances tries to induce a
test general rule
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
70 Which of the following is not Machine Artificial Intelligence Rule based inference Both A & B None of the Mentioned B --
Learning?
71 What is Model Selection in Machine The process of selecting when a statistical model Find interesting directions All above A --
Learning? models among different describes random error or in data and find novel
mathematical models, which noise instead of underlying observations/ database
are used to describe the relationship cleaning
same data set
72 Which are two techniques of Machine Genetic Programming and Speech recognition and Both A & B None of the Mentioned A --
Learning ? Inductive Learning Regression
73 Even if there are no actual supervisors Supervised Reinforcement Unsupervised None of the above B --
________ learning is also based on
feedback provided by the environment
74 What does learning exactly mean? Robots are programed so A set of data is used to Learning is the ability to It is a set of data is used to C --
that they can perform the discover the potentially change according to discover the potentially
task based on data they predictive relationship. external stimuli and predictive relationship.
gather from sensors. remembering most of all
previous experiences.
75 When it is necessary to allow the model to Overfitting Overlearning Classification Regression A --
develop a generalization ability and avoid
a common problem called______.
76 Techniques involve the usage of both Supervised Semi-supervised Unsupervised None of the above B --
labeled and unlabeled data is called___.
77 In reinforcement learning if feedback is Penalty Overlearning Reward None of above A --
negative one it is defined as____.
78 According to____ , it�s a key success Claude Shannon's theory Gini Index Darwin�s theory None of above C --
factor for the survival and evolution of all
species.
79 A supervised scenario is characterized by Programmer Teacher Author Farmer B --
the concept of a _____.
80 overlearning causes due to an excessive Capacity Regression Reinforcement Accuracy A --
______.
81 Which of the following is an example of a PCA K-Means None of the above A --
deterministic algorithm?
82 Which of the following model model MCV MARS MCRS All above B --
include a backwards elimination feature
selection routine?
83 Can we extract knowledge without apply YES NO A --
feature selection
84 While using feature selection on the data, NO YES B --
is the number of features decreases.
85 Which of the following are several models regression classification None of the above C --
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
for feature extraction
86 _____ provides some built-in datasets that scikit-learn classification regression None of the above A --
can be used for testing purposes.
87 While using _____ all labels are LabelEncoder class LabelBinarizer class DictVectorizer FeatureHasher A --
turned into sequential numbers.
88 _______produce sparse matrices of real DictVectorizer FeatureHasher Both A & B None of the Mentioned C --
numbers that can be fed into any machine
learning model.
89 scikit-learn offers the class______, which is LabelEncoder LabelBinarizer DictVectorizer Imputer D --
responsible for filling the holes using a
strategy based on the mean, median, or
frequency
90 Which of the following scale data by MinMaxScaler MaxAbsScaler Both A & B None of the Mentioned C --
removing elements that don't belong to a
given range or by considering a maximum
absolute value.
91 scikit-learn also provides a class for per- Normalizer Imputer Classifier All above A --
sample normalization,_____
92 ______dataset with many features normalized unnormalized Both A & B None of the Mentioned B --
contains information proportional to the
independence of all features and their
variance.
93 In order to assess how much information Concuttent matrix Convergance matrix Supportive matrix Covariance matrix D --
is brought by each component, and the
correlation among them, a useful tool is
the_____.
94 The_____ parameter can assume different run start stop C --
values which determine how the data init
matrix is initially processed.
95 ______allows exploiting the natural SparsePCA KernelPCA SVD init parameter A --
sparsity of data while extracting principal
components.
96 Which of the following evaluation metrics AUC-ROC Accuracy Logloss Mean-Squared-Error D --
can be used to evaluate a model while
modeling a continuous output variable?
97 Which of the following is true about Lower is better Higher is better A or B depend on the None of these A --
Residuals ? situation
98 Overfitting is more likely when you have TRUE FALSE B --
huge amount of data to train?
99 Which of the following statement is true Linear regression is sensitive Linear regression is not Can�t say None of these A --
about outliers in Linear regression? to outliers sensitive to outliers
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
100 Suppose you plotted a scatter plot Since the there is a Since the there is a Can�t say None of these A --
between the residuals and predicted relationship means our relationship means our
values in linear regression and you found model is not good model is good
that there is a relationship between them.
Which of the following conclusion do you
make about this situation?
101 Let�s say, a �Linear regression� model You will always have test You can not have test error None of the above C --
perfectly fits the training data (train error error zero zero
is zero). Now, Which of the following
statement is true?
102 In a linear regression problem, we are If R Squared increases, this If R Squared decreases, this Individually R squared None of these. C --
using �R-squared� to measure variable is significant. variable is not significant. cannot tell about variable
goodness-of-fit. We add a feature in linear importance. We can�t say
regression model and retrain the same anything about it right now.
model.Which of the following option is
true?
103 Which of the one is true about Linear Regression with Linear Regression with Linear Regression with None of these A --
Heteroskedasticity? varying error terms constant error terms zero error terms
104 Which of the following assumptions do 1,2 and 3. 1,3 and 4. 1 and 3. All of above. D --
we make while deriving linear regression
parameters?1. The true relationship
between dependent y and predictor x is
linear2. The model errors are statistically
independent3. The errors are normally
distributed with a 0 mean and constant
standard deviation4. The predictor x is
non-stochastic and is measured error-free
105 To test linear relationship of y(dependent) Scatter plot Barchart Histograms None of these A --
and x(independent) continuous variables,
which of the following plot best suited?
106 which of the following step / assumption The polynomial degree Whether we learn the The use of a constant-term A --
in regression modeling impacts the trade- weights by matrix inversion
off between under-fitting and over-fitting or gradient descent
the most.
107 Can we calculate the skewness of TRUE FALSE B --
variables based on mean and median?
108 Which of the following is true about Ridge regression uses Lasso regression uses Both use subset selection None of above B --
�Ridge� or �Lasso� regression subset selection of features subset selection of features of features
methods in case of feature selection?
109 Which of the following statement(s) can 1 and 2 1 and 3 2 and 4 None of the above A --
be true post adding a variable in a linear
regression model?1. R-Squared and
Adjusted R-squared both increase2. R-
Squared increases and Adjusted R-
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
squared decreases3. R-Squared
decreases and Adjusted R-squared
decreases4. R-Squared decreases and
Adjusted R-squared increases
110 How many coefficients do you need to 1 2 Can�t Say B --
estimate in a simple linear regression
model (One independent variable)?
111 In given image, P(H) Posterior Prior B bayes.jpg
is__________probability.
112 Conditional probability is a measure of the True FALSE A --
probability of an event given that another
event has already occurred.
113 Gaussian distribution when plotted, gives Mean Variance Discrete Random A --
a bell shaped curve which is symmetric
about the _______ of the feature values.
114 SVMs directly give us the posterior True FALSE B --
probabilities P(y = 1jx) and P(y = ??1jx)
115 SVM is a ------------------ algorithm� Classification Clustering Regression All A --
116 What is/are true about kernel in SVM?1. 1 2 1 and 2 None of these C --
Kernel function map low dimensional data
to high dimensional space2. It�s a
similarity function
117 Suppose you are building a SVM model on Misclassification would Data will be correctly Can�t say None of these A --
data X. The data X can be error prone happen classified
which means that you should not trust
any specific data point too much. Now
think that you want to build a SVM model
which has quadratic kernel function of
polynomial degree 2 that uses Slack
variable C as one of it�s hyper
parameter.What would happen when you
use very small C (C~0)?
118 The cost parameter in the SVM means: The number of cross- The kernel to be used The tradeoff between None of the above C --
validations to be made misclassification and
simplicity of the model
119 Bayes� theorem describes the True FALSE A --
probability of an event, based on prior
knowledge of conditions that might be
related to the event.
120 Bernoulli Na�ve Bayes Classifier is Continuous Discrete Binary C --
___________distribution
121 If you remove the non-red circled points TRUE FALSE B svm.jpg
from the data, the decision boundary will
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
change?
122 How do you handle missing or corrupted a. Drop missing rows or b. Replace missing values c. Assign a unique d. All of the above� D --
data in a dataset? columns with mean/median/mode category to missing values
123 Binarize parameter in BernoulliNB scikit True FALSE A --
sets threshold for binarizing of sample
features.
124 Which of the following statements about A.�����Attributes are B.�����Attributes are C.�����Attributes are D.�����Attributes can B --
Naive Bayes is incorrect? equally important. statistically dependent of statistically independent of be nominal or numeric
one another given the class one another given the
value. class value.
125 The SVM�s are less effective when: The data is linearly separable The data is clean and ready The data is noisy and C --
to use contains overlapping
points
126 Naive Bayes classifiers is _______________ Supervised Unsupervised Both None A --
Learning
127 Features being classified is independent False TRUE B --
of each other in Na�ve Bayes Classifier
128 Features being classified is __________ of Independent Dependent Partial Dependent None A --
each other in Na�ve Bayes Classifier
129 Bayes Theorem is given by where 1. P(H) True FALSE A bayes.jpg
is the probability of hypothesis H being
true.
2. P(E) is the probability of the
evidence(regardless of the hypothesis).
3. P(E|H) is the probability of the evidence
given that hypothesis is true.
4. P(H|E) is the probability of the
hypothesis given that the evidence is
there.
130 Any linear combination of the True FALSE A --
components of a multivariate Gaussian is
a univariate Gaussian.
This sheet
is for 2
Mark
questions
S.r No Question a b c d Correct Image
Answer
e.g 1 Write down question Option a Option b Option c Option d a/b/c/d img.jpg
1 A supervised scenario is characterized by Programmer Teacher Author Farmer B
the concept of a _____.
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
2 overlearning causes due to an excessive Capacity Regression Reinforcement Accuracy A
______.
3 If there is only a discrete number of Modelfree Categories Prediction None of above B
possible outcomes called _____.
4 What is the standard approach to split the set of example into group the set of example a set of observed learns programs from data A
supervised learning? the training set and the test into the training set and the instances tries to induce a
test general rule
5 Some people are using the term ___ Inference Interference Accuracy None of above A
instead of prediction only to avoid the
weird idea that machine learning is a sort
of modern magic.
6 The term _____ can be freely used, but Accuracy Cluster Regression Prediction D
with the same meaning adopted in
physics or system theory.
7 Which are two techniques of Machine Genetic Programming and Speech recognition and Both A & B None of the Mentioned A
Learning ? Inductive Learning Regression
8 Even if there are no actual supervisors Supervised Reinforcement Unsupervised None of the above B
________ learning is also based on
feedback provided by the environment
9 Common deep learning applications / Real-time visual object Classic approaches Automatic labeling Bio-inspired adaptive B
problems can also be solved using____ identification systems
10 Identify the various approaches for Concept Vs Classification Symbolic Vs Statistical Inductive Vs Analytical All above D
machine learning. Learning Learning Learning
11 what is the function of �Unsupervised Find clusters of the data and Find interesting directions Interesting coordinates All D
Learning�? find low-dimensional in data and find novel and correlations
representations of the data observations/ database
cleaning
12 What are the two methods used for the Platt Calibration and Isotonic Statistics and A
calibration in Supervised Learning? Regression Informal Retrieval
13 What is the standard approach to split the set of example into group the set of example a set of observed learns programs from data A
supervised learning? the training set and the test into the training set and the instances tries to induce a
test general rule
14 Which of the following is not Machine Artificial Intelligence Rule based inference Both A & B None of the Mentioned B
Learning?
15 What is Model Selection in Machine The process of selecting when a statistical model Find interesting directions All above A
Learning? models among different describes random error or in data and find novel
mathematical models, which noise instead of underlying observations/ database
are used to describe the relationship cleaning
same data set
16 _____ provides some built-in datasets that scikit-learn classification regression None of the above A
can be used for testing purposes.
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
17 While using _____ all labels are LabelEncoder class LabelBinarizer class DictVectorizer FeatureHasher A
turned into sequential numbers.
18 _______produce sparse matrices of real DictVectorizer FeatureHasher Both A & B None of the Mentioned C
numbers that can be fed into any machine
learning model.
19 scikit-learn offers the class______, which is LabelEncoder LabelBinarizer DictVectorizer Imputer D
responsible for filling the holes using a
strategy based on the mean, median, or
frequency
20 Which of the following scale data by MinMaxScaler MaxAbsScaler Both A & B None of the Mentioned C
removing elements that don't belong to a
given range or by considering a maximum
absolute value.
21 Which of the following model model MCV MARS MCRS All above B
include a backwards elimination feature
selection routine?
22 Can we extract knowledge without apply YES NO A
feature selection
23 While using feature selection on the data, NO YES B
is the number of features decreases.
24 Which of the following are several models regression classification None of the above C
for feature extraction
25 scikit-learn also provides a class for per- Normalizer Imputer Classifier All above A
sample normalization,_____
26 ______dataset with many features normalized unnormalized Both A & B None of the Mentioned B
contains information proportional to the
independence of all features and their
variance.
27 In order to assess how much information Concuttent matrix Convergance matrix Supportive matrix Covariance matrix D
is brought by each component, and the
correlation among them, a useful tool is
the_____.
28 The_____ parameter can assume different run start stop C
values which determine how the data init
matrix is initially processed.
29 ______allows exploiting the natural SparsePCA KernelPCA SVD init parameter A
sparsity of data while extracting principal
components.
30 Which of the following is an example of a PCA K-Means None of the above A
deterministic algorithm?
31 Let�s say, a �Linear regression� model A. You will always have test B. You can not have test C. None of the above c
perfectly fits the training data (train error error zero error zero
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
is zero). Now, Which of the following
statement is true?
32 In a linear regression problem, we are A. If R Squared increases, B. If R Squared decreases, C. Individually R squared D. None of these. c
using �R-squared� to measure this variable is significant. this variable is not cannot tell about variable
goodness-of-fit. We add a feature in linear significant. importance. We can�t say
regression model and retrain the same anything about it right now.
model.Which of the following option is
true?
33 Which of the one is true about A. Linear Regression with B. Linear Regression with C. Linear Regression with D. None of these a
Heteroskedasticity? varying error terms constant error terms zero error terms
34 Which of the following assumptions do A. 1,2 and 3. B. 1,3 and 4. C. 1 and 3. D. All of above. d
we make while deriving linear regression
parameters?1. The true relationship
between dependent y and predictor x is
linear2. The model errors are statistically
independent3. The errors are normally
distributed with a 0 mean and constant
standard deviation4. The predictor x is
non-stochastic and is measured error-free
35 To test linear relationship of y(dependent) A. Scatter plot B. Barchart C. Histograms D. None of these a
and x(independent) continuous variables,
which of the following plot best suited?
36 Generally, which of the following A. 1 and 2 B. only 1 C. only 2 D. None of these. b
method(s) is used for predicting
continuous dependent variable?1. Linear
Regression2. Logistic Regression
37 Suppose you are training a linear A. Both are False B. 1 is False and 2 is True C. 1 is True and 2 is False D. Both are True c
regression model. Now consider these
points.1. Overfitting is more likely if we
have less data2. Overfitting is more likely
when the hypothesis space is small.Which
of the above statement(s) are correct?
38 Suppose we fit �Lasso Regression� to a A. It is more likely for X1 to B. It is more likely for X1 to C. Can�t say D. None of these b
data set, which has 100 features be excluded from the model be included in the model
(X1,X2�X100).� Now, we rescale one of
these feature by multiplying with 10 (say
that feature is X1),� and then refit Lasso
regression with the same regularization
parameter.Now, which of the following
option will be correct?
39 Which of the following is true about A. Ridge regression uses B. Lasso regression uses C. Both use subset D. None of above b
�Ridge� or �Lasso� regression subset selection of features subset selection of features selection of features
methods in case of feature selection?
40 Which of the following statement(s) can A. 1 and 2 B. 1 and 3 C. 2 and 4 D. None of the above a
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
be true post adding a variable in a linear
regression model?1. R-Squared and
Adjusted R-squared both increase2. R-
Squared increases and Adjusted R-
squared decreases3. R-Squared
decreases and Adjusted R-squared
decreases4. R-Squared decreases and
Adjusted R-squared increases
41 We can also compute the coefficient of A. 1 and 2 B. 1 and 3. C. 2 and 3. D. 1,2 and 3. d
linear regression with the help of an
analytical method called �Normal
Equation�. Which of the following is/are
true about �Normal Equation�?1. We
don�t have to choose the learning rate2.
It becomes slow when number of features
is very large3. No need to iterate
42 How many coefficients do you need to A. 1 B. 2 C. Can�t Say b
estimate in a simple linear regression
model (One independent variable)?
43 �If two variables are correlated, is it A. Yes B. No b
necessary that they have a linear
relationship?
44 Correlated variables can have zero A. True B. False a
correlation coeffficient. True or False?
45 Which of the following option is true A. The relationship is B. The relationship is not C. The relationship is not D. The relationship is d
regarding �Regression� and symmetric between x and y symmetric between x and y symmetric between x and symmetric between x and y
�Correlation� ?Note: y is dependent in both. in both. y in case of correlation but in case of correlation but in
variable and x is independent variable. in case of regression it is case of regression it is not
symmetric. symmetric.
46 What is/are true about kernel in SVM?1. 1 2 1 and 2 None of these c
Kernel function map low dimensional data
to high dimensional space2. It�s a
similarity function
47 Suppose you are building a SVM model on Misclassification would Data will be correctly Can�t say None of these a
data X. The data X can be error prone happen classified
which means that you should not trust
any specific data point too much. Now
think that you want to build a SVM model
which has quadratic kernel function of
polynomial degree 2 that uses Slack
variable C as one of it�s hyper
parameter.What would happen when you
use very small C (C~0)?
48 Suppose you are using a Linear SVM yes no a svm.jpg
classifier with 2 class classification
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
problem. Now you have been given the
following data in which some points are
circled red that are representing support
vectors.If you remove the following any
one red points from the data. Does the
decision boundary will change?
49 If you remove the non-red circled points TRUE FALSE b svm.jpg
from the data, the decision boundary will
change?
50 When the C parameter is set to infinite, The optimal hyperplane if The soft-margin classifier None of the above a
which of the following holds true? exists, will be the one that will separate the data
completely separates the
data
51 Suppose you are building a SVM model on We can still classify data We can not classify data Can�t Say None of these a
data X. The data X can be error prone correctly for given setting of correctly for given setting
which means that you should not trust hyper parameter C of hyper parameter C
any specific data point too much. Now
think that you want to build a SVM model
which has quadratic kernel function of
polynomial degree 2 that uses Slack
variable C as one of it�s hyper
parameter.What would happen when you
use very large value of C(C->infinity)?
52 SVM can solve�linear�and non- TRUE FALSE a
linear�problems
53 The objective of the support vector TRUE FALSE a
machine algorithm is to find a hyperplane
in an N-dimensional space(N � the
number of features) that distinctly
classifies the data points.
54 Hyperplanes are _____________boundaries usual decision parallel b
that help classify the data points.�
55 The _____of the hyperplane depends upon dimension classification reduction a
the number of features.
56 Hyperplanes are decision boundaries that TRUE FALSE a
help classify the data points.�
57 SVM�algorithms�use�a set of TRUE FALSE a
mathematical functions that are defined
as the�kernel.
58 In SVM, Kernel function is used to map a TRUE FALSE a
lower dimensional data into a higher
dimensional data.
59 In SVR we try to fit the error within a TRUE FALSE a
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
certain threshold.
60 When the C parameter is set to infinite, The optimal hyperplane if The soft-margin classifier None of the above a
which of the following holds true? exists, will be the one that will separate the data
completely separates the
data
61 How do you handle missing or corrupted a. Drop missing rows or b. Replace missing values c. Assign a unique d. All of the above� d
data in a dataset? columns with mean/median/mode category to missing values
62 What is the purpose of performing cross- a. To assess the predictive b. To judge how the trained c. Both A and B� c
validation? performance of the models model performs outside the
sample on test data
63 Which of the following is true about Naive a. Assumes that all the b. Assumes that all the c. Both A and B� d. None of the above option c
Bayes ? features in a dataset are features in a dataset are
equally important independent
64 Which of the following statements about A.�����Attributes are B.�����Attributes are C.�����Attributes are D.�����Attributes can b
Naive Bayes is incorrect? equally important. statistically dependent of statistically independent of be nominal or numeric
one another given the class one another given the
value. class value.
65 Which of the following ��PCA ��Decision Tree ��Naive Bayesian Linerar regression a
is�not�supervised learning?
66 How can you avoid overfitting ? By using a lot of data By using inductive machine By using validation only None of above A --
learning
67 What are the popular algorithms of Decision Trees and Neural Probabilistic networks and Support vector machines All D --
Machine Learning? Networks (back Nearest Neighbor
propagation)
68 What is �Training set�? Training set is used to test A set of data is used to Both A & B None of above B --
the accuracy of the discover the potentially
hypotheses generated by the predictive relationship.
learner.
69 Identify the various approaches for Concept Vs Classification Symbolic Vs Statistical Inductive Vs Analytical All above D --
machine learning. Learning Learning Learning
70 what is the function of �Unsupervised Find clusters of the data and Find interesting directions Interesting coordinates All D --
Learning�? find low-dimensional in data and find novel and correlations
representations of the data observations/ database
cleaning
71 What are the two methods used for the Platt Calibration and Isotonic Statistics and A --
calibration in Supervised Learning? Regression Informal Retrieval
72 ______can be adopted when it's necessary Supervised Semi-supervised Reinforcement Clusters B --
to categorize a large amount of data with
a few complete examples or when there's
the need to impose some constraints to a
clustering algorithm.
73 In reinforcement learning, this feedback is Overfitting Overlearning Reward None of above C --
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
usually called as___.
74 In the last decade, many researchers Deep learning Machine learning Reinforcement learning Unsupervised learning A --
started training bigger and bigger models,
built with several different layers that's
why this approach is called_____.
75 there's a growing interest in pattern Regression Accuracy Modelfree Scalable C --
recognition and associative memories
whose structure and functioning are
similar to what happens in the neocortex.
Such an approach also allows simpler
algorithms called _____
76 ______ showed better performance than Machine learning Deep learning Reinforcement learning Supervised learning B --
other approaches, even without a context-
based model
77 Common deep learning applications / Real-time visual object Classic approaches Automatic labeling Bio-inspired adaptive B --
problems can also be solved using____ identification systems
78 Some people are using the term ___ Inference Interference Accuracy None of above A --
instead of prediction only to avoid the
weird idea that machine learning is a sort
of modern magic.
79 The term _____ can be freely used, but Accuracy Cluster Regression Prediction D --
with the same meaning adopted in
physics or system theory.
80 If there is only a discrete number of Modelfree Categories Prediction None of above B --
possible outcomes called _____.
81 A feature F1 can take certain value: A, B, Feature F1 is an example of Feature F1 is an example of It doesn�t belong to any Both of these B --
C, D, E, & F and represents grade of nominal variable. ordinal variable. of the above category.
students from a college.
Which of the following statement is true in
following case?
82 What would you do in PCA to get the Transform data to zero mean Transform data to zero Not possible None of these A --
same projection as SVD? median
83 What is PCA, KPCA and ICA used for? Principal Components Kernel based Principal Independent Component All above D --
Analysis Component Analysis Analysis
84 Can a model trained for item based YES NO A --
similarity also choose from a given set of
items?
85 What are common feature selection correlation coefficient Greedy algorithms All above None of these C --
methods in regression task?
86 The parameter______ allows specifying test_size training_size All above None of these C --
the percentage of elements to put into the
test/training set
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
87 In many classification problems, the random_state dataset test_size All above B --
target ______ is made up of categorical
labels which cannot immediately be
processed by any algorithm.
88 _______adopts a dictionary-oriented LabelEncoder class LabelBinarizer class DictVectorizer FeatureHasher A --
approach, associating to each category
label a progressive integer number.
89 ________is much more difficult because it's Removing the whole line Creating sub-model to Using an automatic All above B --
necessary to determine a supervised predict those features strategy to input them
strategy to train a model for each feature according to the other
and, finally, to predict their value known values
90 How it's possible to use a different regression classification random_state missing_values D --
placeholder through the
parameter_______.
91 If you need a more powerful scaling RobustScaler DictVectorizer LabelBinarizer FeatureHasher A --
feature, with a superior control on outliers
and the possibility to select a quantile
range, there's also the class________.
92 scikit-learn also provides a class for per- max, l0 and l1 norms max, l1 and l2 norms max, l2 and l3 norms max, l3 and l4 norms B --
sample normalization, Normalizer. It can
apply________to each element of a dataset
93 There are also many univariate methods F-tests and p-values chi-square ANOVA All above A --
that can be used in order to select the
best features according to specific criteria
based on________.
94 Which of the following selects only a SelectPercentile FeatureHasher SelectKBest All above A --
subset of features belonging to a certain
percentile
95 ________performs a PCA with non-linearly SparsePCA KernelPCA SVD None of the Mentioned B --
separable data sets.
96 �If two variables are correlated, is it Yes No B --
necessary that they have a linear
relationship?
97 Correlated variables can have zero TRUE FALSE A --
correlation coeffficient. True or False?
98 Suppose we fit �Lasso Regression� to a It is more likely for X1 to be It is more likely for X1 to be Can�t say None of these B --
data set, which has 100 features excluded from the model included in the model
(X1,X2�X100).� Now, we rescale one of
these feature by multiplying with 10 (say
that feature is X1),� and then refit Lasso
regression with the same regularization
parameter.Now, which of the following
option will be correct?
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
99 If Linear regression model perfectly first Test error is also always zero Test error is non zero Couldn�t comment on Test error is equal to Train C --
i.e., train error is zero, then Test error error
_____________________
100 Which of the following metrics can be ii and iv i and ii ii, iii and iv i, ii, iii and iv D --
used for evaluating regression models?i)
R Squaredii) Adjusted R Squarediii) F
Statisticsiv) RMSE / MSE / MAE
101 In syntax of linear model Matrix Vector Array List B --
lm(formula,data,..), data refers to ______
102 Linear Regression is a supervised TRUE FALSE A --
machine learning algorithm.
103 It is possible to design a Linear regression TRUE FALSE A --
algorithm using a neural network?
104 Which of the following methods do we Least Square Error Maximum Likelihood Logarithmic Loss Both A and B A --
use to find the best fit line for data in
Linear Regression?
105 Suppose you are training a linear Both are False 1 is False and 2 is True 1 is True and 2 is False Both are True C --
regression model. Now consider these
points.1. Overfitting is more likely if we
have less data2. Overfitting is more likely
when the hypothesis space is small.Which
of the above statement(s) are correct?
106 We can also compute the coefficient of 1 and 2 1 and 3. 2 and 3. 1,2 and 3. D --
linear regression with the help of an
analytical method called �Normal
Equation�. Which of the following is/are
true about �Normal Equation�?1. We
don�t have to choose the learning rate2.
It becomes slow when number of features
is very large3. No need to iterate
107 Which of the following option is true The relationship is The relationship is not The relationship is not The relationship is D --
regarding �Regression� and symmetric between x and y symmetric between x and y symmetric between x and symmetric between x and y
�Correlation� ?Note: y is dependent in both. in both. y in case of correlation but in case of correlation but in
variable and x is independent variable. in case of regression it is case of regression it is not
symmetric. symmetric.
108 In a simple linear regression model (One by 1 no change by intercept by its slope D --
independent variable), If we change the
input variable by 1 unit. How much output
variable will change?
109 Generally, which of the following 1 and 2 only 1 only 2 None of these. B --
method(s) is used for predicting
continuous dependent variable?1. Linear
Regression2. Logistic Regression
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
110 How many coefficients do you need to 1 2 3 4 B --
estimate in a simple linear regression
model (One independent variable)?
111 Suppose you are building a SVM model on We can still classify data We can not classify data Can�t Say None of these A --
data X. The data X can be error prone correctly for given setting of correctly for given setting
which means that you should not trust hyper parameter C of hyper parameter C
any specific data point too much. Now
think that you want to build a SVM model
which has quadratic kernel function of
polynomial degree 2 that uses Slack
variable C as one of it�s hyper
parameter.What would happen when you
use very large value of C(C->infinity)?
112 SVM can solve�linear�and non- TRUE FALSE A --
linear�problems
113 The objective of the support vector TRUE FALSE A --
machine algorithm is to find a hyperplane
in an N-dimensional space(N � the
number of features) that distinctly
classifies the data points.
114 Hyperplanes are _____________boundaries usual decision parallel B --
that help classify the data points.�
115 When the C parameter is set to infinite, The optimal hyperplane if The soft-margin classifier None of the above A --
which of the following holds true? exists, will be the one that will separate the data
completely separates the
data
116 SVM is a ------------------ learning Supervised Unsupervised Both None A --
117 The linear�SVM�classifier works by True FALSE A --
drawing a straight line between two
classes
118 In a real problem, you should check to see TRUE FALSE B --
if the SVM is separable and then include
slack variables if it is not separable.
119 Which of the following are real world Text and Hypertext Image Classification Clustering of News All of the above D --
applications of the SVM? Categorization Articles
120 The _____of the hyperplane depends upon dimension classification reduction A --
the number of features.
121 Hyperplanes are decision boundaries that TRUE FALSE A --
help classify the data points.�
122 SVM�algorithms�use�a set of TRUE FALSE A --
mathematical functions that are defined
as the�kernel.
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
123 Naive Bayes classifiers are a collection Classification Clustering Regression All A --
------------------of algorithms�
124 In given image, P(H|E) Posterior Prior A bayes.jpg
is__________probability.
125 Solving a non linear separation problem True FALSE A
with a hard margin Kernelized SVM
(Gaussian RBF Kernel) might lead to
overfitting
126 100 people are at party. Given data gives TRUE FALSE A man.jpg
information about how many wear pink or
not, and if a man or not. Imagine a pink
wearing guest leaves, was it a man?
127 For the given weather data, Calculate 0.4 0.64 0.29 0.75 B weather
probability of playing data.jpg
128 In SVM, Kernel function is used to map a TRUE FALSE A --
lower dimensional data into a higher
dimensional data.
129 In SVR we try to fit the error within a TRUE FALSE A --
certain threshold.
130 When the C parameter is set to infinite, The optimal hyperplane if The soft-margin classifier None of the above A --
which of the following holds true? exists, will be the one that will separate the data
completely separates the
data
This sheet
is for 3
Mark
questions
S.r No Question a b c d Correct Image
Answer
e.g 1 Write down question Option a Option b Option c Option d a/b/c/d img.jpg
1 Which of the following is characteristic of fast accuracy scalable All above D
best machine learning method ?
2 What are the different Algorithm Supervised Learning and Unsupervised Learning and Both A & B None of the Mentioned C
techniques in Machine Learning? Semi-supervised Learning Transduction
3 ______can be adopted when it's necessary Supervised Semi-supervised Reinforcement Clusters B
to categorize a large amount of data with
a few complete examples or when there's
the need to impose some constraints to a
clustering algorithm.
4 In reinforcement learning, this feedback is Overfitting Overlearning Reward None of above C
usually called as___.
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
5 In the last decade, many researchers Deep learning Machine learning Reinforcement learning Unsupervised learning A
started training bigger and bigger models,
built with several different layers that's
why this approach is called_____.
6 What does learning exactly mean? Robots are programed so A set of data is used to Learning is the ability to It is a set of data is used to C
that they can perform the discover the potentially change according to discover the potentially
task based on data they predictive relationship. external stimuli and predictive relationship.
gather from sensors. remembering most of all
previous experiences.
7 When it is necessary to allow the model to Overfitting Overlearning Classification Regression A
develop a generalization ability and avoid
a common problem called______.
8 Techniques involve the usage of both Supervised Semi-supervised Unsupervised None of the above B
labeled and unlabeled data is called___.
9 there's a growing interest in pattern Regression Accuracy Modelfree Scalable C
recognition and associative memories
whose structure and functioning are
similar to what happens in the neocortex.
Such an approach also allows simpler
algorithms called _____
10 ______ showed better performance than Machine learning Deep learning Reinforcement learning Supervised learning B
other approaches, even without a context-
based model
11 Which of the following sentence is Machine learning relates Data mining can be defined Both A & B None of the above C --
correct? with the study, design and as the process in which the
development of the unstructured data tries to
algorithms that give extract knowledge or
computers the capability to unknown interesting
learn without being explicitly patterns.
programmed.
12 What is �Overfitting� in Machine when a statistical model Robots are programed so While involving the process a set of data is used to A --
learning? describes random error or that they can perform the of learning �overfitting� discover the potentially
noise instead of underlying task based on data they occurs. predictive relationship
relationship �overfitting� gather from sensors.
occurs.
13 What is �Test set�? Test set is used to test the It is a set of data is used to Both A & B None of above A --
accuracy of the hypotheses discover the potentially
generated by the learner. predictive relationship.
14 what is the function of �Supervised Classifications, Predict time Speech recognition, Both A & B None of above C --
Learning�? series, Annotate strings Regression
15 Commons unsupervised applications Object segmentation Similarity detection Automatic labeling All above D --
include
16 Reinforcement learning is particularly the environment is not it's often very dynamic it's impossible to have a All above D --
This sheet FIELD2 FIELD3 FIELD4 FIELD5 FIELD6 FIELD7 FIELD8 FIELD9
is for 1
Mark
questions
efficient when______________. completely deterministic precise error measure
17 During the last few years, many ______ Logical Classical Classification None of above D --
algorithms have been applied to deep
neural networks to learn the best policy
for playing Atari video games and to teach
an agent how to associate the right action
with an input representing the state.
18 Common deep learning applications Image classification, Autonomous car driving, All above D --
include____ Real-time visual tracking Logistic optimization Bioinformatics,
Speech recognition
19 if there is only a discrete number of Regression Classification. Modelfree Categories B --
possible outcomes (called categories),
the process becomes a______.
20 Which of the following are supervised Spam detection, Image classification, Autonomous car driving, A --
learning applications Pattern detection, Real-time visual tracking Logistic optimization Bioinformatics,
Natural Language Speech recognition
Processing
21 Let�s say, you are working with All categories of categorical Frequency distribution of Train and Test always have Both A and B D --
categorical feature(s) and you have not variable are not present in categories is different in same distribution.
looked at the distribution of the the test dataset. train as compared to the
categorical variable in the test data. test dataset.
a. Supervised Learning
b. Unsupervised Learning
c. Semi-supervised Learning
d. Reinforcement Learning
4. Supervised Learning algorithms are accompanied by both Input and Expected Output?
a. True- answer
b. False
a. Regression- answer
b. Classification
c. Association Rule mining
d. All of these
8. k-NN algorithm does more computation on ‘test’ time rather than ‘train’ time.
a. True- answer
b. False
a. Manhattan
b. Minkowski
c. Jaccard
d. Mahalanobis
e. All can be used- answer
10. Which of the following machine learning algorithm can be used for imputing missing values
of both categorical and continuous variables?
a. K-NN- answer
b. Linear Regression
c. Logistic Regression
d. Decision Tree
11. Which of the following algorithm isNOT an example of ensemble learning algorithm
a. Random Forest
b. Adaboost
c. Gradient Boosting
d. Decision Trees
a. Semi-supervised learning.
b. Supervised Learning
c. Unsupervised Learning
d. All of these
14. Unsupervised Learning algorithms are accompanied by both Input and Expected Output?
a. True
b. False (Only Input) - answer
a. Clustering- answer
b. Classification
c. Regression
d. Association
a. Centroid-based Clustering
b. Density-based Clustering
c. Hierarchical Clustering
d. All of the above- answer
17. Learning algorithms that use both labelled and unlabelled data can be categorised as
a. Supervised Algorithms
b. Unsupervised Algorithms
c. Semi-supervised Algorithms- answer
d. Reinforcement Learning
a. True- answer
b. False
19. When the number of output classes is greater than one, which is / are the possible strategy
used to handle them
a. One-vs-All
b. One-vs-One
c. Both of them- answer
d. None of the above
20. In One-vs-All strategy how many classifiers are trained for n classes
a. 1
b. n- answer
c. n/2
d. None of the above
21. In One-vs-One strategy how many classifiers are trained for n classes
a. 1
b. n
c. n*(n-1)/2- answer
d. n/2
22. When the model isn't able to capture the dynamicsshown by the same training set, such
situation is called as
a. Underfitting- answer
b. Overfitting
c. Normal fitting
d. Regularization
23. When the model can associate almost perfectly all the known samples to the corresponding
output values, but when an unknown input is presented, the corresponding prediction error
can be very high, such situation is called as
a. Underfitting
b. Overfitting- answer
c. Normal fitting
d. None of these
a. Information Gain
b. Entropy- answer
c. Probability of an event
d. None of the above
a. Logistic Regression
b. Naïve Bayes
c. K-Nearest Neighbors- answer
d. Simple Neural Networks
28. Which of the factors affect the performance of learner system does not include?
a. Adaptive system
b. Non-adaptive system- answer
c. Both
d. None of the above
a. K-Nearest Neighbor
b. Decision Tree
c. K-means- answer
d. Linear Regression
a. Supervised Learning
b. Unsupervised Learning
c. Reinforcement Learning
d. All of the above- answer
32. In which type of Learning, both features and labels are given to an algorithm?
33. In which type of learning, the algorithm maps input variable to output variable?
a. Classification
b. Regression
c. Clustering- answer
d. None of the above
a. Classification- answer
b. Clustering
c. Regression
d. Association
a. Classification
b. Clustering
c. Regression- answer
d. Association
37. In which learning technique, the system discovers patterns from dataset?
a. Supervised Learning
b. Unsupervised Learning- answer
c. Reinforcement Learning
d. None of the above
38. In which type of learning, the problem can be solved without knowing labels?
a. Supervised Learning
b. Unsupervised Learning- answer
c. All of the above
d. None of the above
a. Clustering- answer
b. Association
c. Regression
d. None of the above
a. Clustering
b. Association- answer
c. Regression
d. None of the above
41. From the following, which is best suited to build a game of chess?
a. Supervised Learning
b. Unsupervised Learning
c. Deep Learning- answer
d. None of the above
a. Supervised Learning
b. Unsupervised Learning
c. Reinforcement Learning- answer
d. None of the above
a. Supervised Learning
b. Unsupervised Learning- answer
c. Reinforcement Learning
d. None of the above
44. From the options, which application you should solve by deep learning for the best
performance?
a. Spam filtering
b. Image classification- answer
c. Sales prediction
d. Automatic labelling
45. A neural network model is said to be inspired from the human brain.Which of the following
statement(s) correctly represents a real neuron?
a. Underfitting
b. Overfitting- answer
c. Both
d. Not a and b
a. Underfitting- answer
b. Overfitting
c. Both
d. None of the above
Unit 1: Two marks questions
1. The goal(s) of the supervised learning system is (are) ___________
a. Training a system that must also work with samples never seen before.
b. To allow the model to develop a generalization ability and avoid a common problem
called over fitting
c. Supervisor: to provide the agent with a precise measure of its error
d. All of the above- answer
a. Reinforcement learning
b. Supervised learning- answer
c. Un supervised learning
d. Semi supervised learning
6. Identify the type of Machine learning approach to solve the given problems:
Decision Support System to predict the decision to play Match or not to play
a. Reinforcement learning
b. Supervised learning- answer
c. Un supervised learning
d. Semi supervised learning
7. Identify the type of Machine learning approach to solve the given problems:
Grouping of documents retrieved by Google Search Engine
a. Reinforcement learning
b. Supervised learning
c. Un supervised learning- answer
d. Semi supervised learning
8. Identify the type of Machine learning approach to solve the given problems:
a. Reinforcement learning
b. Supervised learning- answer
c. Unsupervised learning
d. Semi supervised learning
9. Identify the type of Machine learning approach to solve the given problems:
System to predict the suitable treatment
a. Reinforcement learning
b. Supervised learning
c. Un supervised learning
d. Semi supervised learning
10. Identify the type of Machine learning approach to solve the given problems:
System for Driverless Car
a. Reinforcement learning- answer
b. Supervised learning
c. Unsupervised learning
d. Semi supervised learning
1. For creating Training and Test datasets which statements are true?
a. Both datasets must reflect the original distribution
b. The original dataset must be randomly shuffled before the split phase in order to avoid
correlation between consequent elements
c. Both a and b - answer
d. None of the above
4. Scikit-learn class Imputer fills the holes using a strategy based on the:
a. mean
b. median
c. frequency (the most frequent entry)
d. All of the above- answer
10. Consider Q1=31 and Q3=119. The inter quartile range (IQR) will be______
a. 88 - answer
b. -88
c. 150
d. -150
MCQs on unit 2 (One mark question)
1) Which of the following contains train_test_split() function
A) sklearn.feature_extraction
B) sklearn.preprocessing
C) sklearn.model_selection- answer
D) sklearn.decomposition
2) Default value of test_size in train_test_split() when both test_size and train_size are none
A) 0.33
B) 0.25 - answer
C) 0.50
D) 0.20
A) Dictionary-oriented- answer
B) List-oriented
C) Tree-oriented
D) Map-oriented
A) SHA256
B) MD5
C) MurmurHash 3- answer
D) BLAKE3
6) When performing regression or classification, which of the following is the correct way to
preprocess the data?
10) Principal component analysis is a method to select only a subset of features which contain
the largest amount of?
A) Total covariance
B) Total variance - answer
C) Total count
D) Mean
11) In the following loss function which parametercontrols the level of sparsity?
A) xi
B) c - answer
C) D
D) αi
12) Which parameter determines the number of atoms in scikit-learn DictioanryLearning class?
A) alpha
B) n_jobs
C) n_components - answer
D) tol
14) Non negative matrix factorization algorithm optimizes a loss function based on?
A) L1 Norm
B) Frobenius norm - answer
C) linalgnorm
D) matrix norm
15) Which of the following encoding technique is efficient to deal with large number of possible
categories?
A) Effect Encoding
B) Feature Hashing
C) One Hot Encoding
D) Bin counting scheme - answer
16) Which scaling technique scales data without being affected by outliers?
A) Filter Methods
B) Wrapper Methods - answer
C) Embedded Methods
D) Subset Methods
18) From the following which can be applied on dataset with more than one dimension?
A) Mean
B) Standard Deviation
C) Covariance - answer
D) Variance
19) In principal component analysis the sparse loadings can be obtained by imposing which
constraint on regression coefficients:
A) Ridge
B) Lasso - answer
C) Linear
D) Logistic
21) Eigen vector with ____ Eigen value is the principle component of dataset.
A) Lowest
B) Highest - answer
C) Mean
D) Zero
22) Trace is equal to the ___ of the Eigen values.
A) Difference
B) Sum - answer
C) Product
D) Mean
23) In which scaling technique the upper and lower can be specified by user?
A) Robust Scaling
B) Min Max Scaling - answer
C) Standardized Scaling
D) Z-score Scaling
24) Principal component analysis (PCA) can be used with variables of any mathematical types:
quantitative, qualitative, ora mixture of these types.
A) True
B) False - answer
25) Variances and covariances can be computed for variables of any mathematical types:
quantitative, qualitative, or a mixture of these types.
A) True
B) False - answer
Unit- 3: Regression (One mark)
1. A process by which we estimate the value of dependent variable on the basis of one or more
independent variables is called:
a. Correlation
b. Regression - answer
c. Residual
d. Slope
2. All data points falling along a straight line is called:
a. Linear relationship - answer
b. Non linear relationship
c. Residual
d. Scatter diagram
3. A relationship where the flow of the data points is best represented by a curve is called:
a. Linear relationship
b. Nonlinear relationship - answer
c. Linear positive
d. Linear negative
4. The value we would predict for the dependent variable when the independent variables are all
equal to zero is called:
(a) Slope
(b) Sum of residual
(c) Intercept - answer
(d) Difficult to tell
5. The predicted rate of response of the dependent variable to changes in the independent variable is
called:
(a) Slope - answer
(b) Intercept
(c) Error
(d) Regression equation
6. The slope of the regression line of Y on X is also called the:
(a) Correlation coefficient of X on Y
(b) Correlation coefficient of Y on X
(c) Regression coefficient of X on Y
(d) Regression coefficient of Y on X - answer
8. In simple linear regression, the numbers of unknown constants are:
(a) One
(b) Two - answer
(c) Three
(d) Four
9. In simple regression equation, the numbers of variables involved are:
(a) 0
(b) 1
(c) 2 - answer
(d) 3
10. If the value of any regression coefficient is zero, then two variables are:
(a) Qualitative
(b) Correlation
(c) Dependent
(d) Independent- answer
11. In SK-Learn Linear Regression offers two instance variables, __________ and ____________
a) intercept_ and coef_ - answer
b) Intercept and coef
c) Slope and Intercept
d) Slope and Coef
12. _________ regression imposes an additional shrinkage penalty to the ordinary least squares loss
function to limit its squared L2 norm:
a) Lasso
b) LassoCV
c) Ridge - answer
d) ElasticNet
13. _____________ regressor imposes a penalty on the L1 norm of w to determine a potentially
higher number of null coefficients:
a) Lasso - answer
b) RidgeCV
c) Ridge
d) ElasticNet
14. A Regression approach to avoid the problem of outliers is offered by _______________
a) Linear Regression
b) Logistic Regression
c) RANSAC Regressor - answer
d) Polynomial Regressor
16. ________ occurs when our model neither fits the training data nor generalizes on the new data.
a) Over-fitting
b) Under-fitting - answer
c) Best fitting
d) None of the above
17. ________________ is the process of adding information in order to solve an ill-posed problem
or to prevent overfitting
a) Under-fitting
b) Regularization - answer
c) Best fitting
d) None of the above
18. ____________ selects the only some feature while reduces the coefficients of others to zero.
This property is known as feature selection
a) Lasso - answer
b) RidgeCV
c) Ridge
d) ElasticNet
19. ______ combines both Lasso and Ridge Regression into one model with two penalty factors, one
proportional to L1 norm and other proportional to L2 norm.
a) LassoCV
b) RidgeCV
c) ElasticNet - answer
d) None of the above
20. ____________minimizes the cost function by gradually updating the weight values.
a) Linear Regression
b) Logistic Regression
c) RANSAC Regressor
d) Polynomial Regressor - answer
22. The Regression technique that uses sigmoid function is called________________
a) Linear Regression
b) Logistic Regression - answer
c) RANSAC Regressor
d) Polynomial Regressor
23. Confusion Matrix can be used to measure the performance of _______________ model.
a) Linear Regression
b) Logistic Regression - answer
c) RANSAC Regressor
d) Polynomial Regressor
24. The residual is defined as the difference between the:
a) actual value of y and the estimated value of y - answer
b) actual value of x and the estimated value of x
c) actual value of y and the estimated value of x
d) actual value of x and the estimated value of y
25)Which of the following methods do we use to find the best fit line for data in Linear
Regression?
A) Least Square Error
B) Maximum Likelihood
C) Logarithmic Loss
D) Both A and B
Answer:(A)
26)True- False: Overfitting is more likely when you have a huge amount of data to train.
A) TRUE
B) FALSE
Solution: (B)
27) What will happen when you apply very large penalty in the case of Lasso?
A) Some of the coefficients will become zero
B) Some of the coefficients will be approaching to zero but not absolute zero
C) Both A and B depending on the situation
D) None of these
Solution: (A)
28) Generally, which of the following method(s) is used for predicting continuous dependent
variable?
1. Linear Regression 2. Logistic Regression
A) 1 and 2
B)only 1
C)only 2
D)None of these
Solution:(B)
31)Which is L1 regression
A)Lasso
B)Ridge
C)polynomial
D)Isotonic
Answer A
32)Which of the following is true about “Ridge” or “Lasso” regression methods in case of feature
selection?
A) Ridge regression uses subset selection of features
B)Lasso regression uses subset selection of features
C)Both use subset selection of features
D)None of the above
Solution:(B)
35) Which of the following statement is true about outliers in Linear regression?
A) Linear regression is sensitive to outliers
B) Linear regression is not sensitive to outliers
C) Can’t say
D) None of these
Solution: (A)
36) What do you expect will happen with bias and variance as you increase the size of training
data?
A) Bias increases and Variance increases
B) Bias decreases and Variance increases
C) Bias decreases and Variance decreases
D) Bias increases and Variance decreases
Solution: (D)
37)A Pearson correlation between two variables is zero but, still, their values can still be related
to each other.
A) TRUE
B) FALSE
Solution: (A)
38) Which of the following statement(s) is / are true for Gradient Decent (GD) and Stochastic
Gradient Decent (SGD)?
1. In GD and SGD, you update a set of parameters in an iterative manner to minimize the
error function.
2. In SGD, you have to run through all the samples in your training set for a single update of
a parameter in each iteration.
3. In GD, you either use the entire data or a subset of training data to update a parameter in
each iteration.
A) Only 1
B) Only 2
C) Only 3
D) 1 and 2
Solution:(A)
39) When hypothesis tests and confidence limits are to be used, the residuals are assumed
to follow the __________distribution.
A) Formal
B) Mutual
C) Normal
D) Abnormal
Solution:(C)
40)The error due to simplistic assumptions made by the model in fitting the data is called as
A)variance
B)bias
C)MSE
D)none of these
Solution:(B)
43) Least square method calculates the best-fitting line for the observed data by minimizing the sum
of the squares of the _______ deviations.
a) Vertical
b) Horizontal
c) Both of these
d) None of these
Solution:(A)
Unit-3 (Two marks)
1. The regression line yhat = 3 + 2x has been fitted to the data points (4,8), (2,5), and (1,2). The
residual sum of squares will be:
a) 10
b) 15
c) 13
d) 22 - answer
2. Suppose you have trained a logistic regression classifier and it outputs a new example x with a
prediction ho(x) = 0.2. This means
a. Our estimate for P(y=1 | x)
b. Our estimate for P(y=0 | x) - answer
c. Our estimate for P(y=1 | x)
d. Our estimate for P(y=0 | x)
3. A regression analysis between sales (in $1000) and advertising (in $100) resulted in the following
least squares line: yhat = 75 +6x. This implies that if advertising is $800, then the predicted amount
of sales (in dollars) is:
a. $4875 - answer
b. $123,000
c. $487,500
d. $12,300
4. The value for SSE equals zero. This means that the coefficient of determination (r^2) must equal:
a. 0.0.
b. -1.0.
c. 2.3.
d. 1.0 - answer
a) 12.58 - answer
b) 10.58
c) 11.85
d) 10.85
7. For the given results of a recently conducted study on the correlation of the number of hours spent
driving with the risk of developing acute backache. The slope of the line is_______.
a) 4.59 - answer
b) 10.58
c) 5.85
d) 10.85
8. for the given vector of outputs the Mean squared error is ________.
y_true = [3, -0.5, 2, 7]
y_pred = [2.5, 0.0, 2, 8]
a) 0.45
b) 0.375 - answer
c) 0.56
d) None of the above
9)The correct relationship between SST, SSR, and SSE is given by;
a) SSR = SST + SSE
b) SST = SSR + SSE
c) SSE = SSR – SST
d) all of the above
Solution:(B)
10)Stochastic gradient descent performs less computation per update than batch gradient descent.
A)True
B)False
Solution:(A)
11)A parameter that is external to model and whose value cannot be estimated from data is called as
A)Hyperparameter
B)Model Parameter
C)Outlier
D)Regularization constant
Solution:(A)
14)The most widely used metrics and tools to assess a classification model are:
A)Confusion matrix
B)Cost-sensitive accuracy
C)Area under the ROC curve
D)All of the above
Solution:(D)
16) In practice, Line of best fit or regression line is found when _____________
a) Sum of residuals (Σ(Y – h(X))) is minimum
b) Sum of the absolute value of residuals (Σ|Y-h(X)|) is maximum
c) Sum of the square of residuals ( Σ (Y-h(X))2) is minimum
d) Sum of the square of residuals ( Σ (Y-h(X))2) is maximum
Solution:(C)
Unit- 4 : Naïve Bayes and SVM
(one mark)
1. Naive bayes falls under which category-
a. Unsupervised classification learning
b. Supervised classification learning
c. Semi- supervised classification learning
d. Reinforcement learning
Ans - b
2. What machine learning task is the Naive Bayes algorithm used for?
a. dimensionality reduction
b. clustering
c. classification
d. regression
Ans - c
3. Naive Bayes assumption about data is-
a. input is independent, conditional on the output label.
b. input is dependent, conditional on the output label.
c. input is independent, not conditional on the output label.
d. input is dependent, not conditional on the output label.
Ans - a
4. Bayes rule:
a. P(A |B) = P(B|A) .P(B) / P(A)
b. P(A |B) = P(B|A) .P(A) / P(B)
c. P(A |B) = P(B|A) .P(A)
d. P(A |B) = P(B|A) .P(B)
Ans - b
7. Which type of naive bayes classifier is best suited for document classification problem -
a. Bernoulli naive bayes
b. Multinomial naive bayes
c. Gaussian naive bayes
d. Complement Naive bayes
Ans - b
8. Which type of naive bayes classifiers is usually used for yes/no type boolean predictores-
a. Bernoulli naive bayes
b. Multinomial naive bayes
c. Gaussian naive bayes
d. Complement Naive bayes
Ans - a
(Two marks)
1. One marble jar has several different colored marbles inside of it. It has 1 red, 2 green, 4 blue, and
8 yellow marbles. All the marbles are the same size and shape. If Peter takes out a marble from the
jar without looking, what is the probability that he will NOT choose a yellow marble.
a. 7/15
b. 8/15
c. 7/8
d. 5/8
Ans- a
2. If we train a Naive Bayes classifier using infinite training data that satisfies all of its modeling
assumptions , then in general, what can we say about the training error (error in training data) and
test error (error in held-out test data)?
a. It may not achieve either zero training error or zero test error
b. It will always achieve zero training error and zero test error.
c. It will always achieve zero training error but may not achieve zero test error.
d. It may not achieve zero training error but will always achieve zero test error.
Ans - a
12. Which of the following are real world applications of the SVM?
a. Text and Hypertext Categorization
b. Image Classification
c. Clustering of News Articles
d. All of the above
Ans- d
4. Which of the following is a widely used and effective machine learning algorithm based on the
idea of bagging?
a. Decision Tree
b. Regression
c. Classification
d. Random Forest - answer
5. In the given formula of Decision Tree family , what A and D represents?
Gain(A) = Cross_Entropy(D) – EntropyA(D)
a. Attribute, Decision
b. Attribute, Dataset- answer
c. Probability, Dataset
d. None of the above
6. In the given formula of Decision Tree family , which are the given statements are true?
Gain(A) = Cross_Entropy(D) – EntropyA(D)
7. A _________ is a decision support tool that uses a tree-like graph or model of decisions and
their possible consequences, including chance event outcomes, resource costs, and utility.
a. Decision tree- answer
b. Graphs
c. Trees
d. Neural Networks
10. The most widely used metrics and tools to assess a classification model are:
a. Confusion matrix
b. Cost-sensitive accuracy
c. Area under the ROC curve
d. All of the above - answer
11. Which of the following is a good test dataset characteristic?
a. Large enough to yield meaningful results
b. Is representative of the dataset as a whole
c. Both A and B - answer
d. None of the above
12. Which of the following is a disadvantage of decision trees?
a. Factor analysis
b. Decision trees are robust to outliers
c. Decision trees are prone to be overfit - answer
d. None of the above
13. What is the purpose of performing cross-validation?
a. To assess the predictive performance of the models
b. To judge how the trained model performs outside the sample on test data
c. Both A and B – answer
d. None of the above
2.Bagging is the method for improving the performance by aggregating the results of weak
learners
A) 1
B) 2
C) 1 and 2- answer
D) None of these
A) 1
B) 2- answer
C) 1 and 2
D) None of these
16. Which of the following algorithm are not an example of ensemble learning algorithm?
A) Random Forest
B) Adaboost
C) Extra Trees
D) Gradient Boosting
E) Decision Trees- answer
17. Suppose you are using a bagging based algorithm say a RandomForest in model building.
Which of the following can be true?
A) 1- answer
B) 2
C) 1 and 2
D) None of these
18. True-False: The bagging is suitable for high variance low bias models?
A) TRUE- answer
B) FALSE
19. In which of the following scenario a gain ratio is preferred over Information Gain?
20. In K-means clustering, the distance between each sample and each centroid is computed and the
sample is assigned to the cluster where the distance is minimum. This approach is often called ----
3)The algorithm stops when the centroids become stable and, therefore, the inertia is minimized
22. [True or False] k-NN algorithm does more computation on test time rather than train
time.
A) TRUE - answer
B) FALSE
2. In Random forest you can generate hundreds of trees (say T1, T2 …..Tn) and then aggregate the
results of these tree. Which of the following is true about individual(Tk) tree in Random Forest?
A) 1 and 3 - answer
B) 1 and 4
C) 2 and 3
D) 2 and 4
3. Which of the following algorithm doesn’t uses learning Rate as of one of its hyperparameter?
1. Gradient Boosting
2. Extra Trees
3. AdaBoost
4. Random Forest
A) 1 and 3
B) 1 and 4
C) 2 and 3
D) 2 and 4 - answer
4. Which of the following algorithm would you take into the consideration in your final model
building on the basis of performance?
Suppose you have given the following graph which shows the ROC curve for two different
classification algorithms such as Random Forest(Red) and Logistic Regression(Blue)
A) Random Forest- anwser
B) Logistic Regression
C) Both of the above
D) None of these
5. Which of the following is true about training and testing error in such case?
Suppose you want to apply AdaBoost algorithm on Data D which has T observations. You
set half the data for training and half for testing initially. Now you want to increase the
number of data points for training T1, T2 … Tn where T1 < T2…. Tn-1 < Tn.
E) The difference between training error and test error increases as number of observations
increases
B) The difference between training error and test error decreases as number of
observations increases- answer
C) The difference between training error and test error will not change
D) None of These
6. In random forest or gradient boosting algorithms, features can be of any type. For example,
it can be a continuous feature or a categorical feature. Which of the following option is true
when you consider these types of features?
A) Only Random forest algorithm handles real valued attributes by discretizing them
B) Only Gradient boosting algorithm handles real valued attributes by discretizing them
C) Both algorithms can handle real valued attributes by discretizing them- answer
D) None of these
7. Consider the following figure for answering the next few questions. In the figure, X1 and X2
are the two features and the data point is represented by dots (-1 is negative class and +1 is a
positive class). And you first split the data based on feature X1(say splitting point is x11)
which is shown in the figure using vertical line. Every value less than x11 will be predicted
as positive class and greater than x will be predicted as negative class.
1. In each stage, introduce a new regression tree to compensate the shortcomings of existing
model
2. We can use gradient decent method for minimize the loss function
A) 1
B) 2
C) 1 and 2- answer
D) None of these
a. cluster_centers_
b. inertia_
c. n_clusters
d. all of the above
10. In which of the following cases will K-means clustering fail to give good results?
1) Data points with outliers 2) Data points with different densities 3) Data points with
nonconvex shapes
1. 1 and 2
2. 2 and 3
3. 1, 2, and 3 - answer
4. 1 and 3
11. Which of the following is a reasonable way to select the number of clusters "k"?
1. Choose k to be the smallest value so that at least 99% of the varinace is retained.
2. Choose k to be 99% of m (k = 0.99*m, rounded to the nearest integer).
3. Choose k to be the largest value so that 99% of the variance is retained.
4. Use the elbow method- answer
12. A company has build a kNN classifier that gets 100% accuracy on training data. When they
deployed this model on client side it has been found that the model is not at all accurate.
Which of the following thing might gone wrong?
Note: Model has successfully deployed and no technical issues are found at client side except
the model performance
13. In k-NN it is very likely to overfit due to the curse of dimensionality. Which of the
following option would you consider to handle such problem?
1. Dimensionality Reduction
2. Feature selection
A) 1
B) 2
C) 1 and 2 - answer
D) None of these
14. In the image below, which would be the best value for k assuming that the algorithm you are
using is k-Nearest Neighbor.
A) 3
B) 10 - answer
C) 20
D 50
15. Which of the following is/are not true about DBSCAN clustering algorithm:
1. For data points to be in a cluster, they must be in a distance threshold to a core point
2. It has strong assumptions for the distribution of data points in dataspace
3. It has substantially high time complexity of order O(n3)
4. It does not require prior knowledge of the no. of desired clusters
5. It is robust to outliers
Options:
A. 1 only
B. 2 only
C. 4 only
D. 2 and 3 - answer
1. After performing K-Means Clustering analysis on a dataset, you observed the following
dendrogram. Which of the following conclusion can be drawn from the dendrogram?
D. The above dendrogram interpretation is not possible for K-Means clustering analysis -
answer
3. In the figure below, if you draw a horizontal line on y-axis for y=2. What will be the number
of clusters formed?
A. 1
B. 2 - answer
C. 3
D. 4
4. What should be the best choice for number of clusters based on the following results:
A. 5
B. 6 - answer
C. 14
D. Greater than 14
5. Which of the following is/are not true about Centroid based K-Means clustering algorithm
and Distribution based expectation-maximization clustering algorithm:
Options:
A. 1 only
B. 5 only - answer
C. 1 and 3
D. 6 and 7
7. If you are using Multinomial mixture models with the expectation-maximization algorithm for
clustering a set of data points into two clusters, which of the assumptions are important:
• x1, x2,…, xN: These are inputs to the neuron. These can either be the actual observations
from input layer or an intermediate value from one of the hidden layers.
• w1, w2,…,wN: The Weight of each input.
• bi: Is termed as Bias units. These are constant values added to the input of the activation
function corresponding to each weight. It works similar to an intercept term.
• a: Is termed as the activation of the neuron which can be represented as
• and y: is the output of the neuron
Considering the above notations, will a line equation (y = mx + c) fall into the category of a
neuron?
A. Yes- answer
B. No
9. In the graph below, we observe that the error has many “ups and downs”
Should we be worried?
A. Yes, because this means there is a problem with the learning rate of neural network.
B. No, as long as there is a cumulative decrease in both training and validation error,
we don’t need to worry - answer
1. Which of the following metrics, do we have for finding dissimilarity between two clusters in
hierarchical clustering?
1. Single-link
2. Complete-link
3. Average-link
Options:
A. 1 and 2
B. 1 and 3
C. 2 and 3
D. 1, 2 and 3 - answer
3. If you increase the number of hidden layers in a Multi Layer Perceptron, the classification
error of test data always decreases. True or False?
A. True
B. False - answer
4. You are building a neural network where it gets input from the previous layer as well as from
itself.
D. None of these
6. In a neural network, which of the following techniques is used to deal with overfitting?
A. Dropout
B. Regularization
C. Batch Normalization
D. All of these - answer
A. A unit which doesn’t update during training by any of its neighbour - answer
B. A unit which does not respond completely to any of the training patterns
D. None of these
9. For an image recognition problem (recognizing a cat in a photo), which architecture of neural
network would be better suited to solve the problem?
A. Multi Layer Perceptron
B. Convolutional Neural Network - answer
C. Recurrent Neural network
D. Perceptron
10. What are the factors to select the depth of neural network?
A. 1, 2, 4, 5
B. 2, 3, 4, 5
C. 1, 3, 4, 5
Options:
1. 2 Only
2. 1 only
C. 1 and 2
D. 2 and 3 - answer
13. Recommendation systems are used in which of the following applications:
a. Banking
b. Shopping
c. Search Engine
d. All of the above – answer
17. For each pair of clusters, which algorithm computes the maximum distance between the clusters
using below formula?
a. Single link
b. Complete link -answer
c. Average link
d. Ward’s Linkage
18. ___________ Graphical method to better understand the agglomeration process shows in a static
way how the aggregations are performed ,starting from the bottom (where all samples are separated)
till the top (where the linkage is complete).
a. Flow chart
b. Histo graph
c. Dendrogram –answer
d. Decision tree
21. ___________ are general computers which can learn algorithms to map input
sequences to output sequences
a. CNN
b. RNN- answer
c. Deep Q-Learning
d. All of these
UNIT I
1. What is classification?
a) when the output variable is a category, such as “red” or “blue” or “disease” and “no
disease”.
b) when the output variable is a real value, such as “dollars” or “weight”.
Ans: Solution A
2. What is regression?
a) When the output variable is a category, such as “red” or “blue” or “disease” and “no
disease”.
b) When the output variable is a real value, such as “dollars” or “weight”.
Ans: Solution B
Ans: Solution B
Ans: Solution A
Ans: Solution D
6. What is Reinforcement learning?
a) All data is unlabelled and the algorithms learn to inherent structure from the input data
b) All data is labelled and the algorithms learn to predict the output from the input data
c) It is a framework for learning where an agent interacts with an environment and receives
a reward for each interaction
d) Some data is labelled but most of it is unlabelled and a mixture of supervised and
unsupervised techniques can be used.
Ans: Solution C
Regression,
Classification
Clustering
Reinforcement Learning
Options:
A. 1 Only
B. 1 and 2
C. 1 and 3
D. 1, 2 and 4
Ans : Solution D
Ans : Solution C
Ans : Solution B
11. Supervised learning and unsupervised clustering both require at least one
a) hidden attribute.
b) output attribute.
c) input attribute.
d) categorical attribute.
Ans : Solution A
12. Supervised learning differs from unsupervised clustering in that supervised learning requires
a) at least one input attribute.
b) input attributes to be categorical.
c) at least one output attribute.
d) output attributes to be categorical.
Ans : Solution B
13. A regression model in which more than one independent variable is used to predict the
dependent variable is called
a) a simple linear regression model
b) a multiple regression models
c) an independent model
d) none of the above
Ans : Solution C
14. A term used to describe the case when the independent variables in a multiple regression model
are correlated is
a) Regression
b) correlation
c) multicollinearity
d) none of the above
Ans : Solution C
15. A multiple regression model has the form: y = 2 + 3x1 + 4x2. As x1 increases by 1 unit (holding x2
constant), y will
a) increase by 3 units
b) decrease by 3 units
c) increase by 4 units
d) decrease by 4 units
Ans : Solution C
Ans : Solution B
17. A measure of goodness of fit for the estimated regression equation is the
a) multiple coefficient of determination
b) mean square due to error
c) mean square due to regression
d) none of the above
Ans : Solution C
Ans : Solution D
Ans : Solution C
20. For a multiple regression model, SST = 200 and SSE = 50. The multiple coefficient of
determination is
a) 0.25
b) 4.00
c) 0.75
d) none of the above
Ans : Solution B
Ans : Solution B
Ans : Solution B
Ans : Solution C
Ans : Solution D
26. Which statement is true about neural network and linear regression models?
a) Both models require input attributes to be numeric.
b) Both models require numeric attributes to range between 0 and 1.
c) The output of both models is a categorical attribute value.
d) Both techniques build models whose output is determined by a linear sum of weighted
input attribute values.
Ans : Solution A
Ans : Solution A
28. The average positive difference between computed and desired outcome values.
a) root mean squared error
b) mean squared error
c) mean absolute error
d) mean positive error
Ans : Solution D
29. Selecting data so as to assure that each class is properly represented in both the training and
test set.
a) cross validation
b) stratification
c) verification
d) bootstrapping
Ans : Solution B
30. The standard error is defined as the square root of this computation.
a) The sample variance divided by the total number of sample instances.
b) The population variance divided by the total number of sample instances.
c) The sample variance divided by the sample mean.
d) The population variance divided by the sample mean.
Ans : Solution A
31. Data used to optimize the parameter settings of a supervised learner model.
a) Training
b) Test
c) Verification
d) Validation
Ans : Solution D
Ans : Solution A
33. The correlation between the number of years an employee has worked for a company and the
salary of the employee is 0.75. What can be said about employee salary and years worked?
a) There is no relationship between salary and years worked.
b) Individuals that have worked for the company the longest have higher salaries.
c) Individuals that have worked for the company the longest have lower salaries.
d) The majority of employees have been with the company a long time.
e) The majority of employees have been with the company a short period of time.
Ans : Solution B
34. The correlation coefficient for two real-valued attributes is –0.85. What does this value tell you?
a) The attributes are not linearly related.
b) As the value of one attribute increases the value of the second attribute also increases.
c) As the value of one attribute decreases the value of the second attribute increases.
d) The attributes show a curvilinear relationship.
Ans : Solution C
35. The average squared difference between classifier predicted output and actual output.
a) mean squared error
b) root mean squared error
c) mean absolute error
d) mean relative error
Ans : Solution A
36. Simple regression assumes a __________ relationship between the input attribute and output
attribute.
a) Linear
b) Quadratic
c) reciprocal
d) inverse
Ans : Solution A
Ans : Solution B
Ans : Solution C
39. Logistic regression is a ________ regression technique that is used to model data having a
_____outcome.
a) linear, numeric
b) linear, binary
c) nonlinear, numeric
d) nonlinear, binary
Ans : Solution D
40. This technique associates a conditional probability value with each data instance.
a) linear regression
b) logistic regression
c) simple regression
d) multiple linear regression
Ans : Solution B
41. This supervised learning technique can process both numeric and categorical input attributes.
a) linear regression
b) Bayes classifier
c) logistic regression
d) backpropagation learning
Ans : Solution A
Ans : Solution B
43. This clustering algorithm merges and splits nodes to help modify nonoptimal partitions.
a) agglomerative clustering
b) expectation maximization
c) conceptual clustering
d) K-Means clustering
Ans : Solution D
44. This clustering algorithm initially assumes that each data instance represents a single cluster.
a) agglomerative clustering
b) conceptual clustering
c) K-Means clustering
d) expectation maximization
Ans : Solution C
45. This unsupervised clustering algorithm terminates when mean values computed for the current
iteration of the algorithm are identical to the computed mean values for the previous iteration.
a) agglomerative clustering
b) conceptual clustering
c) K-Means clustering
d) expectation maximization
Ans : Solution C
46. Machine learning techniques differ from statistical techniques in that machine learning methods
a) typically assume an underlying distribution for the data.
b) are better able to deal with missing and noisy data.
c) are not able to explain their behavior.
d) have trouble with large-sized datasets.
Ans : Solution B
UNIT –II
1.True- False: Over fitting is more likely when you have huge amount of data to train?
A) TRUE
B) FALSE
Ans Solution: (B)
With a small training dataset, it’s easier to find a hypothesis to fit the training data exactly i.e.
over fitting.
3.Which of the following techniques would perform better for reducing dimensions of a data
set?
A. Removing columns which have too many missing values
B. Removing columns which have high variance in data
C. Removing columns with dissimilar data trends
D. None of these
Ans Solution: (A)
If a columns have too many missing values, (say 99%) then we can remove such columns.
4.It is not necessary to have a target variable for applying dimensionality reduction
algorithms.
A. TRUE
B. FALSE
Ans Solution: (A)
LDA is an example of supervised dimensionality reduction algorithm.
5. PCA can be used for projecting and visualizing data in lower dimensions.
A. TRUE
B. FALSE
Ans Solution: (A)
Sometimes it is very useful to plot the data in lower dimensions. We can take the first 2 principal
components and then visualize the data using scatter plot.
6. The most popularly used dimensionality reduction algorithm is Principal Component Analysis
(PCA). Which of the following is/are true about PCA?
PCA is an unsupervised method
It searches for the directions that data have the largest variance
Maximum number of principal components <= number of features
All principal components are orthogonal to each other
A. 1 and 2
B. 1 and 3
C. 2 and 3
D. All of the above
Ans D
8. What happens when you get features in lower dimensions using PCA?
The features will still have interpretability
The features will lose interpretability
The features must carry all information present in data
The features may not carry all information present in data
A. 1 and 3
B. 1 and 4
C. 2 and 3
D. 2 and 4
Ans Solution: (D)
When you get the features in lower dimensions then you will lose some information of data
most of the times and you won’t be able to interpret the lower dimension data.
10. What is of the following statement is true about t-SNE in comparison to PCA?
A. When the data is huge (in size), t-SNE may fail to produce better results.
B. T-NSE always produces better result regardless of the size of the data
C. PCA always performs better than t-SNE for smaller size data.
D. None of these
Ans Solution: (A)
Option A is correct
11. [ True or False ] PCA can be used for projecting and visualizing data in lower dimensions.
A. TRUE
B. FALSE
Solution: (A)
Sometimes it is very useful to plot the data in lower dimensions. We can take the first 2 principal
components and then visualize the data using scatter plot.
12. A feature F1 can take certain value: A, B, C, D, E, & F and represents grade of students from
a college.
1) Which of the following statement is true in following case?
A) Feature F1 is an example of nominal variable.
B) Feature F1 is an example of ordinal variable.
C) It doesn’t belong to any of the above category.
D) Both of these
Solution: (B)
Ordinal variables are the variables which has some order in their categories. For example, grade
A should be consider as high grade than grade B.
1. Which of the following methods do we use to best fit the data in Logistic Regression?
A) Least Square Error
B) Maximum Likelihood
C) Jaccard distance
D) Both A and B
Ans Solution: B
2. Choose which of the following options is true regarding One-Vs-All method in Logistic
Regression.
A) We need to fit n models in n-class classification problem
B) We need to fit n-1 models to classify into n classes
C) We need to fit only 1 model to classify into n classes
D) None of these
Ans Solution: A
3. Suppose, You applied a Logistic Regression model on a given data and got a training accuracy
X and testing accuracy Y. Now, you want to add a few new features in the same data. Select the
option(s) which is/are correct in such a case.
Note: Consider remaining parameters are same.
A) Training accuracy increases
B) Training accuracy increases or remains the same
C) Testing accuracy decreases
D) Testing accuracy increases or remains the same
Ans Solution: A and D
Adding more features to model will increase the training accuracy because model has to
consider more data to fit the logistic regression. But testing accuracy increases if feature is
found to be significant
6. Which of the following methods do we use to find the best fit line for data in Linear
Regression?
A) Least Square Error
B) Maximum Likelihood
C) Logarithmic Loss
D) Both A and B
Ans Solution: (A)
In linear regression, we try to minimize the least square errors of the model to identify the line
of best fit.
8. Suppose you plotted a scatter plot between the residuals and predicted values in linear
regression and you found that there is a relationship between them. Which of the following
conclusion do you make about this situation?
9. Suppose you have fitted a complex regression model on a dataset. Now, you are using Ridge
regression with penalty x.
Choose the option which describes bias in best manner.
A) In case of very large x; bias is low
B) In case of very large x; bias is high
C) We can’t say about bias
D) None of these
Ans Solution: (B)
If the penalty is very large it means model is less complex, therefore the bias would be high.
11. Suppose you have trained a logistic regression classifier and it outputs a new example x with
a prediction ho(x) = 0.2. This means
Our estimate for P(y=1 | x)
Our estimate for P(y=0 | x)
Our estimate for P(y=1 | x)
Our estimate for P(y=0 | x)
Ans Solution: B
A) TRUE
B) FALSE
Solution: (A)
True. A Neural network can be used as a universal approximator, so it can definitely implement
a linear regression algorithm.
15. Which of the following methods do we use to find the best fit line for data in Linear
Regression?
A) Least Square Error
B) Maximum Likelihood
C) Logarithmic Loss
D) Both A and B
Solution: (A)
In linear regression, we try to minimize the least square errors of the model to identify the line
of best fit.
16. Which of the following evaluation metrics can be used to evaluate a model while modeling
a continuous output variable?
A) AUC-ROC
B) Accuracy
C) Logloss
D) Mean-Squared-Error
Solution: (D)
Since linear regression gives output as continuous values, so in such case we use mean squared
error metric to evaluate the model performance. Remaining options are use in case of a
classification problem.
17. True-False: Lasso Regularization can be used for variable selection in Linear Regression.
A) TRUE
B) FALSE
Solution: (A)
True, In case of lasso regression we apply absolute penalty which makes some of the coefficients
zero.
19. Suppose that we have N independent variables (X1,X2… Xn) and dependent variable is Y.
Now Imagine that you are applying linear regression by fitting the best fit line using least square
error on this data.
You found that correlation coefficient for one of it’s variable(Say X1) with Y is -0.95.
Which of the following is true for X1?
A) Relation between the X1 and Y is weak
B) Relation between the X1 and Y is strong
C) Relation between the X1 and Y is neutral
D) Correlation can’t judge the relationship
Solution: (B)
The absolute value of the correlation coefficient denotes the strength of the relationship.
Since absolute correlation is very high it means that the relationship is strong between X1 and
Y.
20. Looking at above two characteristics, which of the following option is the correct for
Pearson correlation between V1 and V2?
If you are given the two variables V1 and V2 and they are following below two characteristics.
1. If V1 increases then V2 also increases
2. If V1 decreases then V2 behavior is unknown
A) Pearson correlation will be close to 1
B) Pearson correlation will be close to -1
C) Pearson correlation will be close to 0
D) None of these
Solution: (D)
We cannot comment on the correlation coefficient by using only statement 1. We need to
consider the both of these two statements. Consider V1 as x and V2 as |x|. The correlation
coefficient would not be close to 1 in such a case.
21. Suppose Pearson correlation between V1 and V2 is zero. In such case, is it right to
conclude that V1 and V2 do not have any relation between them?
A) TRUE
B) FALSE
Solution: (B)
Pearson correlation coefficient between 2 variables might be zero even when they have a
relationship between them. If the correlation coefficient is zero, it just means that that they
don’t move together. We can take examples like y=|x| or y=x^2.
22. True- False: Overfitting is more likely when you have huge amount of data to train?
A) TRUE
B) FALSE
Solution: (B)
With a small training dataset, it’s easier to find a hypothesis to fit the training data exactly i.e.
overfitting.
23. We can also compute the coefficient of linear regression with the help of an analytical
method called “Normal Equation”. Which of the following is/are true about Normal Equation?
1. We don’t have to choose the learning rate
2. It becomes slow when number of features is very large
3. Thers is no need to iterate
A) 1 and 2
B) 1 and 3
C) 2 and 3
D) 1,2 and 3
Solution: (D)
Instead of gradient descent, Normal Equation can also be used to find coefficients.
25. What will happen when you apply very large penalty?
A) Some of the coefficient will become absolute zero
B) Some of the coefficient will approach zero but not absolute zero
C) Both A and B depending on the situation
D) None of these
Solution: (B)
In lasso some of the coefficient value become zero, but in case of Ridge, the coefficients become
close to zero but not zero.
26. What will happen when you apply very large penalty in case of Lasso?
A) Some of the coefficient will become zero
B) Some of the coefficient will be approaching to zero but not absolute zero
C) Both A and B depending on the situation
D) None of these
Solution: (A)
As already discussed, lasso applies absolute penalty, so some of the coefficients will become
zero.
27. Which of the following statement is true about outliers in Linear regression?
A) Linear regression is sensitive to outliers
B) Linear regression is not sensitive to outliers
C) Can’t say
D) None of these
Solution: (A)
The slope of the regression line will change due to outliers in most of the cases. So Linear
Regression is sensitive to outliers.
28. Suppose you plotted a scatter plot between the residuals and predicted values in linear
regression and you found that there is a relationship between them. Which of the following
conclusion do you make about this situation?
31. In terms of bias and variance. Which of the following is true when you fit degree 2
polynomial?
A) Increase
B) Decrease
C) Remain constant
D) Can’t Say
Solution: (D)
Training error may increase or decrease depending on the values that are used to fit the model.
If the values used to train contain more outliers gradually, then the error might just increase.
33. What do you expect will happen with bias and variance as you increase the size of training
data?
34. What would be the root mean square training error for this data if you run a Linear
Regression model of the form (Y = A0+A1X)?
A) Less than 0
B) Greater than zero
C) Equal to 0
D) None of these
Solution: (C)
We can perfectly fit the line on the following data so mean error will be zero.
35. Which of the following scenario would give you the right hyper parameter?
A) 1
B) 2
C) 3
D) 4
Solution: (B)
Option B would be the better option because it leads to less training as well as validation error.
36. Suppose you got the tuned hyper parameters from the previous question. Now, Imagine
you want to add a variable in variable space such that this added feature is important. Which
of the following thing would you observe in such case?
A) Training Error will decrease and Validation error will increase
B) Training Error will increase and Validation error will increase
C) Training Error will increase and Validation error will decrease
D) Training Error will decrease and Validation error will decrease
E) None of the above
Solution: (D)
If the added feature is important, the training and validation error would decrease.
A) L1
B) L2
C) Any
D) None of these
Solution: (D)
I won’t use any regularization methods because regularization is used in case of overfitting.
41. True-False: Is it possible to design a logistic regression algorithm using a Neural Network
Algorithm?
A) TRUE
B) FALSE
Solution: A
True, Neural network is a is a universal approximator so it can implement linear regression
algorithm.
43. Which of the following methods do we use to best fit the data in Logistic Regression?
A) Least Square Error
B) Maximum Likelihood
C) Jaccard distance
D) Both A and B
Solution: B
Logistic regression uses maximum likely hood estimate for training a logistic regression.
44. Which of the following evaluation metrics can not be applied in case of logistic regression
output to compare with target?
A) AUC-ROC
B) Accuracy
C) Logloss
D) Mean-Squared-Error
Solution: D
Since, Logistic Regression is a classification algorithm so it’s output can not be real time value so
mean squared error can not use for evaluating it
45. One of the very good methods to analyze the performance of Logistic Regression is AIC,
which is similar to R-Squared in Linear Regression. Which of the following is true about AIC?
A) We prefer a model with minimum AIC value
B) We prefer a model with maximum AIC value
C) Both but depend on the situation
D) None of these
Solution: A
We select the best model in logistic regression which can least AIC.
Solution: A
In case of lasso we apply a absolute penality, after increasing the penality in lasso some of the
coefficient of variables may become zero.
Context: 48-49
Consider a following model for logistic regression: P (y =1|x, w)= g(w0 + w1x)
where g(z) is the logistic function.
In the above equation the P (y =1|x; w) , viewed as a function of x, that we can get by changing the
parameters w.
A) (0, inf)
B) (-inf, 0 )
C) (0, 1)
D) (-inf, inf)
Solution: C
For values of x in the range of real number from −∞ to +∞ Logistic function will give the output
between (0,1)
49 In above question what do you think which function would make p between (0,1)?
A) logistic function
B) Log likelihood function
C) Mixture of both
D) None of them
Solution: A
50. Suppose you have been given a fair coin and you want to find out the odds of getting heads.
Which of the following option is true for such a case?
A) odds will be 0
B) odds will be 0.5
C) odds will be 1
D) None of these
Solution: C
Odds are defined as the ratio of the probability of success and the probability of failure. So in case of fair
coin probability of success is 1/2 and the probability of failure is 1/2 so odd would be 1
51. The logit function(given as l(x)) is the log of odds function. What could be the range of logit
function in the domain x=[0,1]?
A) (– ∞ , ∞)
B) (0,1)
C) (0, ∞)
D) (- ∞, 0)
Solution: A
For our purposes, the odds function has the advantage of transforming the probability function, which
has values from 0 to 1, into an equivalent function with values between 0 and ∞. When we take the
natural log of the odds function, we get a range of values from -∞ to ∞.
A) Linear Regression errors values has to be normally distributed but in case of Logistic Regression it is
not the case
B) Logistic Regression errors values has to be normally distributed but in case of Linear Regression it is
not the case
C) Both Linear Regression and Logistic Regression error values have to be normally distributed
D) Both Linear Regression and Logistic Regression error values have not to be normally distributed
Solution:A
53. Which of the following is true regarding the logistic function for any value “x”?
Note:
Logistic(x): is a logistic function of any number “x”
A) Logistic(x) = Logit(x)
B) Logistic(x) = Logit_inv(x)
C) Logit_inv(x) = Logit(x)
D) None of these
Solution: B
Suppose you have given the two scatter plot “a” and “b” for two classes( blue for positive and red for
negative class). In scatter plot “a”, you correctly classified all data points using logistic regression ( black
line is a decision boundary).
A) Bias will be high
B) Bias will be low
C) Can’t say
D) None of these
Solution: A
55. Suppose, You applied a Logistic Regression model on a given data and got a training accuracy X
and testing accuracy Y. Now, you want to add a few new features in the same data. Select the
option(s) which is/are correct in such a case.
Solution: A and D
Adding more features to model will increase the training accuracy because model has to consider more
data to fit the logistic regression. But testing accuracy increases if feature is found to be significant
56. Choose which of the following options is true regarding One-Vs-All method in Logistic Regression.
If there are n classes, then n separate logistic regression has to fit, where the probability of each
category is predicted over the rest of the categories combined.
57. Below are two different logistic models with different values for β0 and β1.
Which of the
following statement(s) is true about β0 and β1 values of two logistics models (Green, Black)?
Solution: B
Context 58-60
Below are the three scatter plot(A,B,C left to right) and hand drawn decision boundaries for logistic
regression.
58. Which of the following above figure shows that the decision boundary is overfitting the training
data?
A) A
B) B
C) C
D)None of these
Solution: C
Since in figure 3, Decision boundary is not smooth that means it will over-fitting the data.
1. The training error in first plot is maximum as compare to second and third plot.
2. The best model for this regression problem is the last (third) plot because it has minimum
training error (zero).
3. The second model is more robust than first and third because it will perform best on unseen
data.
5. All will perform same because we have not seen the testing data.
A) 1 and 3
B) 1 and 3
C) 1, 3 and 4
D) 5
Solution: C
The trend in the graphs looks like a quadratic trend over independent variable X. A higher degree(Right
graph) polynomial might have a very high accuracy on the train population but is expected to fail badly
on test dataset. But if you see in left graph we will have training error maximum because it underfits the
training data
60. Suppose, above decision boundaries were generated for the different value of regularization.
Which of the above decision boundary shows the maximum regularization?
A) A
B) B
C) C
D) All have equal regularization
Solution: A
Since, more regularization means more penality means less complex decision boundry that shows in first
figure A.
61. What would do if you want to train logistic regression on same data that will take less time as well
as give the comparatively similar accuracy(may not be same)?
Suppose you are using a Logistic Regression model on a huge dataset. One of the problem you may face
on such huge data is that Logistic regression will take very long time to train.
Solution: D
If you decrease the number of iteration while training it will take less time for surly but will not give the
same accuracy for getting the similar accuracy but not exact you need to increase the learning rate.
62. Which of the following image is showing the cost function for y =1.
Following is the loss function in logistic regression(Y-axis loss function and x axis log probability) for
two class classification problem.
Solution: A
A is the true answer as loss function decreases as the log probability increases
A) 1
B) 2
C) 3
D) 4
Solution: C
There are three local minima present in the graph
64. Can a Logistic Regression classifier do a perfect classification on the below data?
Note: You can use only X1 and X2 variables where X1 and X2 can take only two binary values(0,1).
A) TRUE
B) FALSE
C) Can’t say
D) None of these
Solution: B
No, logistic regression only forms linear decision surface, but the examples in the figure are not linearly
separable.
UNIT IV
Ans Solution: C
When the data has noise and overlapping points, there is a problem in drawing a clear hyperplane
without misclassifying.
Ans Solution: C
The cost parameter decides how much an SVM should be allowed to “bend” with the data. For a low
cost, you aim for a smooth decision surface and for a higher cost, you aim to classify more points
correctly. It is also simply referred to as the cost of misclassification.
Ans Solution: D
SVM’s are highly versatile models that can be used for practically all real world problems ranging from
regression to clustering and handwriting recognitions.
Ans Solution: B
Generalisation error in statistics is generally the out-of-sample error which is the measure of how
accurately a model can predict values for previously unseen data.
Ans Solution: C
When the data has noise and overlapping points, there is a problem in drawing a clear hyperplane
without misclassifying.
A) 1
B) 2
C) 1 and 2
D) None of these
Ans Solution: C
Question Context:8– 9
Suppose you are using a Linear SVM classifier with 2 class classification problem. Now you have been
given the following data in which some points are circled red that are representing support vectors.
8. If you remove the following any one red points from the data. Does the decision boundary will
change?
A) Yes
B) No
Solution: A
These three examples are positioned such that removing any one of them introduces slack in the
constraints. So the decision boundary would completely change.
9. [True or False] If you remove the non-red circled points from the data, the decision boundary will
change?
A) True
B) False
Solution: B
On the other hand, rest of the points in the data won’t affect the decision boundary much.
Solution: B
Generalization error in statistics is generally the out-of-sample error which is the measure of how
accurately a model can predict values for previously unseen data.
11. When the C parameter is set to infinite, which of the following holds true?
A) The optimal hyperplane if exists, will be the one that completely separates the data
B) The soft-margin classifier will separate the data
C) None of the above
Solution: A
At such a high level of misclassification penalty, soft margin will not hold existence as there will be no
room for error.
Solution: A
A hard margin means that an SVM is very rigid in classification and tries to work extremely well in the
training set, causing overfitting.
13. The minimum time complexity for training an SVM is O(n2). According to this fact, what sizes of
datasets are not best suited for SVM’s?
A) Large datasets
B) Small datasets
C) Medium sized datasets
D) Size does not matter
Solution: A
Datasets which have a clear classification boundary will function best with SVM’s.
A) Selection of Kernel
B) Kernel Parameters
C) Soft Margin Parameter C
D) All of the above
Solution: D
The SVM effectiveness depends upon how you choose the basic 3 requirements mentioned above in
such a way that it maximises your efficiency, reduces error and overfitting.
15. upport vectors are the data points that lie closest to the decision surface.
A) TRUE
B) FALSE
Solution: A
They are the points closest to the hyperplane and the hardest ones to classify. They also have a direct
bearing on the location of the decision surface.
Solution: C
When the data has noise and overlapping points, there is a problem in drawing a clear hyperplane
without misclassifying.
17. Suppose you are using RBF kernel in SVM with high Gamma value. What does this signify?
A) The model would consider even far away points from hyperplane for modeling
B) The model would consider only the points close to the hyperplane for modeling
C) The model would not be affected by distance of points from hyperplane for modeling
D) None of the above
Solution: B
The gamma parameter in SVM tuning signifies the influence of points either near or far away from the
hyperplane.
For a low gamma, the model will be too constrained and include all points of the training dataset,
without really capturing the shape.
For a higher gamma, the model will capture the shape of the dataset well.
Solution: C
The cost parameter decides how much an SVM should be allowed to “bend” with the data. For a low
cost, you aim for a smooth decision surface and for a higher cost, you aim to classify more points
correctly. It is also simply referred to as the cost of misclassification.
19. Suppose you are building a SVM model on data X. The data X can be error prone which means that
you should not trust any specific data point too much. Now think that you want to build a SVM model
which has quadratic kernel function of polynomial degree 2 that uses Slack variable C as one of it’s hyper
parameter. Based upon that give the answer for following question.
What would happen when you use very large value of C(C->infinity)?
Note: For small C was also classifying all data points correctly
A) We can still classify data correctly for given setting of hyper parameter C
B) We can not classify data correctly for given setting of hyper parameter C
C) Can’t Say
D) None of these
Solution: A
For large values of C, the penalty for misclassifying points is very high, so the decision boundary will
perfectly separate the data if possible.
20. What would happen when you use very small C (C~0)?
Solution: A
The classifier can maximize the margin between most of the points, while misclassifying a few points,
because the penalty is so low.
21. If I am using all features of my dataset and I achieve 100% accuracy on my training set, but ~70% on
validation set, what should I look out for?
A) Underfitting
B) Nothing, the model is perfect
C) Overfitting
Solution: C
If we’re achieving 100% training accuracy very easily, we need to check to verify if we’re overfitting our
data.
22. Which of the following are real world applications of the SVM?
Solution: D
SVM’s are highly versatile models that can be used for practically all real world problems ranging from
regression to clustering and handwriting recognitions.
Question Context: 23 – 25
Suppose you have trained an SVM with linear decision boundary after training SVM, you correctly infer
that your SVM model is under fitting.
23. Which of the following option would you more likely to consider iterating SVM next time?
Solution: C
The best option here would be to create more features for the model.
24. Suppose you gave the correct answer in previous question. What do you think that is actually
happening?
A) 1 and 2
B) 2 and 3
C) 1 and 4
D) 2 and 4
Solution: C
Better model will lower the bias and increase the variance
25. In above question suppose you want to change one of it’s(SVM) hyperparameter so that effect
would be same as previous questions i.e model will not under fit?
Solution: A
Increasing C parameter would be the right thing to do here, as it will ensure regularized model
26. We usually use feature normalization before using the Gaussian kernel in SVM. What is true about
feature normalization?
A) 1
B) 1 and 2
C) 1 and 3
D) 2 and 3
Solution: B
Suppose you are dealing with 4 class classification problem and you want to train a SVM model on the
data for that you are using One-vs-all method. Now answer the below questions?
27. How many times we need to train our SVM model in such case?
A) 1
B) 2
C) 3
D) 4
Solution: D
For a 4 class problem, you would have to train the SVM at least 4 times if you are using a one-vs-all
method.
28. Suppose you have same distribution of classes in the data. Now, say for training 1 time in one vs all
setting the SVM is taking 10 second. How many seconds would it require to train one-vs-all method end
to end?
A) 20
B) 40
C) 60
D) 80
Solution: B
29 Suppose your problem has changed now. Now, data has only 2 classes. What would you think how
many times we need to train SVM in such case?
A) 1
B) 2
C) 3
D) 4
Solution: A
Training the SVM only one time would give you appropriate results
Suppose you are using SVM with linear kernel of polynomial degree 2, Now think that you have applied
this on data and found that it perfectly fit the data that means, Training and testing accuracy is 100%.
30. Now, think that you increase the complexity (or degree of polynomial of this kernel). What would
you think will happen?
Solution: A
Increasing the complexity of the data would make the algorithm overfit the data.
31. In the previous question after increasing the complexity you found that training accuracy was still
100%. According to you what is the reason behind that?
1. Since data is fixed and we are fitting more polynomial term or parameters so the algorithm starts
memorizing everything in the data
2. Since data is fixed and SVM doesn’t need to search in big hypothesis space
A) 1
B) 2
C) 1 and 2
D) None of these
Solution: C
A) 1
B) 2
C) 1 and 2
D) None of these
Solution: C
UNIT V
1. Which of the following is a widely used and effective machine learning algorithm based on the
idea of bagging?
a) Decision Tree
b) Regression
c) Classification
d) Random Forest
Ans D
a) Factor analysis
b) Decision trees are robust to outliers
c) Decision trees are prone to be overfit
d) None of the above
Ans C
a. True
b. False
Decision trees can also be used to for clusters in the data but clustering often generates natural
clusters and is not dependent on any objective function.
Regression
Classification
Clustering
Reinforcement Learning
Options:
a. 1 Only
b. 1 and 2
c. 1 and 3
d. 1, 2 and 4
Ans D
6 Which of the following is the most appropriate strategy for data cleaning before performing
clustering analysis, given less than desirable number of data points:
Removal of outliers
Options:
a. 1 only
b. 2 only
c. 1 and 2
d. None of the above
Ans A
A) 1
B) 2
C) 1 and 2
D) None of these
Ans Solution: C
Both options are true. In Bagging, each individual trees are independent of each other because they
consider different subset of features and samples.
A) 1
B) 2
C) 1 and 2
D) None of these
Ans Solution: B
In boosting tree individual weak learners are not independent of each other because each tree correct
the results of previous tree. Bagging and boosting both can be consider as improving the base learners
results.
9. In Random forest you can generate hundreds of trees (say T1, T2 …..Tn) and then aggregate
the results of these tree. Which of the following is true about individual (Tk) tree in Random Forest?
1. Individual tree is built on a subset of the features
A) 1 and 3
B) 1 and 4
C) 2 and 3
D) 2 and 4
Ans Solution: A
Random forest is based on bagging concept, that consider faction of sample and faction of feature for
building the individual trees.
10. Suppose you are using a bagging based algorithm say a RandomForest in model building.
Which of the following can be true?
A) 1
B) 2
C) 1 and 2
D) None of these
Ans Solution: A
Since Random Forest aggregate the result of different weak learners, If It is possible we would want
more number of trees in model building. Random Forest is a black box model you will lose
interpretability after using it.
11. Which of the following is/are true about Random Forest and Gradient Boosting ensemble
methods?
2. Random Forest is use for classification whereas Gradient Boosting is use for regression task
3. Random Forest is use for regression whereas Gradient Boosting is use for Classification task
Solution: E
12. In Random forest you can generate hundreds of trees (say T1, T2 …..Tn) and then aggregate the
results of these tree. Which of the following is true about individual(Tk) tree in Random Forest?
A) 1 and 3
B) 1 and 4
C) 2 and 3
D) 2 and 4
Solution: A
Random forest is based on bagging concept, that consider faction of sample and faction of feature for
building the individual trees.
13. Which of the following algorithm doesn’t uses learning Rate as of one of its hyperparameter?
1. Gradient Boosting
2. Extra Trees
3. AdaBoost
4. Random Forest
A) 1 and 3
B) 1 and 4
C) 2 and 3
D) 2 and 4
Solution: D
Random Forest and Extra Trees don’t have learning rate as a hyperparameter.
14. Which of the following algorithm are not an example of ensemble learning algorithm?
A) Random Forest
B) Adaboost
C) Extra Trees
D) Gradient Boosting
E) Decision Trees
Solution: E
Decision trees doesn’t aggregate the results of multiple trees so it is not an ensemble algorithm.
15. Suppose you are using a bagging based algorithm say a RandomForest in model building. Which of
the following can be true?
A) 1
B) 2
C) 1 and 2
D) None of these
Solution: A
Since Random Forest aggregate the result of different weak learners, If It is possible we would want
more number of trees in model building. Random Forest is a black box model you will lose
interpretability after using it.
16. True-False: The bagging is suitable for high variance low bias models?
A) TRUE
B) FALSE
Solution: A
The bagging is suitable for high variance low bias models or you can say for complex models.
17. To apply bagging to regression trees which of the following is/are true in such case?
Solution: D
Solution: B
We always consider the validation results to compare with the test result.
19. In which of the following scenario a gain ratio is preferred over Information Gain?
Solution: A
When high cardinality problems, gain ratio is preferred over Information Gain technique.
20. Suppose you have given the following scenario for training and validation error for Gradient
Boosting. Which of the following hyper parameter would you choose in such case?
1 2 100 110
2 4 90 105
3 6 50 100
4 8 45 105
5 10 30 150
A) 1
B) 2
C) 3
D) 4
Solution: B
Scenario 2 and 4 has same validation accuracies but we would select 2 because depth is lower is better
hyper parameter.
21. Which of the following is/are not true about DBSCAN clustering algorithm:
1. For data points to be in a cluster, they must be in a distance threshold to a core point
5. It is robust to outliers
Options:
A. 1 only
B. 2 only
C. 4 only
D. 2 and 3
Solution: D
DBSCAN can form a cluster of any arbitrary shape and does not have strong assumptions for the
distribution of data points in the data space.
Answer: d
Explanation: K-means clustering follows partitioning approach.
Answer: c
Explanation: k-nearest neighbour has nothing to do with k-means.
Answer: a
Explanation: K-means requires a number of clusters.
Answer: a
Explanation: K-means clustering produces the final estimate of cluster centroids.
27.
((MARKS)) QUESTION IS OF HOW MANY MARKS? (1 OR 2 OR 1
3 UPTO 10)
((QUESTION)) Which of the following step / assumption in regression modeling impacts
the trade-off between under-fitting and over-fitting the most
ENTER CONTENT. QTN CAN HAVE IMAGES ALSO 1. The true relationship between dependent y and predictor x is linear
2. The model errors are statistically independent
3. The errors are normally distributed with a 0 mean and constant standard deviation.
((OPTION_A)) 1,2&3
THIS IS MANDATORY OPTION
((OPTION_B)) 1&3
THIS IS ALSO MANDATORY OPTION
((OPTION_C)) All of above
This is optional
((OPTION_D))
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E C
((EXPLANATION)) This is also optional
ENTER CONTENT. QTN CAN HAVE IMAGES ALSO 1. Simple Linear regression will have high bias and low variance
2. Simple Linear regression will have low bias and high variance
3. polynomial of degree 3 will have low bias and high variance
Polynomial of degree 3 will have low bias and Low variance
((OPTION_A)) . Only 1
THIS IS MANDATORY OPTION
((OPTION_B)) 1&3
THIS IS ALSO MANDATORY OPTION
((OPTION_C)) 1&4
This is optional
((OPTION_D)) None of the above
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E C
((EXPLANATION)) This is also optional
ENTER CONTENT. QTN CAN HAVE IMAGES ALSO Now, which of the following option will be correct ?
((OPTION_A)) It is more likely for X1 to be excluded from the model
THIS IS MANDATORY OPTION
((OPTION_B)) It is more likely for X1 to be included in the model
THIS IS ALSO MANDATORY OPTION
((OPTION_C)) . Can’t say
This is optional
((OPTION_D)) None of the above
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E B
((EXPLANATION)) This is also optional
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E A
((EXPLANATION)) This is also optional
ENTER CONTENT. QTN CAN HAVE IMAGES ALSO 1. We don’t have to choose the learning rate
2. It becomes slow when number of features is very large
3. No need to iterate
((OPTION_A)) 1 and 2
THIS IS MANDATORY OPTION
((OPTION_B)) 1&3
THIS IS ALSO MANDATORY OPTION
((OPTION_C)) 2&3
This is optional
((OPTION_D)) 1,2&3
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E D
((EXPLANATION)) This is also optional
((OPTION_A)) . 1 and 2
THIS IS MANDATORY OPTION
((OPTION_B)) 1 and 3
THIS IS ALSO MANDATORY OPTION
((OPTION_C)) 2 and 3
This is optional
((OPTION_D)) 1,2 and 3
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E D
((EXPLANATION)) This is also optional
ENTER CONTENT. QTN CAN HAVE IMAGES ALSO Note: Consider remaining parameters are same.
1. Training accuracy always decreases.
2. Training accuracy always increases or remain same.
3. Testing accuracy always decreases
Testing accuracy always increases or remain same
((OPTION_A)) Only 2
THIS IS MANDATORY OPTION
((OPTION_B)) Only 1
THIS IS ALSO MANDATORY OPTION
((OPTION_C)) Only3
This is optional
((OPTION_D)) All of the above
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E A
((EXPLANATION)) This is also optional
((MARKS)) QUESTION IS OF HOW MANY MARKS? (1 OR 2 OR 1
3 UPTO 10)
((QUESTION)) The graph below represents a regression line predicting Y from X. The values on the
graph shows the residuals for each predictions value. Use this information to compute
the SSE.
ENTER CONTENT. QTN CAN HAVE IMAGES ALSO
((OPTION_A)) 3.02
THIS IS MANDATORY OPTION
((OPTION_B)) 0.75
THIS IS ALSO MANDATORY OPTION
((OPTION_C)) 1.01
This is optional
((OPTION_D)) None of these
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E A
((EXPLANATION)) This is also optional
ENTER CONTENT. QTN CAN HAVE IMAGES ALSO Would a person with Salary $1 be considered an Outlier?
((OPTION_A)) YES
THIS IS MANDATORY OPTION
((OPTION_B)) NO
THIS IS ALSO MANDATORY OPTION
((OPTION_C)) . More information is required
This is optional
((OPTION_D)) None of these
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E C
((EXPLANATION)) This is also optional
ENTER CONTENT. QTN CAN HAVE IMAGES ALSO Logistic(x): is a logistic function of any number “x”
Logit(x): is a logit function of any number “x”
Logit_inv(x): is a inverse logit function of any number “x””?
ENTER CONTENT. QTN CAN HAVE IMAGES ALSO Note: Consider remaining parameters are same.
((OPTION_A)) Training accuracy increases
THIS IS MANDATORY OPTION
((OPTION_B)) Training accuracy increases or remains the same
THIS IS ALSO MANDATORY OPTION
((OPTION_C)) Testing accuracy decreases
This is optional
((OPTION_D)) Testing accuracy increases or remains the same
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E A&D
((EXPLANATION)) This is also optional
ENTER CONTENT. QTN CAN HAVE IMAGES ALSO Following is the loss function in logistic regression(Y-axis loss function and x axis log probability) for two
class classification problem.
Note: Y is the target class
((OPTION_A)) A
THIS IS MANDATORY OPTION
((OPTION_B)) B
THIS IS ALSO MANDATORY OPTION
((OPTION_C)) BOTH
This is optional
((OPTION_D)) NON OF THESE
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E A
((EXPLANATION)) This is also optional
This is optional
((OPTION_D)) Predict a continuous variable from dichotomous or continuous variables
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E A
((EXPLANATION)) This is also optional
((MARKS)) QUESTION IS OF HOW MANY MARKS? (1 OR 2 OR 1
3 UPTO 10)
((QUESTION)) The odds ratio is
This is optional
((OPTION_D)) The ratio of the probability of an event happening to the probability of the event not happening.
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E C
((EXPLANATION)) This is also optional
This is optional
((OPTION_D)) That the statistical model is a poor fit of the data.
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E B
((EXPLANATION)) This is also optional
This is optional
((OPTION_D)) Linear relationship between observations.
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E B
((EXPLANATION)) This is also optional
This is optional
((OPTION_D)) There is no dependent variable.
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E C
((EXPLANATION)) This is also optional
This is optional
((OPTION_D)) none
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E C
((EXPLANATION)) This is also optional
ENTER CONTENT. QTN CAN HAVE IMAGES ALSO pairs as follows: y 1 = 22, x 1 = 1, y 2 = 3, x 2 = 1, y 3 = 3, x 3 = 2. What
ENTER CONTENT. QTN CAN HAVE IMAGES ALSO stored it in a vector g. What is the cost of one gradient descent update
given the gradient?
((OPTION_A))
O (D )
THIS IS MANDATORY OPTION
((OPTION_B))
O (N )
THIS IS ALSO MANDATORY OPTION
((OPTION_C))
O (ND )
This is optional
((OPTION_D))
O (ND 2)
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E A
((EXPLANATION)) This is also optional
ENTER CONTENT. QTN CAN HAVE IMAGES ALSO you increase the amount of training data, the test error decreases and the
training error increases. The train error is quite low (almost what you expect
it to), while the test error is much higher than the train error.
What do you think is the main reason behind this behavior. Choose the
most probable option
((OPTION_A))
High variance
THIS IS MANDATORY OPTION
((OPTION_B))
High model bias
THIS IS ALSO MANDATORY OPTION
((OPTION_C))
High estimation bias
This is optional
((OPTION_D))
None of the above
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E A
((EXPLANATION)) This is also optional
((OPTION_A))
TRUE
THIS IS MANDATORY OPTION
((OPTION_B))
FALSE
THIS IS ALSO MANDATORY OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E A
((EXPLANATION)) This is also optional
((MARKS)) QUESTION IS OF HOW MANY MARKS? (1 OR 2 OR
1
3 UPTO 10)
((QUESTION))
Training set is normally a representation of a global distribution
ENTER CONTENT. QTN CAN HAVE IMAGES ALSO
((OPTION_A))
TRUE
THIS IS MANDATORY OPTION
((OPTION_B))
FALSE
THIS IS ALSO MANDATORY OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E A
((EXPLANATION)) This is also optional
((OPTION_A))
Underfitting
THIS IS MANDATORY OPTION
((OPTION_B))
Overfitting
THIS IS ALSO MANDATORY OPTION
((OPTION_C))
Both
Both
This is optional
((OPTION_D))
None
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E
((EXPLANATION)) This is also optional
ENTER CONTENT. QTN CAN HAVE IMAGES ALSO values, but when an unknown input is presented, the corresponding prediction
error can be very high, This problem is called as
((OPTION_A))
Underfitting
THIS IS MANDATORY OPTION
((OPTION_B))
Overfitting
THIS IS ALSO MANDATORY OPTION
((OPTION_C))
Both
This is optional
((OPTION_D))
None
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E
((EXPLANATION)) This is also optional
((MARKS)) QUESTION IS OF HOW MANY MARKS? (1 OR 2 OR
1
3 UPTO 10)
((QUESTION)) ---------- may prove to be more difficult to discover as it could be initially considered
ENTER CONTENT. QTN CAN HAVE IMAGES ALSO the result of a perfect fitting
((OPTION_A))
Underfitting
THIS IS MANDATORY OPTION
((OPTION_B))
Overfitting
THIS IS ALSO MANDATORY OPTION
((OPTION_C))
Both
This is optional
((OPTION_D))
None
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E
((EXPLANATION)) This is also optional
measure e m which takes two arguments and allows us to compute a total error value
ENTER CONTENT. QTN CAN HAVE IMAGES ALSO
over the whole dataset. Those two arguments are.
((OPTION_A))
expected and predicted output
THIS IS MANDATORY OPTION
((OPTION_B))
calculated and predicted output
THIS IS ALSO MANDATORY OPTION
((OPTION_C))
calculated and measured output
calculated and measured output
This is optional
((OPTION_D))
none
This is optional
((OPTION_E)) This is optional. If optional keep empty so that
system will skip this option
((CORRECT_CHOICE)) Either A or B or C or D or E A
ENTER CONTENT. QTN CAN HAVE IMAGES ALSO generic training algorithm has to find the global minimum or a point quite close to it
ENTER CONTENT. QTN CAN HAVE IMAGES ALSO proposed a mathematical approach to determine whether a problem is learnable by a
((QUESTION)) Choose the options that is incorrect regarding machine learning (ML) and
artificial intelligence (AI)
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
THIS IS
MANDATORY
OPTION
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH D
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_A)) Linear in D
THIS IS
MANDATORY
OPTION
((OPTION_B)) Exponential in D
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) Linear in N
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH D
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_A)) -1.66
THIS IS
MANDATORY
OPTION
((OPTION_B)) 2
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) 3
This is optional
((OPTION_D)) 4
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) Let us say that we have computed the gradient of our cost function and
stored it in a vector g. What is the cost of one gradient descent update
ENTER given the gradient?
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A)) O(D)
THIS IS
MANDATORY
OPTION
((OPTION_B)) O(N)
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) O(ND)
This is optional
((OPTION_D)) O(ND2)
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) You observe the following while fitting a linear regression to the data: As
you increase the amount of training data, the test error decreases and the
ENTER training error increases. The train error is quite low (almost what you
CONTENT. QTN expect
CAN HAVE it to), while the test error is much higher than the train error.
IMAGES ALSO What do you think is the main reason behind this behavior. Choose the
most probable option
((OPTION_A)) High variance
THIS IS
MANDATORY
OPTION
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) Adding more basis functions in a linear model... (pick the most probably
option)
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_C)) Serration
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH D
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) You are given data about seismic activity in Japan, and you want to
predict a magnitude of the next earthquake, this is in an example of
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_C)) Serration
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_A)) Classification
THIS IS
MANDATORY
OPTION
((OPTION_B)) Regression
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) Clustering
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_A)) True
THIS IS
MANDATORY
OPTION
((OPTION_B)) False
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_A)) Outcome
THIS IS
MANDATORY
OPTION
((OPTION_B)) Feature
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) Attribute
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) It may be better to avoid the metric of ROC curve as it can suffer
from accuracy paradox.
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A)) True
THIS IS
MANDATORY
OPTION
((OPTION_B)) False
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH D
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) The expected value or _______ of a random variable is the center of its
distribution.
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A)) Mode
THIS IS
MANDATORY
OPTION
((OPTION_B)) median
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) mean
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH C
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH D
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A)) variance
THIS IS
MANDATORY
OPTION
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) The square root of the variance is called the ________ deviation
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A)) empirical
THIS IS
MANDATORY
OPTION
((OPTION_B)) mean
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) continuous
This is optional
((OPTION_D)) standard
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH D
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) For continuous random variables, the CDF is the derivative of the PDF.
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A)) True
THIS IS
MANDATORY
OPTION
((OPTION_B)) False
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_A)) True
THIS IS
MANDATORY
OPTION
((OPTION_B)) False
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) Consider the results of a medical experiment that aims to predict whether someone is
going to develop myopia based on some physical measurements and heredity. In this
ENTER case, the input dataset consists of the person’s medical characteristics and the target
variable is binary: 1 for those who are likely to develop myopia and 0 for those who
CONTENT. QTN aren’t. This can be best classified as
CAN HAVE
IMAGES ALSO
((OPTION_A)) Regression
THIS IS
MANDATORY
OPTION
((OPTION_C)) Clustering
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
The purpose of a machine learning model is to approximate an unknown function
((QUESTION))
that
ENTER associates input elements to output ones
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A)) True
THIS IS
MANDATORY
OPTION
((OPTION_B)) False
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
Training set is normally a representation of a global distribution
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A)) True
THIS IS
MANDATORY
OPTION
((OPTION_B)) False
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 2
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
The model has an excessive capacity and it's not more able to
((QUESTION))
generalize considering the original dynamics provided by the training set. This
ENTER problem is called as
CONTENT. QTN
CAN HAVE
IMAGES ALSO
Underfitting
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B)) Overfitting
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) Both
This is optional
((OPTION_D)) None
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
It can associate almost perfectly all the known samples to the corresponding
((QUESTION))
output
ENTER values, but when an unknown input is presented, the corresponding prediction
CONTENT. QTN error can be very high, This problem is called as
CAN HAVE
IMAGES ALSO
Underfitting
((OPTION_A))
THIS IS
MANDATORY
OPTION
Overfitting
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) Both
This is optional
((OPTION_D)) None
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
---------- may prove to be more difficult to discover as it could be initially
((QUESTION))
considered the result of a perfect fitting
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
Underfitting
((OPTION_A))
THIS IS
MANDATORY
OPTION
Overfitting
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) Both
This is optional
((OPTION_D)) None
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
when working with a supervised scenario, we define a non-negative error
((QUESTION))
measure em which takes two arguments and allows us to compute a total error
ENTER value over the whole dataset. Those two arguments are.
CONTENT. QTN
CAN HAVE
IMAGES ALSO
expected and predicted output
((OPTION_A))
THIS IS
MANDATORY
OPTION
calculated and predicted output
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
calculated and measured output
((OPTION_C))
This is optional
none
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
A
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
Initial value represents a starting point over the surface of a n-variables function.
((QUESTION))
A
ENTER generic training algorithm has to find the global minimum or a point quite close
CONTENT. QTN to it
CAN HAVE (there's always a tolerance to avoid an excessive number of iterations and a
IMAGES ALSO consequent risk
of overfitting). This measure is also called
loss function
((OPTION_A))
THIS IS
MANDATORY
OPTION
predicted output
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
measured output
((OPTION_C))
This is optional
mean square error
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH C
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) In particular, a concept is a subset of input patterns X which determine the same
output element
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A)) True
THIS IS
MANDATORY
OPTION
((OPTION_B)) False
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_A)) True
THIS IS
MANDATORY
OPTION
((OPTION_B)) False
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) An exponential time could lead to computational explosions when the datasets
are too large
ENTER or the optimization starting point is very far from an acceptable minimum.
CONTENT. QTN Moreover, it's
CAN HAVE important to remember the so-called …….
IMAGES ALSO
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE First term is called as
IMAGES ALSO
((OPTION_A)) posteriori
THIS IS
MANDATORY
OPTION
((OPTION_B)) Apriori
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) likelihood.
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
second term is called as
CAN HAVE
IMAGES ALSO
((OPTION_A)) posteriori
THIS IS
MANDATORY
OPTION
((OPTION_B)) Apriori
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) likelihood.
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
Third term is called as
CAN HAVE
IMAGES ALSO
((OPTION_A)) posteriori
THIS IS
MANDATORY
OPTION
((OPTION_B)) Apriori
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) likelihood.
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH C
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
THIS IS
MANDATORY
OPTION
THIS IS ALSO
MANDATORY
OPTION
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) Suppose you have the following data with one real-value input
variable & one real-value output variable. What is leave-one out cross
ENTER validation mean square error in case of linear regression (Y = bX+c)?
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A)) 10/27
THIS IS
MANDATORY
OPTION
((OPTION_B)) 20/27
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) 50/27
This is optional
((OPTION_D)) 49/27
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH D
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) Which of the following is/ are true about “Maximum Likelihood
estimate (MLE)”?
ENTER
CONTENT. QTN 1. MLE may not always exist
CAN HAVE 2. MLE always exists
IMAGES ALSO 3. If MLE exist, it (they) may not be unique
4. If MLE exist, it (they) must be unique
((OPTION_A)) 1and4
THIS IS
MANDATORY
OPTION
((OPTION_B)) 2 and3
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) 1 and3
This is optional
((OPTION_D)) 2 and4
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH C
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) Let’s say, a “Linear regression” model perfectly fits the training data
(train error is zero). Now, Which of the following statement is true?
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
THIS IS ALSO
MANDATORY
OPTION
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH C
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
THIS IS
MANDATORY
OPTION
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
THIS IS
MANDATORY
OPTION
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
THIS IS
MANDATORY
OPTION
((OPTION_B)) . The p-value for the null hypothesis Beta coefficient =0 is 0.0001
THIS IS ALSO
MANDATORY
OPTION
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
Which of the following assumptions do we make while deriving linear regression param
((QUESTION))
1. The true relationship between dependent y and predictor x is linear
ENTER 2. The model errors are statistically independent
CONTENT. QTN 3. The errors are normally distributed with a 0 mean and constant standard deviation.
CAN HAVE
IMAGES ALSO
((OPTION_A)) 1,2&3
THIS IS
MANDATORY
OPTION
((OPTION_B)) 1&3
THIS IS ALSO
MANDATORY
OPTION
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH C
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_B)) Barchart
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) Histograms
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_A)) 1&2
THIS IS
MANDATORY
OPTION
((OPTION_B)) Only 1
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) Only 2
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
THIS IS
MANDATORY
OPTION
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH C
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
Which of the following offsets, do we use in case of least square line fit? Suppose horizontal axis is
((QUESTION)) independent variable and vertical axis is dependent variable
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
THIS IS
MANDATORY
OPTION
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
Suppose we have generated the data with help of polynomial regression of degree 3 (degree 3 will
((QUESTION)) perfectly fit this data). Now consider below points and choose the option based on these points.
ENTER 1. Simple Linear regression will have high bias and low variance
CONTENT. QTN 2. Simple Linear regression will have low bias and high variance
3. polynomial of degree 3 will have low bias and high variance
CAN HAVE
IMAGES ALSO Polynomial of degree 3 will have low bias and Low variance
((OPTION_A)) . Only 1
THIS IS
MANDATORY
OPTION
((OPTION_B)) 1&3
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) 1&4
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH C
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) . Suppose you are training a linear regression model. Now consider
these points.
ENTER
CONTENT. QTN 1. Overfitting is more likely if we have less data
CAN HAVE 2. Overfitting is more likely when the hypothesis space is small
IMAGES ALSO
Which of the above statement(s) are correct?
((OPTION_A)) Both are False
THIS IS
MANDATORY
OPTION
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH c
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
Suppose we fit “Lasso Regression” to a data set, which has 100 features (X1,X2…X100). Now, we rescale
((QUESTION)) one of these feature by multiplying with 10 (say that feature is X1), and then refit Lasso regression with
the same regularization parameter.
ENTER
CONTENT. QTN Now, which of the following option will be correct?
CAN HAVE
IMAGES ALSO
THIS IS
MANDATORY
OPTION
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_A)) . 1 and 2
THIS IS
MANDATORY
OPTION
((OPTION_B)) 1 and 3
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) 2 and 4
This is optional
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) . Which of the following metrics can be used for evaluating regression
models?
ENTER 1. R Squared
CONTENT. QTN 2. Adjusted R Squared
CAN HAVE 3. F Statistics
IMAGES ALSO 1. RMSE / MSE / MAE
((OPTION_A)) 2 and 4
THIS IS
MANDATORY
OPTION
((OPTION_B)) 1 and 2.
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) . 2, 3 and 4.
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH D
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) We can also compute the coefficient of linear regression with the help
of an analytical method called “Normal Equation”. Which of the
ENTER following is/are true about “Normal Equation”?
CONTENT. QTN 1. We don’t have to choose the learning rate
CAN HAVE 2. It becomes slow when number of features is very large
IMAGES ALSO 3. No need to iterate
((OPTION_A)) 1 and 2
THIS IS
MANDATORY
OPTION
((OPTION_B)) 1&3
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) 2&3
This is optional
((OPTION_D)) 1,2&3
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH D
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
. The expected value of Y is a linear function of the X(X1,X2….Xn) variables and regression line is
((QUESTION)) defined as:
Y = β0 + β1 X1 + β2 X2……+ βn Xn
ENTER Which of the following statement(s) are true?
1. If Xi changes by an amount ∆Xi, holding other variables constant, then the expected value of Y
CONTENT. QTN changes by a proportional amount βi ∆Xi, for some constant βi (which in general could be a
CAN HAVE positive or negative number).
2. The value of βi is always the same, regardless of values of the other X’s.
IMAGES ALSO 3. The total effect of the X’s on the expected value of Y is the sum of their separate effects.
((OPTION_A)) . 1 and 2
THIS IS
MANDATORY
OPTION
((OPTION_B)) 1 and 3
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) 2 and 3
This is optional
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH D
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_A)) 1
THIS IS
MANDATORY
OPTION
((OPTION_B)) 2
THIS IS ALSO
MANDATORY
OPTION
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 2
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
. Below graphs show two fitted regression lines (A & B) on randomly generated data. Now, I want to find
((QUESTION)) the sum of residuals in both cases A and B.
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO Which of the following statement is true about sum of residuals of A and B
THIS IS
MANDATORY
OPTION
This is optional
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH C
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) If two variables are correlated, is it necessary that they have a linear
relationsh
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A)) YES
THIS IS
MANDATORY
OPTION
((OPTION_B)) NO
THIS IS ALSO
MANDATORY
OPTION
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_A)) TRUE
THIS IS
MANDATORY
OPTION
((OPTION_B)) FALSE
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
Suppose I applied a logistic regression model on data and got training accuracy X and testing accuracy Y.
((QUESTION)) Now I want to add few new features in data. Select option(s) which are correct in such case.
Note: Consider remaining parameters are same.
ENTER 1. Training accuracy always decreases.
2. Training accuracy always increases or remain same.
CONTENT. QTN 3. Testing accuracy always decreases
CAN HAVE Testing accuracy always increases or remain same
IMAGES ALSO
((OPTION_A)) Only 2
THIS IS
MANDATORY
OPTION
((OPTION_B)) Only 1
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) Only3
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
The graph below represents a regression line predicting Y from X. The values on the
((QUESTION)) graph shows the residuals for each predictions value. Use this information to
ENTER compute the SSE.
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A)) 3.02
THIS IS
MANDATORY
OPTION
((OPTION_B)) 0.75
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) 1.01
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_A)) YES
THIS IS
MANDATORY
OPTION
((OPTION_B)) NO
THIS IS ALSO
MANDATORY
OPTION
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH C
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
THIS IS
MANDATORY
OPTION
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_A)) TRUE
THIS IS
MANDATORY
OPTION
((OPTION_B)) FALSE
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) _
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_A)) TRUE
THIS IS
MANDATORY
OPTION
((OPTION_B)) FALSE
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_A)) TRUE
THIS IS
MANDATORY
OPTION
((OPTION_B)) FALSE
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_A)) TRUE
THIS IS
MANDATORY
OPTION
((OPTION_B)) FALSE
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) Which of the following methods do we use to best fit the data in
Logistic Regression?
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
THIS IS
MANDATORY
OPTION
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) One of the very good methods to analyze the performance of Logistic
Regression is AIC, which is similar to R-Squared in Linear
ENTER Regression. Which of the following is true about AIC
CONTENT. QTN
CAN HAVE
IMAGES ALSO
THIS IS
MANDATORY
OPTION
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_A)) TRUE
THIS IS
MANDATORY
OPTION
((OPTION_B)) FALSE
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_A)) ) LASSO
THIS IS
MANDATORY
OPTION
((OPTION_B)) Ridge
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) Both
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) Suppose you have been given a fair coin and you want to find out the
odds of getting heads. Which of the following option is true for such a
ENTER case?
CONTENT. QTN
CAN HAVE
IMAGES ALSO
THIS IS
MANDATORY
OPTION
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH C
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) ) The logit function(given as l(x)) is the log of odds function. What
could be the range of logit function in the domain x=[0,1]?
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A)) (– ∞ , ∞)
THIS IS
MANDATORY
OPTION
((OPTION_B)) (0,1)
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) (0, ∞)
This is optional
((OPTION_D)) (- ∞, 0)
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_A)) Linear Regression errors values has to be normally distributed but in case
of Logistic Regression it is not the case
THIS IS
MANDATORY
OPTION
((OPTION_B)) Linear Regression errors values has to be normally distributed but in case
of Logistic Regression it is not the case
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) Both Linear Regression and Logistic Regression error values have to be
normally distributed
This is optional
((OPTION_D)) Both Linear Regression and Logistic Regression error values have not to
be normally distributed
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
17) Which of the following is true regarding the logistic function for any value “x Note:
((QUESTION)) Logistic(x): is a logistic function of any number “x”
Logit(x): is a logit function of any number “x”
ENTER Logit_inv(x): is a inverse logit function of any number “x””?
CONTENT. QTN
CAN HAVE
IMAGES ALSO
THIS IS
MANDATORY
OPTION
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 2
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) Suppose, You applied a Logistic Regression model on a given data and
got a training accuracy X and testing accuracy Y. Now, you want to
ENTER add a few new features in the same data. Select the option(s) which
CONTENT. QTN is/are correct in such a case.
CAN HAVE
IMAGES ALSO Note: Consider remaining parameters are same.
THIS IS
MANDATORY
OPTION
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A&D
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
THIS IS
MANDATORY
OPTION
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION)) What would do if you want to train logistic regression on same data
that will take less time as well as give the comparatively similar
ENTER accuracy(may not be same)?
CONTENT. QTN
CAN HAVE Suppose you are using a Logistic Regression model on a huge dataset. One
IMAGES ALSO of the problem you may face on such huge data is that Logistic regression
will take very long time to train
((OPTION_A)) Decrease the learning rate and decrease the number of iteration
THIS IS
MANDATORY
OPTION
((OPTION_B)) Decrease the learning rate and increase the number of iteration
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) Increase the learning rate and increase the number of iteration
This is optional
((OPTION_D)) Increase the learning rate and decrease the number of iteration
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH D
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 2
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
Which of the following image is showing the cost function for y =1.
((QUESTION)) Following is the loss function in logistic regression(Y-axis loss function and x axis log probability) for two
class classification problem.
ENTER Note: Y is the target class
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A)) A
THIS IS
MANDATORY
OPTION
((OPTION_B)) B
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) BOTH
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
THIS IS
MANDATORY
OPTION
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH A
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
The odds ratio is
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A)) The ratio of the probability of an event not happening to the probability of the event happening.
THIS IS
MANDATORY
OPTION
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) The ratio of the odds after a unit change in the predictor to the original odds.
This is optional
((OPTION_D)) The ratio of the probability of an event happening to the probability of the event not happening.
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH C
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
Large values of the log-likelihood statistic indicate:
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A)) That there are a greater number of explained vs. unexplained observations.
THIS IS
MANDATORY
OPTION
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C)) That as the predictor variable increases, the likelihood of the outcome occurring decreases.
This is optional
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
Logistic regression assumes a:
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A)) Linear relationship between continuous predictor variables and the outcome variable.
THIS IS
MANDATORY
OPTION
((OPTION_B)) Linear relationship between continuous predictor variables and the logit of the outcome
variable.
THIS IS ALSO
MANDATORY
OPTION
This is optional
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
In binary logistic regression:
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
THIS IS
MANDATORY
OPTION
THIS IS ALSO
MANDATORY
OPTION
This is optional
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH C
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 1
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((OPTION_D)) none
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH C
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH B
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS)) 2
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
((MARKS))
QUESTION IS OF
HOW MANY
MARKS? (1 OR 2
OR 3 UPTO 10)
((QUESTION))
ENTER
CONTENT. QTN
CAN HAVE
IMAGES ALSO
((OPTION_A))
THIS IS
MANDATORY
OPTION
((OPTION_B))
THIS IS ALSO
MANDATORY
OPTION
((OPTION_C))
This is optional
((OPTION_D))
This is optional
((OPTION_E))
This is optional.
If optional keep
empty so that
system will skip
this option
((CORRECT_CH
OICE)) Either A
or B or C or D or
E
((EXPLANATION
)) This is also
optional
This sheet is for 3 Mark questions
S.r No Question Image a b c d Correct Answer
e.g 1 Write down question img.jpg Option a Option b Option c Option d a/b/c/d
1 Which of the following is characteristic of best fast accuracy scalable All above D
machine learning method ?
2 What are the different Algorithm techniques in Supervised Unsupervised Both A & B None of the C
Machine Learning? Learning and Learning and Mentioned
Semi- Transduction
3 ______can be adopted when it's necessary to Supervised Semi- Reinforcement Clusters B
categorize a large amount of data with a few supervised
complete examples or when there's the need to
4 In reinforcement learning, this feedback is usually Overfitting Overlearning Reward None of above C
called as___.
5 In the last decade, many researchers started training Deep learning Machine Reinforcement Unsupervised A
bigger and bigger models, built with several different learning learning learning
layers that's why this approach is called_____.
6 What does learning exactly mean? Robots are A set of data Learning is the It is a set of C
programed so is used to ability to data is used to
that they can discover the change discover the
7 When it is necessary to allow the model to develop a Overfitting Overlearning Classification Regression A
generalization ability and avoid a common problem
called______.
8 Techniques involve the usage of both labeled and Supervised Semi- Unsupervised None of the B
unlabeled data is called___. supervised above
9 there's a growing interest in pattern recognition and Regression Accuracy Modelfree Scalable C
associative memories whose structure and functioning
are similar to what happens in the neocortex. Such an
10 ______ showed better performance than other Machine Deep learning Reinforcement Supervised B
approaches, even without a context-based model learning learning learning
14 Classifications,
Predict time Speech
what is the function of ‘Supervised Learning’? -- series, recognition, Both A & B None of above C
Annotate Regression
strings
15 Object Similarity Automatic
Commons unsupervised applications include -- All above D
segmentation detection labeling
16
the it's impossible
Reinforcement learning is particularly efficient environment is it's often very to have a
-- All above D
when______________. not completely dynamic precise error
deterministic measure