0% found this document useful (0 votes)
116 views3 pages

Data Mining and Analysis Q&A

This document contains a list of 22 questions without associated answers. The questions cover various topics related to data mining processes like data preparation, different modeling types, statistical techniques for analyzing labeled and unlabeled data, assumptions of regression analysis, and common data mining activities.

Uploaded by

tushar wadile
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as XLSX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
116 views3 pages

Data Mining and Analysis Q&A

This document contains a list of 22 questions without associated answers. The questions cover various topics related to data mining processes like data preparation, different modeling types, statistical techniques for analyzing labeled and unlabeled data, assumptions of regression analysis, and common data mining activities.

Uploaded by

tushar wadile
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as XLSX, PDF, TXT or read online on Scribd

S.

No
1
2
3
4
5
6
7
8

9
10
11

12
13
14
15
16
17
18
19
20
21
22
Questions
Which of the following is not applicable to Data Mining?
The process of extracting valid, useful, unknown info from data and using it to make proactive knowledge driv
What is the other name for Data Preparation stage of Knowledge Discovery Process?
Which of the following role is responsible for performing validation on analysis datasets?
Which of the following activities is performed as part of data pre processing?
Which of the following modelling type should be used for Labelled data?
Noisy values are the values that are valid for the dataset, but are incorrectly recorded
Which statistical technique deals with finding a structure in a collection of unlabeled data?
Probability of theft in an area is 0.03 with expected loss of 20% or 30% of things with probabilities 0.55 and 0.
$150 pa with 100% repayment. Policy with B, costs $100 pa and first $500 of any loss has to be paid by the ow
can be used to choose the policy?
What is the type of learning where a function is inferred to describe hidden structure from unlabeled data
Statistical technique used for investigating and modelling the relationship between two or more variables is:

If time is used as an independent variable in a simple linear regression analysis, which of the following assump
Machine learning task of inferring a function from labelled training data is known as
Which is the statistical technique used for investigating and modelling the relationship between two or more v
Regression is typically carried out to develop a mathematical model of the process
Associate rule is known as _____________
Which data mining method groups together objects that are similar to each other and dissimilar to the other o
Which of the following activities are performed as part of data pre processing?
Which of the following are Multi-class Classification problem?
_________ are the values that mark the boundaries of the confidence interval.
The process of extracting valid, useful, unknown info from data to make proactive knowledge driven business
Simulations are carried out to develop a mathematical model of the process
Answers
Involves working with known information
Data mining
ETL
Statisticians
Detect Missing Values
Predictive Modelling
1
Clustering

Decision Tree
Unsupervised Learning
Regression analysis
Successive observations of the dependent variable are
uncorrelated
Supervised Learning
Regression analysis
1
Affinity analysis
Clustering
All the options
Is this movie a comedy, a documentary, or a thriller?
Confidence limits
Data mining
0

You might also like