0 ratings0% found this document useful (0 votes) 133 views1 pageAn Introduction To Key Data Science Concepts.
Definitions of 10 fundamental terms for data science and machine learning according to Dataiku
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content,
claim it here.
Available Formats
Download as PDF or read online on Scribd
OE}
iku
AN INTRODUCTION TO KEY DATA SCIENCE CONCEPTS
E
“x
Zz
a
©
O
ah apo =€
“UF
PT Coaze
model (n)
[madal] / noun
1, amathematical representation of areal world
process; a predictive model forecasts a future
‘outcome based on past behaviors.
training (Vv)
[treintn] / verb
1 the process of reatingamodel rom thetraning
data. The data i fed into the training algorithm,
which learns a representation for the problem,
and produces a model. Also called “learning”.
classification (n)
[klzesafokerfon] / noun
1. prediction method that assigns each data
point to a predefined category, e., a type of
operating system,
training set (n)
[treinin set] / noun
1. a dataset used to find potentially predictive
relationships that will be used to create a model.
feature (n)
[fitfar] / noun
1. also known as an independent variable or
a predictor variable, a feature is an observable
quantity, recorded and used by a prediction
model. You can also engineer features by
combining them or adding new Information to
them,
[p<
©® G&D C00 GOGO
10 fundamental terms for data science and machine learning.
algorithm (n)
[zelgart3am] / noun
-Laset of rules used to make a calculation or solve
a problem.
regression (n)
[regrefan] / noun
1. a prediction method whose output is areal
number, thats, a value that represents a quantity
alonga line. Example: predicting the temperature
of an engine or the revenue of a company.
target (n)
[target] / noun
1Linstaistics,tiscalled the dependent variable;
itis the output of the model or the variable you
wish to predict.
test set (n)
[test set] / noun
1. a dataset, separate from the training set but
with the same structure, used to measure and
benchmark the performance of various models
overfitting (v)
{ovvarfitin] / verb
1. a situation in which a model that is too
complex for the data has been trained to predict
the target. This leads to an overly specialized
model, which makes predictions that do not
reflect the reality of the underlying relationship
between the features and target.