Data Analytics Unit 2
Digital Notes
[Department of Computer Science Engineering]
Subject Name : Introduction to Data Analytics
Subject Code : BCDS-501
Course : B. Tech
Branch : CSE
Semester : V
Prepared by : Mr. Anand Prakash Dwivedi
Unit – 2
Data Analysis:
What is Regression Analysis?
Regression analysis is a statistical technique for modeling and estimating the relationship between a dependent variable and one or more independent variables. The simple linear regression model is

y = β0 + β1x + ε

where y is the dependent variable, x is the independent variable, β0 and β1 are the population intercept and slope, and ε is a random error term. The estimated regression line is

ŷi = b0 + b1xi

where b0 and b1 are the sample estimates of β0 and β1.
Simple Linear Regression Example
A real estate agent wishes to examine the relationship between the selling price
of a home and its size (measured in square feet)
A random sample of 10 houses is selected
Dependent variable (y) = house price in $1000s
Independent variable (x) = square feet
Least Squares Regression
The least squares method chooses the estimates b0 and b1 so as to minimize the sum of squared differences between the observed and predicted values of y.
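A minimal Python sketch of the least-squares computation for this example follows. The ten (square feet, price) pairs are invented for illustration; only the setup (10 houses, size in square feet, price in $1000s) comes from the example above.

    # Least-squares fit of house price on square footage.
    # The data pairs are hypothetical, invented for illustration.
    import numpy as np

    square_feet = np.array([1400, 1600, 1700, 1875, 1100,
                            1550, 2350, 2450, 1425, 1700])
    price = np.array([245, 312, 279, 308, 199,
                      219, 405, 324, 319, 255])  # house price in $1000s

    # Least-squares estimates: b1 = cov(x, y) / var(x), b0 = mean(y) - b1 * mean(x)
    x_bar, y_bar = square_feet.mean(), price.mean()
    b1 = ((square_feet - x_bar) * (price - y_bar)).sum() / ((square_feet - x_bar) ** 2).sum()
    b0 = y_bar - b1 * x_bar

    print(f"fitted line: y-hat = {b0:.2f} + {b1:.4f} x")
    print(f"predicted price of a 2000 sq ft house ($1000s): {b0 + b1 * 2000:.1f}")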
Polynomial Regression
This is a special case of multivariate regression, with only one independent variable
x, but an x-y relationship which is clearly nonlinear (at the same time, there is no
‘physical’ model to rely on).
y = β0 + β1x + β2x^2 + β3x^3 + … + βnx^n + ε

Effectively, this is the same as having a multivariate model with x1 ≡ x, x2 ≡ x^2, x3 ≡ x^3, and so on.
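A short sketch of this reduction: fitting a polynomial is an ordinary least-squares fit on the columns 1, x, x^2, …, x^n. The synthetic data and the degree chosen below are assumptions for illustration only.

    # Polynomial regression as multivariate linear regression on 1, x, x^2, ..., x^n.
    # Synthetic data; the true curve is quadratic plus noise.
    import numpy as np

    rng = np.random.default_rng(0)
    x = np.linspace(0, 4, 30)
    y = 1.0 + 2.0 * x - 0.5 * x ** 2 + rng.normal(0, 0.2, x.size)

    n = 2  # degree of the polynomial
    X = np.vander(x, n + 1, increasing=True)       # columns: 1, x, x^2
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)   # ordinary least squares
    print("b0, b1, b2 =", beta)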
NONLINEAR REGRESSION
This is a model with one independent variable (the results can be easily extended to
several) and ‘n’ unknown parameters, which we will call b1,
b2, ... bn:
y = f(x, b) + ε

where f(x, b) is a specific (given) function of the independent variable and the 'n' parameters.
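A brief sketch of fitting such a model numerically, here using SciPy's curve_fit; the particular f (an exponential decay) and the synthetic data are assumptions chosen only to illustrate the idea.

    # Nonlinear regression y = f(x, b) + error, fitted with SciPy.
    # f is an assumed example model (exponential decay with two parameters).
    import numpy as np
    from scipy.optimize import curve_fit

    def f(x, b1, b2):
        return b1 * np.exp(-b2 * x)

    rng = np.random.default_rng(1)
    x = np.linspace(0, 5, 40)
    y = f(x, 2.5, 1.3) + rng.normal(0, 0.05, x.size)  # synthetic observations

    b_hat, b_cov = curve_fit(f, x, y, p0=[1.0, 1.0])  # p0 = initial guess for b
    print("estimated parameters b1, b2:", b_hat)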
• In situations where
– a latent construct cannot be appropriately represented as a
continuous variable,
– ordinal or discrete indicators do not reflect underlying continuous
variables,
– the latent variables cannot be assumed to be normally distributed,
traditional Gaussian modeling is clearly not appropriate.
• In addition, normal-distribution analysis imposes minimum requirements on the number of observations and assumes that variables are measured on a continuous scale.
Bayesian analysis works with three kinds of probability:
• A priori probability
• Conditional probability
• A posteriori probability
Bayes’ Theorem
Why does it matter? If 1% of a population have cancer, then for a screening test with 80% sensitivity and 95% specificity:

P[Test +ve | Cancer] = 80%
P[Test +ve] = 0.01 × 0.80 + 0.99 × 0.05 = 0.0575, so P[Test +ve] / P[Cancer] = 5.75
P[Cancer | Test +ve] = P[Test +ve | Cancer] × P[Cancer] / P[Test +ve] ≈ 14%

… i.e. most positive results are actually false alarms.
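The same numbers can be checked with a few lines of Python; nothing here is new, it simply reproduces the calculation above.

    # Bayes' theorem applied to the screening-test numbers from the text.
    p_cancer = 0.01      # prevalence: 1% of the population
    sensitivity = 0.80   # P[Test +ve | Cancer]
    specificity = 0.95   # P[Test -ve | No Cancer]

    # Total probability of a positive test:
    p_pos = p_cancer * sensitivity + (1 - p_cancer) * (1 - specificity)

    # Posterior probability of cancer given a positive test:
    p_cancer_given_pos = sensitivity * p_cancer / p_pos
    print(p_pos)               # 0.0575
    print(p_cancer_given_pos)  # about 0.139, i.e. roughly 14%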
So a Bayesian network is a pair BN = (DAG, CPD): a directed acyclic graph together with a set of conditional probability distributions. Each node in the graph represents a random variable.
What is Inference in BN?
— Using a Bayesian network to compute probabilities is called inference
— In general, inference involves queries of the form:
P( X | E )
where X is the query variable and E is the evidence variable.
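As a minimal illustration, the query P(X | E) can be answered in a two-node network C → D by enumerating the joint distribution and normalizing. The CPD numbers below are invented for illustration.

    # Inference P(C | D = true) in a two-node network C -> D by enumeration.
    # The CPD numbers are invented for illustration.
    p_c = {True: 0.2, False: 0.8}          # prior P(C)
    p_d_given_c = {True: 0.9, False: 0.1}  # P(D = true | C)

    # Enumerate the joint P(C, D = true) and normalize to get the posterior.
    joint = {c: p_c[c] * p_d_given_c[c] for c in (True, False)}
    z = sum(joint.values())
    posterior = {c: joint[c] / z for c in joint}
    print(posterior)  # P(C = true | D = true) = 0.18 / 0.26, about 0.69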
Summary
— Bayesian methods provide a sound theory and framework for the implementation of classifiers.
— Bayesian networks are a natural way to represent conditional independence information: qualitative information in the links, quantitative information in the tables.
— Computing exact values is NP-complete or NP-hard; it is typical to make simplifying assumptions or use approximate methods.
— Many Bayesian tools and systems exist.
— Bayesian networks are an efficient and effective representation of the joint probability distribution of a set of random variables.
Efficient:
o Local models
o Independence (d-separation)
Effective:
o Algorithms take advantage of structure to
  - compute posterior probabilities
  - compute the most probable instantiation
  - support decision making
Introduction to SVM
Support vector machines (SVMs) are powerful yet flexible supervised machine learning
algorithms which are used both for classification and regression. But generally, they
are used in classification problems. SVMs were first introduced in the 1960s and later refined in the 1990s. They have their own unique way of implementation as compared to
other machine learning algorithms. Lately, they are extremely popular because of their
ability to handle multiple continuous and categorical variables.
Working of SVM
An SVM model is basically a representation of different classes in a hyperplane in
multidimensional space. The hyperplane will be generated in an iterative manner by
SVM so that the error can be minimized. The goal of SVM is to divide the datasets into
classes to find a maximum marginal hyperplane (MMH).
The following are important concepts in SVM −
Support Vectors − Data points that are closest to the hyperplane are called support vectors. The separating line is defined with the help of these data points.
Hyperplane − A hyperplane is the decision plane or boundary that separates a set of objects belonging to different classes.
Margin − The margin may be defined as the gap between two lines drawn on the closest data points of different classes. It can be calculated as the perpendicular distance from the line to the support vectors. A large margin is considered a good margin and a small margin is considered a bad margin.
The main goal of SVM is to divide the datasets into classes so as to find a maximum marginal hyperplane (MMH), which is done in the following two steps −
First, SVM generates hyperplanes iteratively that segregate the classes in the best way.
Then, it chooses the hyperplane that separates the classes correctly.
SVM Kernels
In practice, the SVM algorithm is implemented with a kernel that transforms the input data space into the required form. SVM uses a technique called the kernel trick, in which the kernel takes a low-dimensional input space and transforms it into a higher-dimensional space. In simple words, the kernel converts non-separable problems into separable problems by adding more dimensions. This makes SVM more powerful, flexible and accurate. The following are some of the types of kernels used by SVM.
Linear Kernel
It can be used as a dot product between any two observations. The formula of the linear kernel is as below −

K(x, xi) = sum(x ∗ xi)

From the above formula, we can see that the kernel between two vectors, say x and xi, is the sum of the products of each pair of input values.
Polynomial Kernel

The polynomial kernel is a more generalized form of the linear kernel and can distinguish curved or nonlinear input spaces. A common form is

K(x, xi) = (1 + sum(x ∗ xi))^d

where d is the degree of the polynomial.

RBF Kernel

The RBF (radial basis function) kernel, mostly used in SVM classification, maps the input space into an indefinite-dimensional space. The following formula explains it mathematically −

K(x, xi) = exp(−gamma ∗ sum((x − xi)^2))

Here, gamma ranges from 0 to 1. We need to manually specify it in the learning algorithm. A good default value of gamma is 0.1.
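A short classification sketch using scikit-learn (an assumed library choice; the notes do not name one), comparing the linear and RBF kernels on synthetic data:

    # SVM classification with linear and RBF kernels on synthetic data.
    from sklearn.datasets import make_classification
    from sklearn.model_selection import train_test_split
    from sklearn.svm import SVC

    X, y = make_classification(n_samples=200, n_features=4, random_state=0)
    X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

    for kernel in ("linear", "rbf"):
        clf = SVC(kernel=kernel, gamma=0.1)  # gamma is used by the RBF kernel only
        clf.fit(X_train, y_train)
        print(kernel, "test accuracy:", clf.score(X_test, y_test))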
Time Series Analysis
• Aim:
– To collect and analyze the past observations to develop an appropriate model which
can then be used to generate future values for the series.
• Time Series Forecasting is based on the idea that the history of occurrences over
time can be used to predict the future
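As a minimal sketch of this idea, a simple moving-average forecast predicts the next value from the average of the most recent observations; the monthly series below is invented for illustration.

    # Moving-average forecast over a hypothetical monthly sales series.
    sales = [112, 118, 132, 129, 121, 135, 148, 148, 136, 119]

    window = 3
    forecast = sum(sales[-window:]) / window  # average of the last 3 observations
    print(f"next-period forecast: {forecast:.1f}")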
Application
• Business
• Economics
• Finance
• Science and Engineering
Rule Induction
• Some rule induction systems induce more complex rules, in which values of attributes may be expressed by negation of some values or by a value subset of the attribute domain.
• Data from which rules are induced are usually presented in a form similar to a table, in which cases (or examples) are labels (or names) for rows, and variables are labeled as attributes and a decision. We will restrict our attention to rule induction from such tables.
• A very simple example of such a table is presented as Table 1.1, in which the attributes are Temperature, Headache, Weakness and Nausea, and the decision is Flu. The set of all cases labeled by the same decision value is called a concept. For Table 1.1, the case set {1, 2, 4, 5} is the concept of all cases affected by flu (for each case from this set the corresponding value of Flu is yes).
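A small sketch of the concept computation; the attribute values below are hypothetical (Table 1.1 itself is not reproduced in these notes), but the decision column matches the statement that cases 1, 2, 4 and 5 have Flu = yes.

    # Computing the concept {1, 2, 4, 5} from a hypothetical decision table.
    cases = {
        1: {"Temperature": "high",      "Headache": "yes", "Flu": "yes"},
        2: {"Temperature": "very_high", "Headache": "yes", "Flu": "yes"},
        3: {"Temperature": "normal",    "Headache": "no",  "Flu": "no"},
        4: {"Temperature": "high",      "Headache": "no",  "Flu": "yes"},
        5: {"Temperature": "high",      "Headache": "yes", "Flu": "yes"},
        6: {"Temperature": "normal",    "Headache": "yes", "Flu": "no"},
    }

    # A concept is the set of all cases labeled by the same decision value.
    concept_flu_yes = {c for c, row in cases.items() if row["Flu"] == "yes"}
    print(concept_flu_yes)  # {1, 2, 4, 5}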
Applications of Neural Networks
• Investment analysis
• Control systems & monitoring
• Mobile computing
• Marketing and financial applications
• Forecasting – sales, market research, meteorology
Advantages:
• A neural network can perform tasks that a linear program can not.
• When an element of the neural network fails, it can continue without any
problem by their parallel nature.
• A neural network learns and does not need to be reprogrammed.
• It can be implemented in any application without any problem.
Disadvantages:
• The neural network needs training to operate.
• The architecture of a neural network is different from the architecture of microprocessors, and therefore needs to be emulated.
• Large neural networks require high processing time.
Conclusions
• Neural networks provide the ability to build more human-like AI.
• They take rough approximations and hard-coded reactions out of AI design (e.g. rules and FSMs).
• They still require a lot of fine-tuning during development.
Principal Component Analysis (PCA)
• The PCA method is a statistical method for feature selection and dimensionality reduction.
• Feature selection is a process whereby a data space is transformed into a feature space. In principle, both spaces have the same dimensionality.
• However, in the PCA method, the transformation is designed in such a way that the data set can be represented by a reduced number of "effective" features and yet retain most of the intrinsic information contained in the data; in other words, the data set undergoes a dimensionality reduction.
• Suppose that we have a vector x of dimension m and we wish to transmit it using l numbers, where l < m. If we simply truncate the vector x, we will cause a mean square error equal to the sum of the variances of the elements eliminated from x.
• So, we ask: Does there exist an invertible linear transformation T such that the
truncation of Tx is optimum in the mean-squared sense?
• Clearly, the transformation T should have the property that some of its
components have low variance.
• Principal Component Analysis maximises the rate of decrease of variance and is
the right choice.
• Before we present neural-network (Hebbian-based) algorithms that do this, we first present the statistical analysis of the problem.
• Assume that the input vector X has zero mean: E[X] = 0, where E is the statistical expectation operator. If X does not have zero mean, we first subtract the mean from X before we proceed with the rest of the analysis.
• Let q denote a unit vector, also of dimension m, onto which the vector X is to be projected. This projection is defined by the inner product of the vectors X and q:
• A = X^T q = q^T X
• ||q|| = (q^T q)^(1/2) = 1
• The projection A is a random variable with a mean and variance related to the statistics of vector X. Assuming that X has zero mean, we can calculate the mean value of the projection A:
• E[A] = q^T E[X] = 0
• The variance of A is therefore the same as its mean-square value, and so we can write:
• σ² = E[A²] = E[(q^T X)(X^T q)] = q^T E[X X^T] q = q^T R q
• The m-by-m matrix R is the correlation matrix of the random vector X, formally defined as the expectation of the outer product of the vector X with itself, as shown:
• R = E[X X^T]
• We observe that the matrix R is symmetric, which means that R^T = R.
• Let q1, q2, …, qm denote the unit (column) eigenvectors of R. The projections of x onto these eigenvectors can be collected into a single vector:
• a = [a1, a2, …, am]^T = [x^T q1, x^T q2, …, x^T qm]^T = Q^T x
• Where Q is the matrix which is constructed by the (column) eigenvectors of R.
• From the above we see that:
• x = Qa
• This is nothing more than a coordinate
transformation from the input space, of vector x, to the feature space of the
vector a.
• From the perspective of pattern recognition, the usefulness of the PCA method is that it provides an effective technique for dimensionality reduction.
• In particular, we may reduce the number of features needed for effective data representation by discarding those linear combinations in the previous formula that have small variances, and retain only those terms that have large variances.
• Let λ1, λ2, …, λl denote the l largest eigenvalues of R. We may then approximate the vector x by keeping only the corresponding l terms:
• x ≈ a1 q1 + a2 q2 + … + al ql
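A compact numerical sketch of the whole procedure (zero-mean data, correlation matrix, eigendecomposition, truncation to the top l components); the synthetic data and the choice l = 2 are assumptions for illustration.

    # PCA via eigendecomposition of the correlation matrix R = E[X X^T].
    import numpy as np

    rng = np.random.default_rng(2)
    X = rng.normal(size=(500, 5))   # 500 samples of a 5-dimensional vector
    X = X - X.mean(axis=0)          # subtract the mean first

    R = (X.T @ X) / X.shape[0]      # sample estimate of E[X X^T]
    eigvals, Q = np.linalg.eigh(R)  # R is symmetric; columns of Q are eigenvectors

    l = 2                                  # number of components to keep
    idx = np.argsort(eigvals)[::-1][:l]    # indices of the l largest eigenvalues
    Q_l = Q[:, idx]

    a = X @ Q_l            # feature-space coordinates a = Q^T x, per sample
    x_approx = a @ Q_l.T   # approximation of x from the top-l components
    print("fraction of variance retained:", eigvals[idx].sum() / eigvals.sum())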
Definition of fuzzy
Fuzzy – “not clear, distinct, or precise; blurred”
Definition of fuzzy logic
A form of knowledge representation suitable for notions that
cannot be defined precisely, but which depend upon their
contexts.
The problem
Change the speed of a heater fan based on the room temperature and humidity.
A temperature control system has four settings
Cold, Cool, Warm, and Hot
Humidity can be defined by:
Low, Medium, and High
Using this we can define the fuzzy set.
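A small sketch of such fuzzy sets as membership functions; the triangular shapes and the temperature breakpoints (in °C) are assumptions chosen only for illustration.

    # Triangular fuzzy membership functions for the temperature settings.
    def triangular(x, a, b, c):
        # Membership rises from a to a peak at b, then falls to c.
        if x <= a or x >= c:
            return 0.0
        return (x - a) / (b - a) if x <= b else (c - x) / (c - b)

    temperature_sets = {
        "Cold": lambda t: triangular(t, -10, 0, 10),
        "Cool": lambda t: triangular(t, 5, 12, 20),
        "Warm": lambda t: triangular(t, 15, 23, 30),
        "Hot":  lambda t: triangular(t, 25, 35, 45),
    }

    t = 18
    print({name: round(mu(t), 2) for name, mu in temperature_sets.items()})
    # at 18 degrees the room is partly "Cool" (0.25) and partly "Warm" (0.38)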
ANTI-LOCK BRAKING SYSTEM (ABS)
Nonlinear and dynamic in nature
Inputs for Intel Fuzzy ABS are derived from
Brake
4 WD
Feedback
Wheel speed
Ignition
Outputs
Pulsewidth
Error lamp
Stochastic search
Stochastic search and optimization techniques are used in a vast number of areas,
including aerospace, medicine, transportation, and finance, to name but a
few. Whether the goal is refining the design of a missile or aircraft, determining the
effectiveness of a new drug, developing the most efficient timing strategies for
traffic signals, or making investment decisions in order to increase profits, stochastic
algorithms can help researchers and practitioners devise optimal solutions to
countless real-world problems.