Data Science Using R Programming
Topic No Topic Name T1 T2 T3 T4 T5
Nina Peter Hadley Roger D Rafael A
Zumel, Bruce and Wickham Peng, R Irizarry,
Practical Andrew and Garrett Programming Introduction
Data Bruce, Grolemund for Data to Data
Science Practical , R for science, Lean Science,
with R, Statistics Data Publishing,20 LeanPublishi
Manning for Data Science, 16. ng,2016.
Publication Scientists, O’Reilly,2
s,2014. O’Reilly,2 017.
017.
Introduction to data 3
UNIT – I
Data Science science,
Linear Algebra for
data science,
Linear equations,
Distance,
Hyper planes,
Half spaces,
Eigen values,
Eigenvectors.
UNIT II Statistical Modelling,
Random variables, 244
Probability
mass/density
functions,
sample statistics, 63
hypothesis testing. 93
UNIT III Linear Regression, 141 513
Predictive
Modelling:
Simple Linear Regression
model building,
Multiple Linear 324
Regression,
Logistic regression 157 207 349
UNIT IV Installation of R 8
Introduction software and using
to R the interface,
Programming,
getting
started with
R:
Variables and data 345 42
types,
R Objects, 401 25
Vectors and lists, 292 35,37,42 47,52
Operations: 192 53
Arithmetic, Logical
and Matrix
operations,
Data frames, 119 56,75,20 64
functions,
Control structures, 48 62
Debugging and
Simulation in R.
UNIT V performance measures,
Classification:
Logistic regression 516
implementation in R,
K-Nearest neighbours 238 556
(KNN),
K-Nearest neighbours
implementation in R,
Clustering: K-Means 294 627
Algorithm,
K-Means implementation
inR.