Diploma in Data Analytics
Lesson 1: Introduction to R
Lesson Objectives
• Introduction to R
• Working in R
• Maximum likelihood
Introduction to R
ONE
What is R
A powerful statistical programming language and free software environment for data analysis
Why R
• Open source
• Ranks 8th in the TIOBE index (and rising)
• Windows, Mac, Linux OS
• Libraries implement a wide variety of statistical and graphical techniques
Download R: [Link]
RStudio
Integrated development environment (IDE), or front end, for R
Why RStudio
• Open source
• Windows, Mac, Linux OS
• User-friendly
Download RStudio: [Link] products/rstudio/download/
Working in R
TWO
R packages
• Over 15 000 packages
• Popular, widely used packages: tidyr, plotly, ggplot2, etc.
Source: [Link]
ISLR library
Package containing the collection of datasets used in the book An Introduction to Statistical Learning with Applications in R (ISLR)
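A minimal sketch of installing and loading a package, using ISLR as the example. It assumes an internet connection for the one-time install and uses the Auto dataset purely as an illustration; any dataset bundled with the package would do.

```r
# install.packages("ISLR")  # one-time download from CRAN; uncomment if needed

# requireNamespace() reports whether the package is installed without attaching it
islr_available <- requireNamespace("ISLR", quietly = TRUE)

if (islr_available) {
  library(ISLR)                                    # attach the package
  print(data(package = "ISLR")$results[, "Item"])  # names of the bundled datasets
  data("Auto", package = "ISLR")                   # load one dataset into the workspace
  print(head(Auto))                                # first rows of the Auto dataset
} else {
  message("ISLR is not installed; run install.packages(\"ISLR\") first.")
}
```

The same pattern (install once, then library() in each session) applies to tidyr, plotly, ggplot2 and other CRAN packages.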
Maximum likelihood
THREE
Estimation
• Point
• Interval
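To make the two kinds of estimate concrete, here is a small sketch in base R. The sample values are made up for illustration, and the 95% level is an assumption, not something fixed by the lesson.

```r
# Hypothetical sample: ten measurements invented for this illustration
x <- c(4.8, 5.1, 5.6, 4.9, 5.3, 5.0, 5.4, 4.7, 5.2, 5.0)

# Point estimate: a single number for the population mean
point_estimate <- mean(x)

# Interval estimate: a 95% confidence interval for the same mean
interval_estimate <- t.test(x, conf.level = 0.95)$conf.int

print(point_estimate)
print(interval_estimate)
```

The point estimate gives one best guess; the interval gives a range of plausible values around it.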
Methods of estimation
• Method of least squares
• Maximum likelihood
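As an illustration of the method of least squares, the sketch below fits a straight line with lm(). It assumes the built-in cars dataset (speed and stopping distance), chosen only because it ships with R.

```r
# Least squares fit of stopping distance on speed, using the built-in cars data
fit_ls <- lm(dist ~ speed, data = cars)

# Estimated intercept and slope: the values that minimise the sum of squared residuals
ls_coefficients <- coef(fit_ls)
print(ls_coefficients)

# The quantity being minimised
residual_sum_of_squares <- sum(residuals(fit_ls)^2)
print(residual_sum_of_squares)
```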
Maximum likelihood
Method for estimating the parameters of a distribution: choose the parameter values under which the observed data are most probable
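In symbols, the idea can be written as follows. The notation is an assumption made for this sketch: x_1, ..., x_n are independent observations with density f(x | θ), L is the likelihood, and ℓ is the log-likelihood.

```latex
% Assumed notation: x_1,...,x_n independent observations with density f(x | \theta)
L(\theta) = \prod_{i=1}^{n} f(x_i \mid \theta)
\qquad
\ell(\theta) = \log L(\theta) = \sum_{i=1}^{n} \log f(x_i \mid \theta)
\qquad
\hat{\theta} = \arg\max_{\theta} \ell(\theta)
```

Maximising the log-likelihood gives the same estimate as maximising the likelihood, and the sum is usually easier to work with than the product.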
Maximum likelihood estimation
Which model fits the data best?
Source: [Link]
explained-maximum-likelihood-estimation-c7b4342fdbb1
Maximum likelihood estimation
Normal distribution
The blue line is the best fit for the data points
Source: [Link]
estimation-c7b4342fdbb1
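A minimal sketch of maximum likelihood for a normal distribution in base R. The data are simulated, so the true mean (10) and standard deviation (2) are assumptions for illustration. The sketch compares the known closed-form estimates with a numerical search using optim().

```r
set.seed(1)
# Simulated data; the true mean and sd are assumptions chosen for this example
x <- rnorm(200, mean = 10, sd = 2)

# Closed-form maximum likelihood estimates for a normal distribution
mu_hat <- mean(x)
sigma_hat <- sqrt(mean((x - mu_hat)^2))  # divides by n, not n - 1

# Negative log-likelihood; sd is parameterised on the log scale so it stays positive
neg_log_lik <- function(par, data) {
  mu <- par[1]
  sigma <- exp(par[2])
  -sum(dnorm(data, mean = mu, sd = sigma, log = TRUE))
}

# Numerical search, started from rough robust guesses
fit_ml <- optim(par = c(median(x), log(IQR(x))), fn = neg_log_lik,
                data = x, method = "BFGS")
mu_numeric <- unname(fit_ml$par[1])
sigma_numeric <- exp(unname(fit_ml$par[2]))

print(c(closed_form = mu_hat, numeric = mu_numeric))
print(c(closed_form = sigma_hat, numeric = sigma_numeric))
```

The two approaches should agree closely. The normal density with these estimated parameters plays the role of the best-fitting curve for the data.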