0% found this document useful (0 votes)
39 views2 pages

Data Science With R Workflow

Uploaded by

irsad
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
39 views2 pages

Data Science With R Workflow

Uploaded by

irsad
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Data Science with R

Workflow
The Data Science With R Workflow is available in the book: R
For Data Science. If you want to learn R and this
workflow for business analysis, take the R For Business
Analysis (DS4B 101-R) course through Business Science
University. Click the links for
Documentation

ggplot2 (CS)

dplyr (CS) Visualize


stringr (CS)
lubridate (CS)
forcats
Base R (CS)
Transform purrr (CS)
Import Tidy (iteration) Communicate

readr (CS)
readxl / writexl tibble (CS) RMarkdown (CS)
Model
odbc / DBI tidyr (CS) Shiny (CS)
rvest

recipes broom
rsample yardstick
RStudio IDE (CS) fs (file system) parsnip dials

CS = Cheat Sheet

Important Resources
R For Data Science Book: http://r4ds.had.co.nz/
Rmarkdown Book: https://bookdown.org/yihui/rmarkdown/
Data Visualization Book: https://rkabacoff.github.io/datavis/
More Cheatsheets: https://www.rstudio.com/resources/cheatsheets/
tidyverse packages: https://www.tidyverse.org/
Connecting to databases: https://db.rstudio.com/
RMarkdown website: https://rmarkdown.rstudio.com/
Shiny web applications website: http://shiny.rstudio.com/
Jenny Bryan's purrr tutorial: https://jennybryan.org/
Business Science University
"Data Science Education for the Enterprise" university.business­science.io

version: 1.0
Data Science with Text Analysis & NLP Machine Learning
Special Topics Multi-Threaded/Scalable/Production ML:
Text Mining with R (Book): tidytext
NLP: H2O (CS)
H2O word2vec: Word embeddings Extreme Gradient Boosting: xgboost
text2vec: fast vectorization, topic modeling R + Spark: sparklyr (CS)
udpipe: UDPipe C++ lib in R Sparkling Water (Spark + H2O): rsparkling
Time Series Analysis ML (Tidy): parsnip
ML: caret (CS)
Time-aware tibbles: tibbletime & tsibble
Convert between classes: timetk & tsbox Network Analysis
Time Series Index Summary: timetk Deep Learning
Generating Future Series: timetk Network Data Transformations (Tidy): tidygraph
Network Data Transformations: igraph R Interface to TensorFlow Homepage:
Keras (CS)
Forecasting Network Viz TF Estimators
TensorFlow (Core)
ARIMA, ETS, etc: forecast & fable
Static:
Tidy, glance, augment for forecast models: sweep
ggraph - Graph plotting utilities for ggplot2
Converting forecast prediction to tibble: sweep
Interactive (JavaScript):
networkD3 - D3 Networks in R
plotly - plotly.js (network graphs) in R Speed & Scale
Anomaly Detection
Fastest Single-Node Speed: data.table (CS)
Identify anomalies: anomalize Distributed Cluster (Spark): sparklyr (CS)

Geospatial Analysis
Geocoding (getting lat/long, bboxes, & sf's):
Interoperability
ggmap - Google API (requires key)
Python: reticulate
Financial Analysis osmdata - OpenStreet Overpass API
tmaptools - OpenStreet Nominatum API
C++: Rcpp
Java: rJava

Simple Features (sf objects): sf (CS) (tidy)


Getting financial data: tidyquant & quantmod Spatial Objects (sp objects): sp (non-tidy)
Quantitative Analysis: tidyquant & xts/TTR
Portfolio Analysis: tidyquant & Miscellaneous Tools
PerformanceAnalytics
Geospatial Viz Interactive Plotting: htmlwidgets for R
Building R Packages: R packages Book
Financial & Time Viz Static: Pkg Development Tools: devtools (CS)
ggmap - Google API (requires key) R Templates: usethis
Static: osmplotr - Impressive Maps via OSM Build Web Doc's: pkgdown
tidyquant - Financial ggplot2 geoms tmap - Thematic Maps Advanced Concepts (Advanced R Book)
Interactive: cartography (CS) - Thematic Maps Tidy Evaluation (Meta Programming)
highcharter - highchart.js in R Interactive (JavaScript): Making Blogs & Books:
dygraphs - xts plotting leaflet (CS) - leaflet.js in R Make a Website/Blog: blogdown
plotly - plotly.js (financial) in R plotly - plotly.js (maps) in R Write a Web Book: bookdown
Posting Code (GitHub, Stack Overflow): reprex

Business Science University


"Data Science Education for the Enterprise" university.business­science.io

You might also like