Data Science With R Workflow: Important Resources

The document discusses the data science workflow using R. It recommends taking the R For Business Analysis course through Business Science University to learn R and the workflow for business analysis. The course covers importing and tidying data, data visualization, modeling, and communicating results using RMarkdown and Shiny. Important resources for learning the workflow are also listed.

Uploaded by

kumar kumar

Data Science with R Workflow

The Data Science With R Workflow is available in the book R For Data Science. If you want to learn R and this workflow for business analysis, take the R For Business Analysis (DS4B 101-R) course through Business Science University. Click the links for documentation.

Import: readr (CS), readxl / writexl, odbc / DBI, rvest
Tidy: tibble (CS), tidyr (CS)
Transform: dplyr (CS), stringr (CS), lubridate (CS), forcats, Base R (CS)
Visualize: ggplot2 (CS)
Model: recipes, rsample, parsnip, broom, yardstick, dials
Iteration: purrr (CS)
Communicate: RMarkdown (CS), Shiny (CS)
RStudio IDE (CS), fs (file system)

CS = Cheat Sheet

Important Resources
R For Data Science Book: [Link]
RMarkdown Book: [Link]
Data Visualization Book: [Link]
More Cheatsheets: [Link]
tidyverse packages: [Link]
Connecting to databases: [Link]
RMarkdown website: [Link]
Shiny web applications website: [Link]
Jenny Bryan's purrr tutorial: [Link]
Business Science University
"Data Science Education for the Enterprise" [Link] [Link]

version: 1.1
Data Science with R Special Topics

Time Series Analysis
Time-aware tibbles: tibbletime & tsibble
Convert between classes: timetk & tsbox
Time Series Index Summary: timetk
Generating Future Series: timetk

Forecasting
ARIMA, ETS, etc: forecast & fable
Tidy, glance, augment for forecast models: sweep
Converting forecast prediction to tibble: sweep

Anomaly Detection
Identify anomalies: anomalize

Financial Analysis
Getting financial data: tidyquant & quantmod
Quantitative Analysis: tidyquant & xts/TTR
Portfolio Analysis: tidyquant & PerformanceAnalytics

Financial & Time Viz
Static:
  tidyquant - Financial ggplot2 geoms
Interactive:
  highcharter - [Link] in R
  dygraphs - xts plotting
  plotly - [Link] (financial) in R

Text Analysis & NLP
Text Mining with R (Book): tidytext
NLP:
  H2O word2vec: Word embeddings
  text2vec: fast vectorization, topic modeling
  udpipe: UDPipe C++ lib in R

Network Analysis
Network Data Transformations (Tidy): tidygraph
Network Data Transformations: igraph

Network Viz
Static:
  ggraph - Graph plotting utilities for ggplot2
Interactive (JavaScript):
  networkD3 - D3 Networks in R
  plotly - [Link] (network graphs) in R

Geospatial Analysis
Geocoding (getting lat/long, bboxes, & sf's):
  ggmap - Google API (requires key)
  osmdata - OpenStreetMap Overpass API
  tmaptools - OpenStreetMap Nominatim API
Simple Features (sf objects): sf (CS) (tidy)
Spatial Objects (sp objects): sp (non-tidy)

Geospatial Viz
Static:
  ggmap - Google API (requires key)
  osmplotr - Impressive Maps via OSM
  tmap - Thematic Maps
  cartography (CS) - Thematic Maps
Interactive (JavaScript):
  leaflet (CS) - leafl[Link] in R
  plotly - [Link] (maps) in R

Machine Learning
Multi-Threaded/Scalable/Production ML:
  H2O (CS)
  Extreme Gradient Boosting: xgboost
  R + Spark: sparklyr (CS)
  Sparkling Water (Spark + H2O): rsparkling
ML (Tidy): parsnip
ML: caret (CS)

Deep Learning
R Interface to TensorFlow Homepage:
  Keras (CS)
  TF Estimators
  TensorFlow (Core)

Speed & Scale
Fastest Single-Node Speed: [Link] (CS)
Distributed Cluster (Spark): sparklyr (CS)

Interoperability
Python: reticulate
C++: Rcpp
Java: rJava

Miscellaneous Tools
Interactive Plotting: htmlwidgets for R
Building R Packages: R Packages Book
Pkg Development Tools: devtools (CS)
R Templates: usethis
Build Web Docs: pkgdown
Advanced Concepts (Advanced R Book)
rlang & Tidy Evaluation (CS)
Making Blogs & Books:
  Make a Website/Blog: blogdown
  Write a Web Book: bookdown
Posting Code (GitHub, Stack Overflow): reprex
