1 Preface


1.1 Summary

Working with BIG DATA requires a particular suite of data analytics tools and advanced techniques, such as machine learning (ML). Many of these tools are readily and freely available in R. This full-day session will provide participants with a hands-on training on how to use data analytics tools and machine learning methods available in R to explore, visualize, and model big data.

The first half of our training session will focus on organizing (manipulating and summarizing) and visualizing (both statically and dynamically) big data in R. The second half will involve a series of short lectures on ML techniques (decision trees, random forests, and support vector machines), as well as hands-on demonstrations applying these methods in R. Examples will be drawn from the OECD’s Programme for International Student Assessment (PISA). Participants will get opportunities to work through several hands-on lab sessions throughout the day.

1.2 Who we are

  1. Okan Bulut – University of Alberta

  2. Christopher D. Desjardins – University of Minnesota


We also co-authored: