Skip to content

Latest commit

 

History

History
49 lines (34 loc) · 2.03 KB

File metadata and controls

49 lines (34 loc) · 2.03 KB

COMPUTER SCIENCE DEPARTMENT, UNIVERSITY OF BONN


Lab Distributed Big Data Analytics

Worksheet-3: ML on Spark (Spark ML and BigDL)


Dr. Hajira Jabeen, Gezim Sejdiu, Denis Lukovnikov, Prof. Dr. Jens Lehmann

April 25, 2019

In this lab we are going to perform basic Spark ML and BigDL operations (described on “Spark Fundamentals II (ML on Spark)”).


IN CLASS


  1. Setup
  2. Implement PySpark-BigDL dummy linreg notebook.
  3. Implement PySpark-BigDL mnist notebook.
  4. Implement PySpark-BigDL mnist cnn notebook.

AT HOME


  1. Reading:
  2. Complete the notebooks
  3. Convert the mnist_cnn notebook to use MLlib’s Pipeline API