SYMBIOSIS SKILLS AND PROFESSIONAL UNIVERSITY
(Established under Govt. of Maharashtra Act No. XXXVII 2017 dated 3rd May 2017)
Kiwale, Adjoining Mumbai - Pune Express Highway, Pune 412 101,
State – Maharashtra, INDIA. | http://www.ssou.ac.in
School Name: School of Data Science
Program Name: Certificate Course in Data Associate
Duration: 3 Months (300 Hours)
Occupation & Description of Role: Python Developer, MySQL Developer, Data Engineer, Data Analyst, Data Visualizer, Data Associate.
Eligibility (Educational): BE/B.Tech (Engg, CSE), MS (Stat, Math), BSc (Stat, Math).
Master's Degree in Science/Technology/Mathematics/Statistics
and
Mathematics as a subject in 12th Standard is compulsory.
Candidates need to qualify in the Entrance Test conducted by the Competent Authority.
Pre-requisite: Understanding of Math / Stats at 12th standard level.
Knowledge of basic computer science principles and skills.
Basic programming knowledge: writing and executing code in any language.
Laptop with minimum 8 GB RAM.
Skills Students Acquire at end of the course:
Generic: Communication skills (spoken, basic), Presentation Skills
Technical: Collate and Analyze Data, Use Python for Programming, Use Tableau, MySQL as part of data analysis, Hadoop Eco System, HIVE, Apache Spark.
Professional: Team Work
Course Objective:
● To develop skills in Python programming.
● To implement a wide variety of Python modules in various applications.
● To develop data handling and data analysing skills.
● To handle big data.
● To give each student a realistic perspective of work and work expectations, to help formulate problem solving skills, and to guide students in making appropriate and responsible decisions.
● To create a desire to fulfil individual goals, and to educate students about unproductive thinking, self-defeating emotional impulses, and self-defeating behaviour.
Course Learning Outcomes: After completing this programme, participants will be able to:
● Work with Python scripting elements such as variables, flow control structures, functions and file handling.
● Create and manipulate Python programs using data structures such as lists, dictionaries, tuples and sets.
● Apply the fundamentals of some of the most widely used Python packages, including NumPy, Pandas and Matplotlib, to Data Analysis and Data Visualization projects.
● Execute various basic and advanced SQL queries.
● Apply various normalization techniques and MySQL programming concepts.
● Create crosstabs, charts, maps and dashboards using Tableau.
● Develop Big Data solutions using the Hadoop Eco System.
● Query big data with Hive.
● Manage big data with Apache Spark.
● Understand the significance of personality development and presentation skills.
● Build the right attitude.
● Adopt interview and self-introduction skills.
Module list:
Python for Data Analysis
Managing with Data
Analyzing Data from Disparate Sources
IDSC
Project
Suggested Learning Resources (but not limited to)
Sr. No | Title of the Book / Link | Author / Website | Edition / Volume | Text (T) / Reference (R)
1 | Hadoop Illuminated | Mark Kerzner & Sujee Maniyam | 2014 | T1
2 | Data Mining: Practical Machine Learning Tools and Techniques | Ian H. Witten & Eibe Frank | 2005 | T2
3 | Big Data Now | O’Reilly Media | 2012 | R1
4 | Head-First Python | Paul Barry | 2nd Edition | R2
5 | Hadoop: The Definitive Guide: Storage and Analysis at Internet Scale | Tom White | 4th Edition | R3
6 | Big Data Analytics with Hadoop 3.0 | Sridhar Alla | | R4
7 | CASE STUDY OF HIVE USING HADOOP | Sai Prasad Potharaju, Shanmuk Srinivas A, Ravi Kumar Tirandasu | https://www.researchgate.net/publication/314724458_CASE_STUDY_OF_HIVE_USING_HADOOP/link/58c5114192851c0ccbf7fb57/download | Case Study 1
8 | https://hadoopbaseblog.wordpress.com/2017/09/04/hive-ddl-loading-data-into-hive-tables/ | | | Online