0% found this document useful (0 votes)
86 views1 page

Data Science Prodegree Curriculum Overview

This document outlines the curriculum for a data science program. The program covers topics including all about data, R, SAS, SQL, Python, Tableau, and job readiness. It includes over 200 hours of content organized into modules on these topics. Students will complete 7 hands-on projects, including projects on default modeling, credit risk analytics, intrusion detection, and a group project.

Uploaded by

skumarites
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
86 views1 page

Data Science Prodegree Curriculum Overview

This document outlines the curriculum for a data science program. The program covers topics including all about data, R, SAS, SQL, Python, Tableau, and job readiness. It includes over 200 hours of content organized into modules on these topics. Students will complete 7 hands-on projects, including projects on default modeling, credit risk analytics, intrusion detection, and a group project.

Uploaded by

skumarites
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 1

CURRICULUM:

DATA SCIENCE PRODEGREE

PROJECT 4 Project 4 - Default Modelling using Logistic Regression in Python


INTRODUCTION - 14.5 HOURS
PROJECT 5 Project 5 - Credit Risk Analytics using SVM in Python

Intro to Program | Curriculum Overview | Learning Methodology | Project 6 - Intrusion Detection using Decision Trees & Ensemble
BATCH LAUNCH PROJECT 6
Guest Lecture Learning in Python

Data | Variables | Data Types | Measures of Central Tendency in Data


ALL ABOUT DATA | Understanding Skewness in Data | Measures of Dispersion | Data SAS - 49 HOURS
Distribution

INTRODUCTION TO What is SAS? | Key Features | Submitting a SAS Program | SAS Program
R - 76 HOURS SAS AND SAS
PROGRAMS
Syntax Examining SAS Datasets Accessing SAS Libraries | Sorting and
Grouping Reporting Data | Using SAS Formats

R Base Software | Understanding CRAN | RStudio The IDE | Basic Reading SAS Datasets | Reading Excel Data | Reading Raw Files |
R BASICS Building Blocks in R | Sequence of Numbers in R | Understanding READING AND
Reading Database Data | Creating Summary Reports | Combining
Vectors in R | Basic Operations Operators and Types MANIPULATING DATA
Datasets

Handling Missing Values in R | Subsetting Vectors in R | Matrices and DATA Writing Observations | Writing to Multiple Datasets | Accumulating Total
R FUNCTIONS Data Frames in R | Logical Statements in R | Lapply, sapply, vapply and TRANSFORMATIONS Creating Accumulating Total for a Group of Data | Data Transformations
tapply Functions
Introduction to Macro Variables | Automatic Macro Variables | User
LINEAR REGRESSION Covariance and Correlation | Multivariate Analysis | Assumptions of Defined Macro Variables | Macro Variable Reference | Defining and
MACROS
THEORY - R Linearity Hypothesis Testing | Limitations of Regression Calling Macros | Macro Parameters | Global and Local Symbol Table |
Creating Macro Variables in the Data Step
BUSINESS CASE: Business Case : Managing Credit Risk | Meaning of Credit Risk | Impact
MANAGING CREDIT of Credit Default | Sources of Data for Managing Risk | Understanding Introduction to SQL | How Does RDBMS Work? | SQL Procedures |
RISK Loss Given Default | Understanding Default Specifying Columns | Specifying Rows | Presenting Data | Summarizing
SQL Data | Writing Join Queries using SQL | Working with Subqueries,
Loss Given Default Linear Regression R | Extract Data in R | Univariate Indexes and Views | Set Operators | Creating Tables and Views using
Analysis of Data | Apply Data Transformations | Bivariate Analysis of Proc SQL
LOSS GIVEN DEFAULT Data | Identify Multicollinearity in Data | Treatment on Data | Identify
LINEAR REGRESSION Heteroscedasticity Discuss what could be the Reason for Heteroscedasticity PROJECT 7 Project 7 - Store Data Analytics in SAS
R | Modelling of Data Variable Significance Identification | Model
Significance Test | Predict using Testing Data Set | Validate the Model
Performance TABLEAU - 6 HOURS
LOGISTIC REGRESSION Reason for Logistic Regression | The Logistic Transform | Logistic
THEORY - R Regression Modelling | Model Optimisation | Understanding ROC Curve Introduction to Visualization | Working with Tableau | Visualization in
TABLEAU BASIC Depth Data Organisation | Advanced Visualization | Mapping |
PROJECT 1 Project 1 - Default Modelling using Logistic Regression in R Enterprise Dashboards Data Presentation

Introduction to SVM | Classification as a Hyper Plane Location Problem | INTRODUCTION TO


SUPPORT VECTOR THE GROUP PROJECT
Choice of three projects on various domains
Motivation for Linear Support Vectors | SVM as Quadriatic Optimization
MACHINES (THEORY)
Problem | Non Linear SVM | Introduction to Kernel Functions

PROJECT 2
JOB READINESS - 8 HOURS
Project 2 - Default Modelling using SVM in R

Introduction to Decision Trees | Theory of Entropy & Information Gain |


Stopping Rules | Overfitting Problem | Cross Validations for Overfitting RESUME BUILDING Resume Building | Personal Branding | Tips and Resources | Interview
DECISION TREES
Problem | Prunning as a Solution for Overfitting | Ensemble Learning AND INTERVIEW PREP Skills
Notion | Concept of Bootstrap Aggregation | Concept of Random Forest
1:1 MOCK 1:1 Mock Interviews with Industry Veterans to Clear the Technical
Business Case : Intrusion Detection in IT Network | Meaning of Intrusion INTERVIEWS Round of Interviews to Give You Confidence to Face Real World Scenarios
BUSINESS CASE
in IT Cost of Intrusion | Meaning of Intrusion Detection System
Groups Present their Project Presentation in Front of Their Peers and
GROUP PROJECT industry Experts Evaluate the Solution (Refresher session for online
PRESENTATION
Project 3 - Network Intrusion Detection using Decision Tree & Ensemble batches)
PROJECT 3
Learning in R

GUEST LECTURE Industry View from Expert | Refresher on R | Open House


HANDS-ON PROJECTS
NETWORK INTRUSION
DEFAULT MODELLING DETECTION USING
DEFAULT MODELLING
PYTHON - 29.5 HOURS
USING LOGISTIC DECISION TREE &
USING SVM IN R
REGRESSION IN R ENSEMBLE LEARNING IN R

DEFAULT MODELLING INTRUSION DETECTION


What is Python? | Installing Anaconda | Understanding the Spyder CREDIT RISK
USING LOGISTIC USING DECISION TREES
ANALYTICS USING
PYTHON BASICS Integrated Development Environment (IDE) | Lists, tuples, dictionaries, REGRESSION IN
SVM IN PYTHON
& ENSEMBLE LEARNING
PYTHON IN PYTHON
variables

DATA STRUCTURES Intro to Numpy Arrays | Creating ndarrays | Indexing | Data Processing STORE DATA
IN PYTHON USED ANALYTICS IN SAS
using Arrays | File Input and Output | Getting Started with Pandas
FOR DATA ANALYSIS

PROJECT-BASED LEARNING:
You will spend approximately 50 hours of this program getting hands-on with industry
projects and build a portfolio of demonstrable work.

You might also like