CURRICULUM:
DATA SCIENCE PRODEGREE
PROJECT 4 Project 4 - Default Modelling using Logistic Regression in Python
INTRODUCTION - 14.5 HOURS
PROJECT 5 Project 5 - Credit Risk Analytics using SVM in Python
Intro to Program | Curriculum Overview | Learning Methodology | Project 6 - Intrusion Detection using Decision Trees & Ensemble
BATCH LAUNCH PROJECT 6
Guest Lecture Learning in Python
Data | Variables | Data Types | Measures of Central Tendency in Data
ALL ABOUT DATA | Understanding Skewness in Data | Measures of Dispersion | Data SAS - 49 HOURS
Distribution
INTRODUCTION TO What is SAS? | Key Features | Submitting a SAS Program | SAS Program
R - 76 HOURS SAS AND SAS
PROGRAMS
Syntax Examining SAS Datasets Accessing SAS Libraries | Sorting and
Grouping Reporting Data | Using SAS Formats
R Base Software | Understanding CRAN | RStudio The IDE | Basic Reading SAS Datasets | Reading Excel Data | Reading Raw Files |
R BASICS Building Blocks in R | Sequence of Numbers in R | Understanding READING AND
Reading Database Data | Creating Summary Reports | Combining
Vectors in R | Basic Operations Operators and Types MANIPULATING DATA
Datasets
Handling Missing Values in R | Subsetting Vectors in R | Matrices and DATA Writing Observations | Writing to Multiple Datasets | Accumulating Total
R FUNCTIONS Data Frames in R | Logical Statements in R | Lapply, sapply, vapply and TRANSFORMATIONS Creating Accumulating Total for a Group of Data | Data Transformations
tapply Functions
Introduction to Macro Variables | Automatic Macro Variables | User
LINEAR REGRESSION Covariance and Correlation | Multivariate Analysis | Assumptions of Defined Macro Variables | Macro Variable Reference | Defining and
MACROS
THEORY - R Linearity Hypothesis Testing | Limitations of Regression Calling Macros | Macro Parameters | Global and Local Symbol Table |
Creating Macro Variables in the Data Step
BUSINESS CASE: Business Case : Managing Credit Risk | Meaning of Credit Risk | Impact
MANAGING CREDIT of Credit Default | Sources of Data for Managing Risk | Understanding Introduction to SQL | How Does RDBMS Work? | SQL Procedures |
RISK Loss Given Default | Understanding Default Specifying Columns | Specifying Rows | Presenting Data | Summarizing
SQL Data | Writing Join Queries using SQL | Working with Subqueries,
Loss Given Default Linear Regression R | Extract Data in R | Univariate Indexes and Views | Set Operators | Creating Tables and Views using
Analysis of Data | Apply Data Transformations | Bivariate Analysis of Proc SQL
LOSS GIVEN DEFAULT Data | Identify Multicollinearity in Data | Treatment on Data | Identify
LINEAR REGRESSION Heteroscedasticity Discuss what could be the Reason for Heteroscedasticity PROJECT 7 Project 7 - Store Data Analytics in SAS
R | Modelling of Data Variable Significance Identification | Model
Significance Test | Predict using Testing Data Set | Validate the Model
Performance TABLEAU - 6 HOURS
LOGISTIC REGRESSION Reason for Logistic Regression | The Logistic Transform | Logistic
THEORY - R Regression Modelling | Model Optimisation | Understanding ROC Curve Introduction to Visualization | Working with Tableau | Visualization in
TABLEAU BASIC Depth Data Organisation | Advanced Visualization | Mapping |
PROJECT 1 Project 1 - Default Modelling using Logistic Regression in R Enterprise Dashboards Data Presentation
Introduction to SVM | Classification as a Hyper Plane Location Problem | INTRODUCTION TO
SUPPORT VECTOR THE GROUP PROJECT
Choice of three projects on various domains
Motivation for Linear Support Vectors | SVM as Quadriatic Optimization
MACHINES (THEORY)
Problem | Non Linear SVM | Introduction to Kernel Functions
PROJECT 2
JOB READINESS - 8 HOURS
Project 2 - Default Modelling using SVM in R
Introduction to Decision Trees | Theory of Entropy & Information Gain |
Stopping Rules | Overfitting Problem | Cross Validations for Overfitting RESUME BUILDING Resume Building | Personal Branding | Tips and Resources | Interview
DECISION TREES
Problem | Prunning as a Solution for Overfitting | Ensemble Learning AND INTERVIEW PREP Skills
Notion | Concept of Bootstrap Aggregation | Concept of Random Forest
1:1 MOCK 1:1 Mock Interviews with Industry Veterans to Clear the Technical
Business Case : Intrusion Detection in IT Network | Meaning of Intrusion INTERVIEWS Round of Interviews to Give You Confidence to Face Real World Scenarios
BUSINESS CASE
in IT Cost of Intrusion | Meaning of Intrusion Detection System
Groups Present their Project Presentation in Front of Their Peers and
GROUP PROJECT industry Experts Evaluate the Solution (Refresher session for online
PRESENTATION
Project 3 - Network Intrusion Detection using Decision Tree & Ensemble batches)
PROJECT 3
Learning in R
GUEST LECTURE Industry View from Expert | Refresher on R | Open House
HANDS-ON PROJECTS
NETWORK INTRUSION
DEFAULT MODELLING DETECTION USING
DEFAULT MODELLING
PYTHON - 29.5 HOURS
USING LOGISTIC DECISION TREE &
USING SVM IN R
REGRESSION IN R ENSEMBLE LEARNING IN R
DEFAULT MODELLING INTRUSION DETECTION
What is Python? | Installing Anaconda | Understanding the Spyder CREDIT RISK
USING LOGISTIC USING DECISION TREES
ANALYTICS USING
PYTHON BASICS Integrated Development Environment (IDE) | Lists, tuples, dictionaries, REGRESSION IN
SVM IN PYTHON
& ENSEMBLE LEARNING
PYTHON IN PYTHON
variables
DATA STRUCTURES Intro to Numpy Arrays | Creating ndarrays | Indexing | Data Processing STORE DATA
IN PYTHON USED ANALYTICS IN SAS
using Arrays | File Input and Output | Getting Started with Pandas
FOR DATA ANALYSIS
PROJECT-BASED LEARNING:
You will spend approximately 50 hours of this program getting hands-on with industry
projects and build a portfolio of demonstrable work.