
Data Science & Analyst Brochure

Code Galatta offers a comprehensive Data Scientist and Data Analyst training program that includes hands-on learning, experienced instructors, and lifetime mentorship. The curriculum covers essential topics such as Python, SQL, machine learning, and data visualization tools like Tableau and Power BI, along with practical projects. The course is designed for both beginners and those with prior experience, providing support and career opportunities in the field of AI technology.

DATA SCIENCE & ANALYST
PREMIUM SYLLABUS

For Registration www.codegalatta.com


Why choose Code Galatta?

Comprehensive Curriculum
Hands-On Learning
Experienced Instructors
Career Opportunities
Supportive Community
24/7 Support
Affordable Fee Structures
Live Sessions with Comfortable Timings
Recordings of Live Sessions
Lifetime Mentorship
Internship Opportunities

Whether you’re a beginner or have prior experience, our course will equip you
with the knowledge and practical skills needed to excel in the world of data
science and analytics!

At Code Galatta, we’re thrilled to introduce our highly anticipated DATA
SCIENTIST & DATA ANALYST training program. This course is designed to cater to
students like you who are eager to dive into the world of AI TECH.

What Will You Learn?

Python & SQL Basics: Build a strong programming foundation with Python
for data manipulation and SQL for data management.

Excel for Data Analysis: Learn how to efficiently use Excel, from basic data
manipulation to advanced formulas and pivot tables, to gain quick
insights from your data.

Tableau for Data Visualization: Create interactive dashboards and
visually compelling charts to uncover trends and patterns in your
datasets.

For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com


Power BI for Data Analysis: Develop data visualization and dashboard
skills to extract meaningful insights for informed decision-making.

Machine Learning & Deep Learning: Learn key ML algorithms, and explore
neural networks for complex tasks like image and text analysis.

Natural Language Processing (NLP): Acquire skills to work with text data,
from sentiment analysis to text generation.

Model Deployment & Version Control: Understand Git for version control
and deploy AI models in real-world applications.

Project Work: Throughout the course, you’ll work on hands-on projects
that will solidify your skills.

DATA SCIENCE & ANALYST PREMIUM SYLLABUS


Setup & Preparation

Introduction and Tool Setup

Python Installation: Setting up the Python environment for data
science tasks.

Power BI Installation: Overview and installation for data visualization.

MySQL Download: Database setup and connection basics.

Jupyter Notebook Essentials: Basic imports and setup for data
manipulation and visualization libraries.

Tableau Installation: Installing Tableau (or Tableau Public) for creating
interactive dashboards and visualizations.

Excel Setup: Ensuring you have Excel installed (often included with
Microsoft Office) or access to an equivalent spreadsheet tool for data
manipulation and quick analyses.

<aside>
Libraries: pandas, numpy, matplotlib
</aside>



Core AI Concepts

Introduction to Core AI Concepts


Data Science: Overview and applications.

Data Analysis: Basics of analyzing and interpreting data.

Machine Learning (ML): Introduction to predictive modeling.

Deep Learning: Basics of neural networks.

Natural Language Processing (NLP): Language-based AI.

Computer Vision: Understanding image data for AI.

Python Programming Essentials

Python Fundamentals for Data Science

Printing and Displaying Output

Comments in Python

Variables and Their Usage

Data Types in Python

Type Conversion

Taking User Input

Arithmetic, Comparison, & Logical Operators

Assignment Operators

String Operations:

Introduction to Strings

Common String Functions

Deleting & Updating Strings

Control Structures:

Conditional Statements (if, elif, else)

Nested Conditional Statements

Data Structures in Python

Understanding Python Data Structures

Lists:
Introduction and Methods
Tuples:
Basics and Use Cases
Dictionaries:
Introduction and Methods
Sets:
Introduction and Methods
Loops:
While Loop
For Loop
Control Statements: Breaking and Continuing in Loops
Functions:
Function Basics
Arguments and Keyword Arguments (Args and Kwargs)
<aside>
Basic Python Assignments and Problem Solving
</aside>

File Handling, Exceptions

Advanced Python Techniques

File Handling: Reading & Writing Files

Error Handling:

Basics of Exception Handling



NumPy for Numerical Computation
Introduction to NumPy for Data Science

Basics of NumPy and Array Creation

Array Dimensions and Operations:

Reshape, Transpose, Ravel

Indexing, Slicing, Inserting, and Deleting Elements

Random Number Generation

Stacking and Splitting Arrays: Horizontal and Vertical Stacking

Statistical Functions: Mean, Median, Standard Deviation, Dot Product

<aside> Practical Assignment on NumPy </aside>

Pandas and Exploratory Data Analysis (EDA)

Data Manipulation and EDA Techniques


Pandas Basics:

Overview of Pandas & Basic Concepts

Working with Series

Data Frame Operations:

Set Index, Sorting, and Filtering Data

Accessing Data with loc and iloc

Data Aggregation:

Group By and Pivot Table

Advanced Aggregation Techniques

Exploratory Data Analysis (EDA):

Deleting Rows and Columns

Handling Duplicate & Missing Values

Outlier Detection (IQR)



Data Visualization with Matplotlib and Seaborn:

Labeling, Subplots, Font Adjustments

Legends and Basic Plot Types: Bar Chart, Scatter Plot, Histogram,
Pie Chart
<aside> Assignment on Pandas and EDA </aside>
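The "Outlier Detection (IQR)" topic listed above can be illustrated with a minimal sketch. It uses only the Python standard library so that it runs without extra installs, although the course itself lists pandas for this step. The function name `iqr_outliers`, the multiplier `k=1.5`, and the sample fares are illustrative assumptions, not taken from the course material.

```python
import statistics


def iqr_outliers(values, k=1.5):
    """Return (lower_fence, upper_fence, outliers) using the IQR rule."""
    # quantiles(n=4) returns the three quartile cut points Q1, Q2, Q3.
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    # Points beyond k * IQR from the quartiles are flagged as outliers.
    lower = q1 - k * iqr
    upper = q3 + k * iqr
    outliers = [v for v in values if v < lower or v > upper]
    return lower, upper, outliers


# Made-up sample data: nine typical values and one extreme value.
ride_fares = [12, 14, 15, 15, 16, 17, 18, 19, 21, 95]
lower, upper, outliers = iqr_outliers(ride_fares)
print(f"fences: [{lower}, {upper}]  outliers: {outliers}")
```

A value is flagged only when it falls outside the fences, so the choice of `k` controls how strict the rule is; 1.5 is a common convention rather than a fixed requirement.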

SQL Basic
Basic SQL Module
Introduction to Database

What is a Database?
Types of Databases
SQL vs NoSQL

Database Schema
Three-Layer Abstraction: External, Conceptual, Internal
Importance of Schema Design

ACID Properties

Atomicity, Consistency, Isolation, Durability

Data Definition Language (DDL)

CREATE Command
ALTER, DROP & TRUNCATE Commands

SQL Constraints

Primary Key, Foreign Key
Unique, Check & Default Constraints

Normalization

1NF, 2NF, 3NF, and BCNF
Importance of Normalization



SQL Commands

DCL, DML, DDL, TCL

SQL Query Execution

SQL Keyword Order of Execution
FROM Clause
WHERE Clause
GROUP BY & HAVING Clauses
SELECT and ORDER BY Clauses
TOP Keyword Usage

SQL Intermediate
Intermediate SQL Module

Subqueries
Types of Subqueries
Use Cases in SQL Queries
SQL Joins
Inner Join, Left Join, Right Join, Full Join
Cross Join and Self Join

CTE, Stored Procedures, Data Modeling

SQL String Transformations: String Functions (e.g., UPPER, LOWER, TRIM)

Date and Time Manipulation: Date Functions (e.g., NOW, DATEDIFF)

Case Statements: Conditional Logic in SQL

Common Table Expressions (CTE): Recursive and Non-Recursive CTEs

Stored Procedures:

Creating and Using Stored Procedures

Parameters and Return Types

Data Modeling

Entity-Relationship Diagrams

Keys and Relationships

SQL Reverse Engineering : Understanding Existing Databases

Transactions: Transaction Control (COMMIT, ROLLBACK, SAVEPOINT)



Views

Creating and Using Views
Updatable Views

SQL Advanced
Advanced SQL Features

Lead & Lag Functions: Contextual Use Cases


Swapping Records: Implementing Swaps in SQL Queries
Pivot and Unpivot: Transforming Rows to Columns and Vice Versa
Ratio to Report Function: Calculating Ratios in SQL
SQL for Recommendations : Using SQL in Recommendation Systems

Window Functions

Introduction to Window Functions: Basics and Syntax


Window vs. Group By: Differences and Use Cases
Partitioning Data: Row Number with PARTITION BY
Ranking Functions: RANK, DENSE_RANK, NTILE
Aggregate Window Functions: Calculating Sums, Averages, etc.
Rolling Windows: Rolling Sum, Rolling Average
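
As a rough illustration of the "Window vs. Group By" and "Rolling Windows" topics above, the sketch below runs both kinds of query against an in-memory SQLite database from Python. The `sales` table, its columns, and its rows are made up for the example, and it assumes the SQLite bundled with Python is version 3.25 or newer, which is when window functions became available. The course itself sets up MySQL, where syntax details may differ.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, month INTEGER, amount INTEGER)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [("North", 1, 100), ("North", 2, 150), ("North", 3, 120),
     ("South", 1, 80), ("South", 2, 90)],
)

# GROUP BY collapses each region into a single summary row.
grouped = conn.execute(
    "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
).fetchall()

# A window function keeps every row and adds per-partition columns:
# a row number within each region and a two-month rolling sum.
windowed = conn.execute(
    """
    SELECT region, month, amount,
           ROW_NUMBER() OVER (PARTITION BY region ORDER BY month) AS row_num,
           SUM(amount) OVER (PARTITION BY region ORDER BY month
                             ROWS BETWEEN 1 PRECEDING AND CURRENT ROW) AS rolling_sum
    FROM sales
    ORDER BY region, month
    """
).fetchall()

print(grouped)
for row in windowed:
    print(row)
conn.close()
```

The grouped query returns one row per region, while the windowed query returns all five input rows, which is the practical difference between the two approaches.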

MACHINE LEARNING
Feature Selection
Introduction to Machine Learning : Basics and Terminology
Feature Selection Techniques: Methods for Feature Reduction
Handling Missing Values : Dropping and Imputing
Variance Threshold: Removing Low Variance Features
fit, transform, fit_transform, predict: Understanding Scikit-Learn’s
Transformations
ANOVA and Recursive Feature Elimination (RFE): Techniques for Feature
Ranking
Model-Based Feature Selection: Using Models to Determine Features
Variance Inflation Factor (VIF): Multicollinearity Analysis



Feature Engineering
Creating New Columns: Feature Construction
Handling Missing Values: fillna and dropna Methods
Outlier Detection and Removal: Standard Deviation, Z-Score, and
Pandas Techniques
Data Preprocessing: Scaling, Normalization, Encoding
Binning and Encoding: Converting Continuous to Categorical Data
Gaussian Transformation: Converting Data to Normal Distribution
Datetime Module: Handling Date-Time Data

Linear Regression
Introduction to Linear Regression : Algebraic and Geometric Intuition
Optimization Techniques: Vector Multiplication, Transpose, and Error
Calculations
Loss Function and Gradient Descent: Error Minimization Techniques
Assumptions in Linear Regression: Data Preparation and EDA
Fitting the Model and Evaluation: Model Building, Metrics, and Residual
Analysis
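
The "Loss Function and Gradient Descent" topic above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the function name `fit_line`, the learning rate, the epoch count, and the toy data are all hypothetical, and the course may well teach this with NumPy or scikit-learn instead.

```python
def fit_line(xs, ys, learning_rate=0.05, epochs=2000):
    """Fit y = w * x + b by batch gradient descent on mean squared error."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        # Prediction error of the current line on every point.
        errors = [(w * x + b) - y for x, y in zip(xs, ys)]
        # Gradients of MSE = (1/n) * sum(error^2) with respect to w and b.
        grad_w = (2 / n) * sum(e * x for e, x in zip(errors, xs))
        grad_b = (2 / n) * sum(errors)
        # Step against the gradient to reduce the loss.
        w -= learning_rate * grad_w
        b -= learning_rate * grad_b
    return w, b


# Toy data lying exactly on the line y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]
w, b = fit_line(xs, ys)
print(f"slope={w:.4f}, intercept={b:.4f}")
```

With a learning rate that is too large the updates would diverge instead of converging, which is why the step size is usually tuned or the inputs are scaled first.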

Logistic Regression, PCA, KNN

Logistic Regression: Intuition, Model, and Metrics


Normalization & Standardization: Data Scaling Techniques
PCA: Fundamentals, Implementation, and Visualization
K-Nearest Neighbors (KNN): Theory and Coding

Decision Tree, Naive Bayes

Decision Tree: Theory, Visualization, and Pruning


Naive Bayes: Text Analytics, Bernoulli, Multinomial, Gaussian
Evaluation Metrics: Confusion Matrix, Sensitivity, Specificity
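
To make the "Confusion Matrix, Sensitivity, Specificity" item above concrete, here is a minimal sketch in plain Python. The helper name `confusion_counts` and the label lists are invented for illustration; in practice a library such as scikit-learn would typically be used.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary classifier."""
    tp = fp = tn = fn = 0
    for actual, predicted in zip(y_true, y_pred):
        if predicted == positive:
            if actual == positive:
                tp += 1  # predicted positive, actually positive
            else:
                fp += 1  # predicted positive, actually negative
        else:
            if actual == positive:
                fn += 1  # predicted negative, actually positive
            else:
                tn += 1  # predicted negative, actually negative
    return tp, fp, tn, fn


# Made-up labels: 4 actual positives and 6 actual negatives.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 1, 1]

tp, fp, tn, fn = confusion_counts(y_true, y_pred)
sensitivity = tp / (tp + fn)  # share of actual positives that were found
specificity = tn / (tn + fp)  # share of actual negatives that were found
print(tp, fp, tn, fn, sensitivity, round(specificity, 3))
```

Sensitivity and specificity look at the two classes separately, so they remain informative even when one class is much rarer than the other.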



Bagging and Boosting

Ensemble Techniques: Bagging, Random Forest, Extra Trees


Boosting Techniques: AdaBoost, Gradient Boosting, XGBoost, CatBoost
Hyperparameter Tuning: Using GridSearchCV & RandomizedSearchCV

Clustering, Support Vector Machine

Clustering (Unsupervised Learning): K-means, Hierarchical, DBSCAN,
Gaussian Mixture
Support Vector Machine (SVM): Intuition and Hyperparameter Tuning

Hypothesis Testing, Gradient Descent

Hypothesis Testing: Null & Alternative Hypothesis, Errors, and Tailed
Tests
ANOVA & Chi-Square Test: Applications in Data Analysis
Gradient Descent: Concept and Applications in Regression

Linear Regression Deployment

Overview of Deployment
Preparing the Model for Deployment
Deployment Techniques
Evaluating Deployed Models

Deep Learning - Neural Network Fundamentals

Introduction to Deep Learning

What is Deep Learning?
Applications of Deep Learning

Artificial Neural Networks (ANN)

Structure of an ANN
Forward and Backward Propagation

Perceptron

Introduction to Perceptron
Weights and Biases in Perceptron
Animating the Perceptron



Multilayer Perceptron (MLP)

Structure and Working of MLP
Importance of Layers in MLP

Activation Functions

Sigmoid Function
Additional Activation Functions (ReLU, TANH, Softmax)

Loss Functions

Regression Loss Functions
Classification Loss Functions

Keras Framework

Overview of Keras
Key Keras Modules

Problem Solving with Keras

Binary Classification
Multi-Class Classification
Regression Problems

Optimizers

Types of Optimizers
Evaluating and Comparing Optimizers

Introduction to Convolutional Neural Networks (CNN)

Basics of CNN
Applications of CNN

Deep Learning - Advanced Concepts

Tensor Fundamentals

Understanding Tensors
Tensor Operations and Examples

PyTorch Fundamentals

Introduction to PyTorch
Practical Tensor Solutions with PyTorch



Building Neural Networks

Core Building Blocks
Hands-on Neural Network Creation

Advanced Activation Functions

Advanced Use Cases of Activation Functions
Practical Implementation

Training a Neural Network

Steps in Training
Practical Training Solutions

Loss Functions

Advanced Loss Function Use Cases
Loss Implementation Techniques

Optimizers

Optimizer Implementation and Evaluation

Regularization Techniques

Types of Regularization
Hands-on Regularization

Object-Oriented Programming (OOP) in Python for Deep Learning

OOP Concepts in Python
OOP-Based Solutions for Deep Learning

Deep Learning - Case Study

Human Activity Recognition Project

Dataset Overview
Model Development
Evaluation and Optimization



Natural Language Processing (NLP) - Basics

Text Preprocessing

Tokenization
Lemmatization
Stemming
Stop Words Removal

Feature Extraction

Bag of Words (BoW)
Count Vectorizer
TF-IDF Vectorization
N-Grams

POS Tagging & Word Embeddings

Parts of Speech (POS) Tagging
Introduction to Gensim and Word2Vec

Sentiment Analysis

Overview of Sentiment Analysis
Applications and Use Cases

Activation Functions for NLP

Sigmoid Function
TANH, ReLU, and Softmax Functions

Recurrent Neural Networks (RNN)

Introduction to RNNs
Types of RNNs (Simple, LSTM, GRU)
Limitations of RNNs

Natural Language Processing (NLP) - Advanced Techniques

Deep Learning Models in NLP

LSTM (Long Short-Term Memory)
GRU (Gated Recurrent Unit)
Bi-LSTM (Bidirectional LSTM)



Transformers in NLP

Introduction to Transformers
Tokenization with BERT
Applications of BERT (Bidirectional Encoder Representations from
Transformers)

Using spaCy for NLP

Overview of spaCy
Implementing NLP Tasks with spaCy

Natural Language Processing (NLP) - Project

Medical Chatbot Assistance Project

Dataset Overview
Model Development and Deployment

Excel

Getting Started & Basic Data Handling

Basic Excel Interface


Navigating the Ribbon and Quick Access Toolbar
Understanding rows, columns, cells, worksheets
Saving and organizing files
Data Types in Excel
Text, numbers, dates
How Excel interprets each type
Practical Data Cleaning Basics
Removing extra spaces, text-to-columns, merging/splitting cells
Handling blank cells, using find/replace
Relative vs. Absolute Cell References
Difference between A1 vs. $A$1
Common use cases



Cell Errors
Recognizing #VALUE!, #REF!, #DIV/0!
Quick tips for fixing and interpreting errors

Core Formulas & Essential Functions

Calculations & Order of Operations


Basic arithmetic, operator precedence, parentheses
Fundamental Functions
SUM, AVERAGE, MIN, MAX, COUNT
IF Function
Single-condition logic (e.g., pass/fail)
IFERROR Function
Handling errors gracefully
Formatting the Spreadsheet
Font styles, alignment, number formats, cell styles
Conditional Formatting
Highlighting cells
Color scales and icon sets

Organizing & Analyzing Data

Sorting & Custom Sort


Single-level and multi-level sorts
Creating custom sort lists (e.g., “High, Medium, Low”)
Filters & Removing Duplicates
Using AutoFilter on multiple columns
Identifying and removing duplicate rows
Data Validation
Creating drop-down lists
Preventing invalid entries



Basic Pivot Tables
Creating a pivot table
Placing fields in rows, columns, values
Pivot Table Modifications
Grouping, rearranging fields, formatting
Adding simple calculated fields if needed

Essential Lookup & Intermediate Functions

VLOOKUP
Basic vertical lookup to pull data from another table
SUMIF & COUNTIF
Summations and counts based on single criteria
Multiple IF Conditions
Using AND/OR inside IF
Brief introduction to nested IF
Text Functions
LEFT, RIGHT, MID, LEN
CONCAT (or CONCATENATE)
Formula Auditing
Checking cell precedents and dependents
Evaluating formulas step by step

Extended Pivot & Basic What-If Analysis

Advanced Pivot Table Features


Using slicers, creating pivot charts
Refreshing data, best practices for structured data
Subtotal (Optional)
Quick summaries for data lists; often replaced by pivot tables
Goal Seek: Setting a target value to find the required input
Scenario Manager (Optional): Comparing different input sets

Case Study: Uber Data Analysis



Power BI - Fundamentals

Fundamentals of Power BI

Introduction to Power BI
Overview of the Power BI ecosystem
Installation and setup

Basic Visualizations

Bar Chart
Creating a basic bar chart
Formatting options
Pie Chart, Donut Chart, Funnel Chart
When to use each chart type
Formatting and best practices
Ribbon Chart, Line Chart, Area Chart
Comparing trends over time
Area vs. line chart usage
Combo Chart, Scatter Chart, Waterfall Chart, Treemap Chart
Combining multiple measures (Combo)
Analyzing data with two axes (Scatter)
Visualizing data changes (Waterfall)
Hierarchical data representation (Treemap)
Maps, Filled Maps
Geo-locating data
Using map layers and data categories
Tables, Conditional Formatting, Matrix
Displaying tabular data
Highlighting data with conditional formatting
Using Matrix for pivot-like views
Gauge Chart, Cards, Multi-Row Cards
Monitoring single metrics
Showing key performance indicators (KPIs)



Filter, Drill
Filtering at various levels (visual, page, report)
Drilling up/down in hierarchies
Slicer (Text, Number, Date)
Interactive data filtering
Configuration and formatting of slicers

Reports & Dashboards

Creating a Report in Power BI


Page layout and visual arrangement
Publishing in Power BI
Sharing reports via the Power BI Service
Dashboard in Power BI
Pinning visuals and setting up real-time tiles
How to Refresh Data in Power BI
Scheduled refresh
Gateway configuration

Power Query

Power Query Intro : The Query Editor and data ingestion


Adding/Removing Rows: Filtering unwanted data, handling null values
Text Add Column Transform: Splitting, extracting, and replacing text
Number Function: Arithmetic operations, rounding, percentages
Date Add Column Transform:
Extracting year, month, day
Creating time-based columns
Appending Sheets
Combining multiple data sources
Handling schema differences
Merging Sheets:
Joining data tables
Merging vs. appending scenarios



Conditional Column
Creating new fields based on conditions
Fill, GroupBy, Transpose
Forward/backward fill, aggregations, reshaping data
Keep and Remove Columns/Rows
Selecting and removing data before loading

Data Modeling Concepts

Normalization (1NF, 2NF, 3NF)


Structuring data to reduce redundancy
De-Normalization, OLTP vs. OLAP
Balancing performance with structure
Transactional vs. analytical systems

DAX Formulas

DAX Intro, Data Types: Basic syntax, row vs. filter context
First DAX: Simple measures, fundamental calculations
Date Functions, More on Date Functions
Time intelligence (YTD, MTD, etc.)
Text Functions: Concatenation, string extraction, formatting
Logical Functions: IF, SWITCH, handling null values
Calculated Measure, More on Calculated Measure
Reusable expressions, performance considerations
Additional DAX Functions
Statistical, math, financial functions
Creating Your Own Query
Writing custom DAX queries
Best practices for performance



Case Studies

Life Expectancy
End-to-end implementation (data sourcing, modeling, visualization)
Applying Power BI features in a real-world scenario

Tableau
Introduction to Tableau

Overview of Tableau
Purpose and use cases
Installing or accessing Tableau
Tableau User Interface
Navigating the workspace (Data pane, Shelves, Marks card)
Distinguishing Dimensions vs. Measures

Basic Charting & Data Handling

Working with Data


Connecting to common data sources
Default properties of fields
Common Charts
Bar Charts, Line Charts
Combined Axis Charts, Area Charts, Dual Axis Charts
Basic plot types and when to use them

Filters & Simple Analytics

Filter Types
Dimension Filter, Measure Filter
Visual Filter, Interactive Filter
Data Source Filter, Context Filter
Sets, Groups, and Parameters
Creating sets to segment data
Grouping members for simplified categories
Using parameters for user-driven calculations



Calculated Fields & Functions

Calculated Fields
Building basic expressions
Overview of Calculation Syntax
Date & Text Functions
Working with dates (e.g., YEAR(), MONTH())
Common text functions (e.g., LEFT(), MID(), TRIM())
Bins and Histograms: Segmenting measures into intervals
Reference Lines & Trend Lines:
Adding statistical insights to visualizations
Quick Table Calculations
Built-in running totals, moving averages, etc.

Dashboards

Introduction to Dashboards
Creating a dashboard layout
Adding sheets and objects
Applying filters across multiple views
Actions
Setting up basic interactivity (filter actions, highlight actions)

Data Integration

Union & Joins


Combining data within the same source
Joining multiple tables
Data Blending
Merging data from different sources (light introduction)

Case Study: Loan Application Analysis



Capstone Project

Project : HR Employee Data Analysis (POWER BI)

Project : Life Expectancy (POWER BI)

Project : Linear Regression Deployment (MACHINE LEARNING)

Project : Chat Bot (MACHINE LEARNING, NLP)

Project : Human Activity Recognition (DEEP LEARNING)

Project : BERT Email Classification (DEEP LEARNING)

Project : Assistant like Google Assistant (NLP)

Project : Sentiment Analysis (NLP)

Placement Preparation
Get ready for your dream job with resume building, LinkedIn optimization,
and mock interviews.

1. Resume Preparation Session

2. LinkedIn Mastery

3. Naukri Mastery

4. Interview Questions

5. Session with Industry Mentor
