DATA SCIENCE &
ANALYST
PREMIUM SYLLABUS
For Registration www.codegalatta.com
Why choose Code Galatta?
Comprehensive Curriculum
Hands-On Learning
Experienced Instructors
Career Opportunities
Supportive Community
24/7 Support
Affordable Fee Structures
Live Sessions with Comfortable Timings
Recordings of Live Sessions
Lifetime Mentorship
Internship Opportunities
Whether you’re a beginner or have prior experience, our course will equip you
with the knowledge and practical skills needed to excel in the world of STACK
development !
At Code Galatta, we’re thrilled to introduce our highly anticipated DATA SCI-
ENTIST & DATA ANALYST training program. This course is designed to cater to
students like you who are eager to dive into the world of AI TECH.
What Will You Learn?
Python & SQL Basics: Build a strong programming foundation with Python
for data manipulation and SQL for data management.
Excel for Data Analysis: Learn how to efficiently use Excel, from basic data
manipulation to advanced formulas and pivot tables, to gain quick
insights from your data.
Tableau for Data Visualization: Create interactive dashboards and
visually compelling charts to uncover trends and patterns in your
datasets.
For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com
Power BI for Data Analysis: Develop data visualization and dashboard
skills to extract meaningful insights for informed decision-making.
Machine Learning & Deep Learning: Learn key ML algorithms, and explore
neural networks for complex tasks like image and text analysis.
Natural Language Processing (NLP): Acquire skills to work with text data,
from sentiment analysis to text generation.
Model Deployment & Version Control: Understand Git for version control
and deploy AI models in real-world applications.
Project Work: Throughout the course, you’ll work on hands-on projects
that will solidify your skills.
DATA SCIENCE & ANALYST PREMIUM SYLLABUS
Setup & Preparation
Introduction and Tool Setup
Python Installation: Setting up the Python environment for data
science tasks.
Power BI Installation: Overview and installation for data visualization.
MySQL Download: Database setup and connection basics.
Jupyter Notebook Essentials: Basic imports and setup for data
manipulation and visualization libraries.
Tableau Installation: Installing Tableau (or Tableau Public) for creating
interactive dashboards and visualizations.
Excel Setup: Ensuring you have Excel installed (often included with
Microsoft Office) or access to an equivalent spreadsheet tool for data
manipulation and quick analyses.
<aside>
Libraries: pandas, numpy, matplotlib
</aside>
For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com
Static Website Design
Introduction to Core AI Concepts
Data Science: Overview and applications.
Data Analysis: Basics of analyzing and interpreting data.
Machine Learning (ML): Introduction to predictive modeling.
Deep Learning: Basics of neural networks.
Natural Language Processing (NLP): Language-based AI.
Computer Vision: Understanding image data for AI.
Python Programming Essentials
Python Fundamentals for Data Science
Common String Functions
Deleting & Updating Strings
Control Structures:
Conditional Statements (if, elif, else)
Nested Conditional Statements
Data Structures in Python
Understanding Python Data Structures
Lists:
Introduction and Methods
Tuples:
Basics and Use Cases
Dictionaries:
Introduction and Methods
Sets:
Introduction and Methods
For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com
Data Structures in Python
Understanding Python Data Structures
Loops:
While Loop
For Loop
Control Statements: Breaking and Continuing in Loops
Functions:
Function Basics
Arguments and Keyword Arguments (Args and Kwargs)
<aside>
Basic Python Assignments and Problem Solving
</aside>
File Handling, Exceptions
Advanced Python Techniques
File Handling: Reading & Writing Files
Error Handling:
Basics of Exception Handling
Printing and Displaying Output
Comments in Python
Variables and Their Usage
Data Types in Python
Type Conversion
Taking User Input
Arithmetic, Comparison,
& Logical Operators
Assignment Operators
String Operations:
Introduction to Strings
For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com
NumPy for Numerical Computation
Introduction to NumPy for Data Science
Basics of NumPy and Array Creation
Array Dimensions and Operations:
Reshape, Transpose, Ravel
Indexing, Slicing, Inserting, and Deleting Elements
Random Number Generation
Stacking and Splitting Arrays: Horizontal and Vertical Stacking
Statistical Functions: Mean, Median, Standard Deviation, Dot Product
<aside> Practical Assignment on NumPy </aside>
Pandas and Exploratory Data Analysis (EDA)
Data Manipulation and EDA Techniques
Pandas Basics:
Overview of Pandas & Basic Concepts
Working with Series
Data Frame Operations:
Set Index, Sorting, and Filtering Data
Accessing Data with loc and iloc
Data Aggregation:
Group By and Pivot Table
Advanced Aggregation Techniques
Exploratory Data Analysis (EDA):
Deleting Rows and Columns
Handling Duplicate &Missing Values
Outlier Detection (IQR)
For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com
Data Visualization with Matplotlib and Seaborn:
Labeling, Subplots, Font Adjustments
Legends and Basic Plot Types: Bar Chart, Scatter Plot, Histogram,
Pie Chart
<aside> Assignment on Pandas and EDA </aside>
SQL Basic
Basic SQL Module
Introduction to Database
What is a Database? Types of Databases SQL vs NoSQL
Database Schema
Three-Layer Abstraction: External, Conceptual, Internal
Importance of Schema Design
ACID Properties
Atomicity Consistency
Isolation Durability
Data Definition Language (DDL)
CREATE Command
ALTER, DROP& TRUNCATE Commands
SQL Constraints
Primary Key, Foreign Key
Unique, Check & Default Constraints
Normalization
1NF, 2NF, 3NF, and BCNF
Importance of Normalization
For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com
SQL Commands
DCL, DML, DDL, TCL
SQL Query Execution
SQL Keyword Order of Execution FROM Clause
WHERE Clause GROUP BY & HAVING Clauses
SELECT and ORDER BY Clauses TOP Keyword Usage
SQL Intermediate
Intermediate SQL Module
Subqueries
Types of Subqueries Use Cases in SQL Queries.
SQL Joins
Inner Join, Left Join, Right Join, Full Join Cross Join and Self Join
CTE, Stored Procedures, Data Modeling
SQL String Transformations: String Functions (e.g., UPPER, LOWER, TRIM)
Date and Time Manipulation: Date Functions (e.g., NOW, DATEDIFF)
Case Statements: Conditional Logic in SQL
Common Table Expressions (CTE): Recursive and Non-Recursive CTEs
Stored Procedures:
Creating and Using Stored Procedures
Parameters and Return Types
Data Modeling
Entity-Relationship Diagrams
Keys and Relationships
SQL Reverse Engineering : Understanding Existing Databases
Transactions: Transaction Control (COMMIT, ROLLBACK, SAVEPOINT)
For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com
Views
Creating and Using Views Updatable Views
SQL Advanced
Advanced SQL Features
Lead & Lag Functions: Contextual Use Cases
Swapping Records: Implementing Swaps in SQL Queries
Pivot and Unpivot: Transforming Rows to Columns and Vice Versa
Ratio to Report Function: Calculating Ratios in SQL
SQL for Recommendations : Using SQL in Recommendation Systems
Window Functions
Introduction to Window Functions: Basics and Syntax
Window vs. Group By: Differences and Use Cases
Partitioning Data: Row Number with PARTITION BY
Ranking Functions: RANK, DENSE_RANK, NTILE
Aggregate Window Functions: Calculating Sums, Averages, etc.
Rolling Windows: Rolling Sum, Rolling Average
MACHINE LEARNING
Feature Selection
Introduction to Machine Learning : Basics and Terminology
Feature Selection Techniques: Methods for Feature Reduction
Handling Missing Values : Dropping and Imputing
Variance Threshold: Removing Low Variance Features
fit, transform, fit_transform, Predict: Understanding Scikit-Learn’s
Transformations
ANOVA and Recursive Feature Elimination (RFE): Techniques for Feature
Ranking
Model-Based Feature Selection: Using Models to Determine Features
Variance Inflation Factor (VIF): Multicollinearity Analysis
For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com
Feature Engineering
Creating New Columns: Feature Construction
Handling Missing Values: fillna and dropna Methods
Outlier Detection and Removal: Standard Deviation, Z-Score, and
Pandas Techniques
Data Preprocessing: Scaling, Normalization, Encoding
Binning and Encoding: Converting Continuous to Categorical Data
Gaussian Transformation: Converting Data to Normal Distribution
Datetime Module: Handling Date-Time Data
Linear Regression
Introduction to Linear Regression : Algebraic and Geometric Intuition
Optimization Techniques: Vector Multiplication, Transpose, and Error
Calculations
Loss Function and Gradient Descent: Error Minimization Techniques
Assumptions in Linear Regression: Data Preparation and EDA
Fitting the Model and Evaluation: Model Building, Metrics, and Residual
Analysis
Logistic Regression, PCA, KNN
Logistic Regression: Intuition, Model, and Metrics
Normalization & Standardization: Data Scaling Techniques
PCA: Fundamentals, Implementation, and Visualization
K-Nearest Neighbors (KNN): Theory and Coding
Decision Tree, Naive Bayes
Decision Tree: Theory, Visualization, and Pruning
Naive Bayes: Text Analytics, Bernoulli, Multinomial, Gaussian
Evaluation Metrics: Confusion Matrix, Sensitivity, Specificity
For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com
Bagging and Boosting
Ensemble Techniques: Bagging, Random Forest, Extra Trees
Boosting Techniques: AdaBoost, Gradient Boosting, XGBoost, CatBoost
Hyperparameter Tuning: Using GridSearchCV & RandomizedSearchCV
Clustering, Support Vector Machine
Clustering (Unsupervised Learning): K-means, Hierarchical, DBSCAN,
Gaussian Mixture
Support Vector Machine (SVM): Intuition and Hyperparameter Tuning
Hypothesis Testing, Gradient Descent
Hypothesis Testing: Null & Alternative Hypothesis, Errors, and Tailed
Tests
ANOVA & Chi-Square Test: Applications in Data Analysis
Gradient Descent: Concept and Applications in Regression
Linear Regression Deployment
Overview of Deployment Preparing the Model for Deployment
Deployment Techniques Evaluating Deployed Models
Deep Learning - Neural Network Fundamentals
Introduction to Deep Learning
What is Deep Learning? Applications of Deep Learning
Artificial Neural Networks (ANN)
Structure of an ANN Forward and Backward Propagation
Perceptron
Introduction to Perceptron Weights and Biases in Perceptron
Animating the Perceptron
For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com
Multilayer Perceptron (MLP)
Structure and Working of MLP Importance of Layers in MLP
Activation Functions
Sigmoid Function
Additional Activation Functions (ReLU, TANH, Softmax)
Loss Functions
Regression Loss Functions Classification Loss Functions
Keras Framework
Overview of Keras Key Keras Modules
Problem Solving with Keras
Binary Classification Multi-Class Classification
Regression Problems
Optimizers
Types of Optimizers Evaluating and Comparing Optimizers
Introduction to Convolutional Neural Networks (CNN)
Basics of CNN Applications of CNN
Deep Learning - Advanced Concepts
Tensor Fundamentals
Understanding Tensors Tensor Operations and Examples
PyTorch Fundamentals
Introduction to PyTorch Practical Tensor Solutions with PyTorch
For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com
Building Neural Networks
Core Building Blocks Hands-on Neural Network Creation
Advanced Activation Functions
Advanced Use Cases of Activation Functions
Practical Implementation
Training a Neural Network
Steps in Training Practical Training Solutions
Loss Functions
Advanced Loss Function Use Cases Loss Implementation Techniques
Optimizers
Optimizer Implementation and Evaluation
Regularization Techniques
Types of Regularization Hands-on Regularization
Object-Oriented Programming (OOP) in Python for Deep Learning
OOP Concepts in Python OOP-Based Solutions for Deep Learning
Deep Learning - Case Study
Human Activity Recognition Project
Dataset Overview Model Development
Evaluation and Optimization
For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com
Natural Language Processing (NLP) - Basics
Text Preprocessing
Tokenization Lemmatization
Stemming Stop Words Removal
Feature Extraction
Bag of Words (BoW) Count Vectorizer
TF-IDF Vectorization N-Grams
POS Tagging & Word Embeddings
Parts of Speech (POS) Tagging
Introduction to Gensim and Word2Vec
Sentiment Analysis
Overview of Sentiment Analysis Applications and Use Cases
Activation Functions for NLP
Sigmoid Function TANH, ReLU, and Softmax Functions
Recurrent Neural Networks (RNN)
Introduction to RNNs Types of RNNs (Simple, LSTM, GRU)
Limitations of RNNs
Natural Language Processing (NLP) - Advanced Techniques
Deep Learning Models in NLP
LSTM (Long Short-Term Memory) GRU (Gated Recurrent Unit)
Bi-LSTM (Bidirectional LSTM)
For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com
Transformers in NLP
Introduction to Transformers Tokenization with BERT
Applications of BERT (Bidirectional Encoder Representations from
Transformers)
Using SpaCy for NLP
Overview of SpaCy Implementing NLP Tasks with SpaCy
Natural Language Processing (NLP) - Project
Medical Chatbot Assistance Project
Dataset Overview Model Development and Deployment
Excel
Getting Started & Basic Data Handling
Basic Excel Interface
Navigating the Ribbon and Quick Access Toolbar
Understanding rows, columns, cells, worksheets
Saving and organizing files
Data Types in Excel
Text, numbers, dates
How Excel interprets each type
Practical Data Cleaning Basics
Removing extra spaces, text-to-columns, merging/splitting cells
Handling blank cells, using find/replace
Relative vs. Absolute Cell References
Difference between A1 vs. $A$1
Common use cases
For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com
Cell Errors
Recognizing #VALUE!, #REF!, #DIV/0!
Quick tips for fixing and interpreting errors
Core Formulas & Essential Functions
Calculations & Order of Operations
Basic arithmetic, operator precedence, parentheses
Fundamental Functions
SUM, AVERAGE, MIN, MAX, COUNT
IF Function
Single-condition logic (e.g., pass/fail)
IFERROR Function
Handling errors gracefully
Formatting the Spreadsheet
Font styles, alignment, number formats, cell styles
Conditional Formatting
Highlighting cells
Color scales and icon sets
Organizing & Analyzing Data
Sorting & Custom Sort
Single-level and multi-level sorts
Creating custom sort lists (e.g., “High, Medium, Low”)
Filters & Removing Duplicates
Using AutoFilter on multiple columns
Identifying and removing duplicate rows
Data Validation
Creating drop-down lists Preventing invalid entries
For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com
Basic Pivot Tables
Creating a pivot table Placing fields in rows, columns, values
Pivot Table Modifications
Grouping, rearranging fields, formatting
Adding simple calculated fields if needed
Essential Lookup & Intermediate Functions
VLOOKUP
Basic vertical lookup to pull data from another table
SUMIF & COUNTIF
Summations and counts based on single criteria
Multiple IF Conditions
Using AND/OR inside IF Brief introduction to nested IF
Text Functions
LEFT, RIGHT, MID, LEN CONCAT (or CONCATENATE)
Formula Auditing
Checking cell precedents and dependents
Evaluating formulas step by step
Extended Pivot & Basic What-If Analysis
Advanced Pivot Table Features
Using slicers, creating pivot charts
Refreshing data, best practices for structured data
SubTotal (Optional)
Quick summaries for data lists Often replaced by pivot tables
Goal Seek: Setting a target value to find the required input
Scenario Manager (Optional): Comparing different input sets
Case Study: Uber Data Analysis
For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com
Power BI - Fundamentals
Fundamentals of Power BI
Introduction to Power BI
Overview of the Power BI ecosystem
Installation and setup
Basic Visualizations
Bar Chart
Creating a basic bar chart Formatting options
Pie Chart, Donut Chart, Funnel Chart
When to use each chart type Formatting and best practices
Ribbon Chart, Line Chart, Area Chart
Comparing trends over time Area vs. line chart usage
Combo Chart, Scatter Chart, Waterfall Chart, Treemap Chart
Combining multiple measures (Combo)
Analyzing data with two axes (Scatter)
Visualizing data changes (Waterfall)
Hierarchical data representation (Treemap)
Maps, Filled Maps
Geo-locating data Using map layers and data categories
Tables, Conditional Formatting, Matrix
Displaying tabular data
Highlighting data with conditional formatting
Using Matrix for pivot-like views
Gauge Chart, Cards, Multi-Row Cards
Monitoring single metrics
Showing key performance indicators (KPIs)
For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com
Filter, Drill
Filtering at various levels (visual, page, report)
Drilling up/down in hierarchies
Slicer (Text, Number, Date)
Interactive data filtering
Configuration and formatting of slicers
Reports & Dashboards
Creating a Report in Power BI
Page layout and visual arrangement
Publishing in Power BI
Sharing reports via the Power BI Service
Dashboard in Power BI
Pinning visuals and setting up real-time tiles
How to Refresh Data in Power BI
Scheduled refresh Gateway configuration
Power Query
Power Query Intro : The Query Editor and data ingestion
Adding/Removing Rows: Filtering unwanted data, handling null values
Text Add Column Transform: Splitting, extracting, and replacing text
Number Function: Arithmetic operations, rounding, percentages
Date Add Column Transform:
Extracting year, month, day Creating time-based columns
Appending Sheets
Combining multiple data sources Handling schema differences
Merging Sheets:
Joining data tables Merging vs. appending scenarios
For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com
Conditional Column
Creating new fields based on conditions
Fill, GroupBy, Transpose
Forward/backward fill, aggregations, reshaping data
Keep and Remove Columns/Rows
Selecting and removing data before loading
Data Modeling Concepts
Normalization (1NF, 2NF, 3NF)
Structuring data to reduce redundancy
De-Normalization, OLTP vs. OLAP
Balancing performance with structure
Transactional vs. analytical systems
DAX Formulas
DAX Intro, Data Types: Basic syntax, row vs. filter context
First DAX: Simple measures, fundamental calculations
Date Functions, More on Date Functions
Time intelligence (YTD, MTD, etc.)
Text Functions: Concatenation, string extraction, formatting
Logical Functions: IF, SWITCH, handling null values
Calculated Measure, More on Calculated Measure
Reusable expressions, performance considerations
Additional DAX Functions
Statistical, math, financial functions
Creating Your Own Query
Writing custom DAX queries
Best practices for performance
For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com
Case Studies
Life Expectancy
End-to-end implementation (data sourcing, modeling, visualization)
Applying Power BI features in a real-world scenario
Tableau
Introduction to Tableau
Overview of Tableau
Purpose and use cases Installing or accessing Tableau
Tableau User Interface
Navigating the workspace (Data pane, Shelves, Marks card)
Distinguishing Dimensions vs. Measures
Basic Charting & Data Handling
Working with Data
Connecting to common data sources Default properties of fields
Common Charts
Bar Charts, Line Charts
Combined Axis Charts, Area Charts, Dual Axis Charts
Basic plot types and when to use them
Filters & Simple Analytics
Filter Types
Dimension Filter, Measure Filter Visual Filter, Interactive Filter
Data Source Filter, Context Filter
Sets, Groups, and Parameters
Creating sets to segment data
Grouping members for simplified categories
Using parameters for user-driven calculations
For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com
Calculated Fields & Functions
Calculated Fields
Building basic expressions Overview of Calculation Syntax
Date & Text Functions
Working with dates (e.g., YEAR(), MONTH())
Common text functions (e.g., LEFT(), MID(), TRIM())
Bins and Histograms: Segmenting measures into intervals
Reference Lines & Trend Lines:
Adding statistical insights to visualizations
Quick Table Calculations
Built-in running totals, moving averages, etc.
Dashboards
Introduction to Dashboards
Creating a dashboard layout Adding sheets and objects
Applying filters across multiple views
Actions
Setting up basic interactivity (filter actions, highlight actions)
Data Integration
Union & Joins
Combining data within the same source
Joining multiple tables
Data Blending
Merging data from different sources (light introduction)
Case Study: Loan Application Analysis
For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com
Capstone Project
Project : HR employee Data Analysis (POWER BI)
Project : Life Expectancy (POWER BI)
Project : Linear Regression Deployment (MACHINE LEARNING)
Project : Chat Bot (MACHINE LEARNING, NLP)
Project : Human Activity Recognition (DEEP LEARNING)
Project : BERT Email Classification (DEEP LEARNING)
Project : Project like Google Assistant (NLP)
Project : Sentimental Analysis (NLP)
Placement Preparation
Get ready for your dream job with resume building, LinkedIn optimization,
and mock interviews.
1. Resume Preparation Session
2. LinkedIn mastery
3. Naukri mastery
4. Interviews Question
5. Session with Industry Mentor
For Queries: 9842191947 | [email protected] For Registration www.codegalatta.com