Data Science
Data Science
TRAINING TRACK IN
Pages:
07 TRACK OUTCOMES
09 PREREQUISITE
CONTENT
10 TRAINING OUTLINE
33 METHODOLOGIES
34 TRAINING ASSESSMENT
35 PROJECTS
ABOUT
SHAI for AI Solutions is a Saudi company operating from Riyadh and Amman,
S H A I technology.
Guided by our vision to become the leading strategic partner for AI-driven
OUR CLIENTS:
We guide businesses from strategy to We offer unique training in AI, data SHAI specializes in sourcing top AI and
execution, with AI experts crafting science, and technology through tech talent, offering swift recruitment
tailored strategies and ensuring flexible, customized paths using and outsourcing services. We provide
seamless implementation to unlock AI's distance learning tools, provided by skilled expertise, reduce costs, enable
full potential and achieve strategic goals. elite SHAI experts. flexible staffing, and boost efficiency.
Recruit
Recruitment Excellence by SHAI:
A Showcase
Architects.
T r a i n i n g O u t l i n e
ABOUT THE TRACK
In today's data-driven era, learning data science has become one of The course covers skills related to Python programming, data
the most in-demand skills in both global and local job markets. analysis using Python, building machine learning solutions with
Statista show that the data analytics market in the Middle East and
The course's need and importance come from the fact that it
North Africa is expanding at an annual growth rate exceeding 12%,
focuses on the foundational skills that a data scientist will certainly
reflecting the increasing demand for professionals capable of
extracting value from data. In Jordan, both governmental and private need, and that it focuses on building analytical skills not just learning
institutions have begun adopting AI and data analytics to support the tools. In other words, the course does not only teach you the
decision-making, making learning this field a strategic professional Python functions and libraries, it teaches when an how to use them,
References:
analysis tasks.
models (e.g., Random Forest, Gradient Boosting), and explain their usage
Use essential Python libraries for numerical analysis, data visualization,
in interviews.
statistics, and machine learning (e.g., NumPy, Pandas, Matplotlib, Seaborn,
Scikit-learn).
Apply dimensionality reduction techniques, describe their use in
Apply key statistical methods including hypothesis testing, confidence interviews, and visualize high-dimensional data effectively.
Understand and work with relational databases using SQL: create databases,
models, and optimize them for performance.
build tables and relationships, load and update data, and retrieve data for
analysis or machine learning tasks. Demonstrate foundational understanding of deep learning and artificial
Use Python to connect to and interact with relational databases. neural networks, including the perceptron, activation functions,
Build, train, analyze, and optimize supervised classical machine learning gradient descent, and backpropagation, and build simple feedforward
models, and confidently describe them in interviews. models for regression and classification tasks.
An individual seeking a career in data analysis, business
Training Outline
intelligence, data science, or machine learning, with no
(OOP)
Training Outline
4 STATISTICS Page 21
6 SQL Page 28
and decision-making
1 world
business
decision-making
Science
Lessons Topics Covered
marketing, etc.
Lessons Topics Covered
-Notebooks: Jupyter)
TO DATA SCIENCE
-Use cases in various industries:
8 What is Data Science
-Predictive analytics
Used For?
-Recommendation engines
-Fraud detection
-Customer segmentation
-Enhanced decision-making
9 What are the Benefits
-Operational efficiency
of Data Science?
-Personalization in services
Training Outline
1 Python Foundations & - Introduction to Python -Introduction to Python
Environment Environment
2 -Variables
-Comments and Docstrings
-Arithmetic Operations
CODING WITH
-Working with Strings and
-Comparison Operations
User Input
-Logical Operations
-User Input
replace(), find())
-Lists
-Slicing
Structures -Sets
Module Tiltle Lessons Topics Covered
-Loops
Training Outline
-Data Structures (Lists)
5 Control Flow &
-Linear Data Structures -List Built-in Functions
Functions
-Immutable and Unordered -Nested Lists
Structures -Dictionaries
Python
Oriented Programming
CODING WITH
Oriented Programming
PYTHON
-Methods and Object -Attributes in Classes
Self Parameter
Code Reusability
Module Tiltle Lessons Topics Covered
Training Outline
1
Analysis
3 -NumPy Operations
generate arrays (arange( ),
PYTHON FOR
-Array attributes: shape,
-Mathematical Functions
DATA ANALYSIS
(max(), argmax( ), min( ),
argmin( )).
Selection
-Broadcasting
-Copying Arrays
-Indexing a 2D array
(matrices)
-Fancy Indexing
(Selection)
-Mathematical Functions
argmin( )).
-Universal Array
Training Outline
3
-what is Panads Library
-Creating DataFrame
-Aggregating and Grouping
-Removing columns
concatenate
-Conditional Selection
-Operations
-setting and resetting
-Data Input and Output
index
PYTHON FOR
-Multi-Index and Index
Point Hierarchy
DATA ANALYSIS
-Dropping Null Values
-Interpolating Missing
Values
-Group by
-Common Aggregation
std(), describe())
-Concatenation
-Merging(inner -default ,
left)
-Unique Values
-Selecting Data
-Applying Functions
Module Tiltle Lessons Topics Covered
Training Outline
4 -Permanently Removing a
names
-Aggregating and Grouping
concatenate
-Find Null Values or Check
-Operations
for Null Values
-Data Input and Output
-Pivot Tables
graphs
-Scatter Plots
-Fundamentals and Plot
-Histograms
Customization
-Bar Charts
-Basic Matplotlib
Commands
Module Tiltle Lessons Topics Covered
Point
-Figure size, aspect ratio
and DPI
-Saving figures
-Setting colors,
syntax
-Plot range
-Grids -rugplot
Point -countplot
-boxplot
Module Tiltle Lessons Topics Covered
-Grids Plots
Point -PairGrid
-pairplot
-Facet Grid
-Styles
-Spine Removal
with Plotly
-Creating Visualizations Data Visualization with
-Numerical Operations
Techniques
with NumPy
-Data Manipulation and
Numerical Operations
1 Importance of mathematics
Training Outline
Basic math for data -Introduction to
Probability)
precision
4
-Linear Algebra -Vectors and matrices
-Matrix operations
(addition, multiplication,
transpose)
science
functions.
-Logical operators.
-Applications in data
models.
(events, outcomes)
-Independent and
dependent events
-Applications of probability
-Measures of variability
-Data distributions
Module Tiltle lessons Topics Covered
Training Outline
Confidence Intervals -Population proportion -Population proportion
-Statistical proportions in
sample mean
mean
-Applications of the
population mean
-Statistical proportions in
statistical equations
two populations
methods
medians / means in
hypothesis testing
(coefficients).
-Importance of weights in
interpreting model
relationships.
-Statistical significance of
weights (p-values).
Module Tiltle lessons Topics Covered
Training Outline
Hypothesis Testing
weights regression.
for weights.
coefficients.
STATISTICS
testing multivariable testing multivariable
hypotheses
hypotheses.
-Conducting simultaneous
between variables.
tests, R-squared).
Module Tiltle lessons Topics Covered
Training Outline
1 Machine Learning -Types of ML Systems -Supervised, semi-
self-supervised,
reinforcement learning.
-Probabilistic vs non-
5 probabilistic models
-Evaluating Regression
MAPE)
- Overfitting and
Underfitting
Regularization)
-Elastic Net
Module Tiltle lessons Topics Covered
Training Outline
2 Regression -Logistic Regression: The Sigmoid Function
Softmax Regression
-Linearity
-Independence of Errors
5
-Homoscedasticity
-Normality of Residuals
-No Multicollinearity
-Basic Assumptions of
3
& INTRODUCTION
Classification -Evaluation Metrics -Introduction
Introduction -Accuracy
TO DEEP LEARNING
-Confusion matrix
-Precision, Recall, F1
-ROC
-Cross Validation
Types -Multi-class
-Multi-label
-Multi-output
Training Outline
SVM -Non-linear SVM Mapping to higher order
4
space
-Computation complexity
RBF)
with trees
-Trees hyperparameters
learner
-Diversity in ensemble
data)
instances
-Bootstrap sampling in
features
Error
-Random Forest
-Feature Importance
Module Tiltle lessons Topics Covered
-Gradient Boosting
Reduction dimensionality
-Dimensionality reduction
5
approaches (projection-
based vs manifold
learning)
-Variance of data
-PCA
-PCs and projection in PCs
-Linear vs non-linear
& INTRODUCTION projections
-Clustering in image
processing
density-based methods
-Non-linear activation
-Loss function
-Gradient Descent
-Backpropagation
Module Tiltle lessons Topics Covered
Training Outline
1 What is a relational -Introduction to SQL -Learn the basic structure
(Structured Query
Language).
database in data
management.
SQL
2 Working with data in a -Select -Retrieve data from one or
table.
ascending or descending
more columns.
on specific conditions.
Training Outline 2 Working with data in a -Select -Retrieve data from one or
table.
conditions.
based on specific
SQL
conditions.
records returned in a
query.
column.
criteria.
or specify ranges/sets of
those groups.
SQL
3 Working with data in -Joins -Understand how to merge
join.
table.
Training Outline
4 Managing databases -Create Table -Define a new table within
structure.
6
database.
add/remove columns).
7
Docker
(RECORDED VIDEOS)
Training Outline
8 METHODOLOGY
Resource-Driven Learning
Comprehensive projects that simulate real-
Pre-recorded videos on foundational AI concepts.
Interactive Sessions
Mentorship and Support
Includes discussions, Q&A sessions, and group Aims to help participants achieve their learning objectives.
activities
Continuous Assessment
Training Outline
Pre-Assessment
9 Pre-Self-Evaluation
Graded Quizzes
TRAINING
Tasks
ASSESSMENT
Observation
Final Project
Post Assessment
Post Self-Evaluation
In addition to the practical tasks that follow each training
Training Outline
module, and the interactive quizzes based on challenges and
order:
PROJECTS
Project:
-Data Analysis
medicine.
Engineering: Utilize deep learning for design optimization and predictive maintenance.
effectiveness.
Finance: Employ advanced analytics to optimize trading strategies and assess risks.
engagement.
techniques.
Public Health: Use data science to track health trends and improve community
initiatives.
Agriculture: Optimize crop yields and resource management with data-driven insights.
Economics: Apply data science to analyze market trends and inform policy decisions.
algorithms.
analytics.
MAIN HALL
TRAINING ROOMS
JORDAN BRANCH
CONTACT US