0% found this document useful (0 votes)
2 views9 pages

Data Science

Uploaded by

Parveen
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2 views9 pages

Data Science

Uploaded by

Parveen
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 9

Basic to Intermediate with Expanded Super vised Learning

Team Manoj
89399 15538
Course Overview

Course Title: Data Science Fundamentals: From Basics to


Advanced Machine Learning
Price: ₹45,000

Greens
Duration: 12 weeks (3-4 hours per week per module)
Target Audience: Beginners and intermediate learners aiming
to master data science foundations and a wide range of
super vised learning algorithms.

Technology
This 12-week course builds a strong foundation in data
science, emphasizing Python, NumPy, Pandas, and an
extensive exploration of super vised learning algorithms.
Learners will progress from basic data manipulation to
implementing and evaluating a variety of machine learning
models, preparing them for practical data science
challenges.

Team Manoj
89399 15538
Prerequisites
-Basic understanding of mathematics (e.g., algebra, statistics)
-No prior programming experience required

Module 1: Introduction to Data Science and Python


Duration: 1 week

Greens
Objective: Understand data science and set up Python.
Topics:
- What is Data Science? Applications and Workflow
- Python Setup: Anaconda, Jupyter Notebooks

Technology
- Python Basics: Variables, Loops, Functions, Libraries
Activities:
- Write a simple Python script (e.g., data summary)
- Explore a dataset in Jupyter Notebook

Module 2: Data Manipulation with NumPy


Duration: 1 week
Objective: Master numerical computing with NumPy.
Topics:
- NumPy Arrays: Creation, Indexing, Slicing
- Array Operations: Arithmetic, Statistical Functions
- Multidimensional Arrays and Broadcasting
Activities:
- Perform calculations on a dataset (e.g., weather data)

Team Manoj
89399 15538
Module 3: Data Analysis with Pandas
Duration: 1 week
Objective: Learn data manipulation and exploration.
Topics:
- Pandas: Series, DataFrames, Importing Data

Greens
- Data Cleaning: Missing Values, Duplicates, Outliers
- EDA: Grouping, Pivot Tables, Summary Statistics
Activities:
- Clean and analyze a dataset (e.g., e-commerce sales)

Technology
Module 4: Data Visualization Basics
Duration: 1 week
Objective: Create effective data visualizations.
Topics:
- Matplotlib: Line, Bar, Scatter Plots
- Seaborn: Heatmaps, Pair Plots
- Visualization Best Practices
Activities:
- Visualize trends in a dataset (e.g., stock prices)

Team Manoj
89399 15538
Module 5: Statistics for Data Science
Duration: 1 week
Objective: Build a statistical foundation.
Topics:
- Descriptive Statistics: Mean, Variance, Skewness

Greens
- Probability: Normal Distribution, Central Limit Theorem
- Hypothesis Testing: T-tests, P-values
Activities:
- Conduct statistical tests on a dataset (e.g., marketing data)

Technology
Module 6: Introduction to Machine Learning

Duration: 1 week
Objective: Understand machine learning basics.
Topics:
- Types of ML: Super vised, Unsuper vised
- Scikit-learn: Data Splitting, Model Training
- Evaluation Metrics: Accuracy, Confusion Matrix
Activities:
- Train a basic model (e.g., predict pass/fail)

Team Manoj
89399 15538
Module 7: Supervised Learning - Regression Algorithms
Duration: 1 week
Objective: Master regression techniques.
Topics:
- Linear Regression: Simple and Multiple
- Polynomial Regression

Greens
- Regularization: Ridge, Lasso, Elastic Net
- Evaluation: MSE, RMSE, R²
Activities:
- Build regression models (e.g., predict house prices)

Technology
Module 8: Supervised Learning - Classification Algorithms

Duration: 1 week
(Part 1)

Objective: Explore foundational classification methods.


Topics:
- Logistic Regression: Binary and Multiclass
- K-Nearest Neighbors (KNN)
- Support Vector Machines (SVM): Linear Kernel
- Evaluation: Precision, Recall, F1-Score
Activities:
- Classify a dataset (e.g., customer churn)

Team Manoj
89399 15538
Module 9: Supervised Learning - Classification Algorithms
(Part 2)
Duration: 1 week
Objective: Dive into tree-based and ensemble methods.
Topics:
- Decision Trees: Pruning, Feature Importance

Greens
- Random Forests: Bagging, Hyperparameters
- Gradient Boosting: XGBoost, LightGBM Basics
- ROC Cur ve and AUC
Activities:

Technology
- Build ensemble models (e.g., fraud detection)

Module 10: Unsupervised Learning Basics

Duration: 1 week
Objective: Introduce unsuper vised techniques.
Topics:
- K-Means Clustering
- Hierarchical Clustering
- PCA for Dimensionality Reduction
Activities:
- Cluster a dataset (e.g., customer segmentation)

Team Manoj
89399 15538
Module 11: Model Optimization and Evaluation
Duration: 1 week
Objective: Enhance and evaluate ML models.
Topics:
- Cross-Validation: K-Fold, Stratified

Greens
- Hyperparameter Tuning: Grid Search, Random Search
- Bias-Variance Tradeoff, Overfitting
Activities:
- Optimize a model from previous modules

Technology
Module 12: Capstone Project
Duration: 1 week
Objective: Apply skills to a real-world problem.
Topics:
- End-to-End Workflow: Data Prep to Model Deployment
- Presenting Insights and Model Results
Activities:
- Complete a project (e.g., predict sales or classify emails)

Team Manoj
89399 15538
Assessment
- Weekly assignments (40%)
- Mid-term quiz (20%)
- Capstone project (40%)

Learning Outcomes

Greens
1. Write Python code for data analysis and modeling.
2. Use NumPy and Pandas for data manipulation.
3. Implement and evaluate a wide range of super vised learning
algorithms (regression and classification).

Technology
4. Apply unsuper vised learning and model optimization techniques.

Tools
Python, NumPy, Pandas, Matplotlib, Seaborn, Scikit-learn, XGBoost,
LightGBM

Team Manoj
89399 15538

You might also like