1.
5-Month Roadmap
– Python, Math, and
Libraries (4 hr/day)
Week 1 – Python Basics (Foundation building in coding fundamentals)
Variables and data types, loops (for, while),
functions, recursion
Core Topics Covered
Data structures: lists, tuples, sets, dictionaries
Daily writing of 10–15 small programs
(calculator, prime numbers, factorial, string
reversal)
Practice and Resources
Resource: CodeWithHarry Python playlist
2 hr: Python theory and exercises
Daily Schedule (4 hours) 1 hr: Small practice problems
1 hr: Revision and note-taking
Week 2 – Python OOP and File Handling (Learning structured programming
and data storage)
Classes and objects, methods, attributes
Key Concepts
File handling techniques: reading/writing CSV
and text files
Mini project: Student marks management system
using CSV for data storage and retrieval
Practical Application
2 hr: Object-Oriented Programming concepts
Daily Schedule (4 hours) 1 hr: File handling practice
1 hr: Coding mini project
Week 3 – Math Basics for AI/ML (Foundational math for machine learning)
Linear algebra: vectors, matrices, dot product,
transpose
Probability and statistics: mean, median,
variance, standard deviation
Topics Studied
Basic distributions: Normal and Binomial
Solving math problems using Python
Hands-on Practice
Implementing statistical calculations with Numpy
2 hr: Theory and formulas
Daily Schedule (4 hours) 1 hr: Python implementation using Numpy
1 hr: Mini problems and exercises
Week 4 – Python Libraries Basics (Data manipulation and visualization
foundation)
Numpy: arrays, operations, broadcasting
Pandas: DataFrame, CSV handling, filtering,
grouping
Libraries and Techniques
Visualization: Matplotlib and Seaborn plots (line,
bar, histogram, heatmap)
Titanic dataset exploration: loading CSV,
cleaning data, visualizing distributions
Project Focus
1.5 hr: Numpy practice
Daily Schedule (4 hours) 1.5 hr: Pandas practice
1 hr: Data visualization with Matplotlib and
Seaborn
Week 5 – Advanced Pandas and Statistics (Complex data handling and
feature engineering)
Handling missing values and duplicates
Feature creation and transformation
Advanced Topics
Grouping, aggregation, merging datasets
Statistical measures: correlation and covariance
Working on Titanic, IPL, or Covid datasets for
cleaning, feature creation, and analysis
Practice Projects
2 hr: Advanced Pandas techniques
Daily Schedule (4 hours) 1 hr: Statistics with Pandas and Numpy
1 hr: Mini project focusing on exploratory data
analysis and feature engineering
Week 6 – Mini Projects and Practice (Applying skills through real data
projects)
Titanic survival analysis: data cleaning, EDA,
basic statistics
IPL data analysis: batting averages, top players,
visualizations
Mini Projects Included
Covid dataset trends visualization
2 hr: Data cleaning and manipulation
Daily Schedule (4 hours) 1 hr: Visualization tasks
1 hr: Project documentation in Google Colab or
Jupyter Notebook
Week 7 (Half Month) – Revision and Consolidation (Strengthening weak
areas and preparing portfolio)
Revising Python basics, OOP, math concepts,
Numpy, Pandas, and visualization
Focus Areas Finalizing mini projects for portfolio readiness
Creating cheat sheets for functions and formulas
2 hr: Revision and Python exercises
Daily Schedule (4 hours) 1 hr: Dataset practice
1 hr: Documentation and portfolio preparation
Outcome After 1.5 Months
Strong foundation in Python basics and OOP principles
Solid math background essential for AI/ML: linear algebra, probability, statistics
Confident use of Numpy, Pandas, Matplotlib, and Seaborn libraries
Completion of 2-3 mini projects ready for showcasing in a portfolio
Preparedness to advance to machine learning topics next