SUMMER INTERNSHIP REPORT
ON
“Python For Mastering Machine Learning And Data Science”
Submitted in Partial Fulfillment of the Requirements for the Award of the Degree of
Bachelor of Science in Information Technology (BSC.IT)
UTTARANCHAL UNIVERSITY
SESSION 2025 – 2026
Under the Supervision of:
Dr. Saurabh Dhyani
Assistant Professor
Uttaranchal School of Computing Sciences (USCS)
Submitted By:
Prasoon Giri
BSC.IT 5th Semester
UU2310000021
ACKNOWLEDGEMENT
First and foremost, I am ever grateful to God, to whom I owe my life. I
would also like to thank my parents for giving me the opportunity to
study at Uttaranchal University, Dehradun. I wish to express my deep
sense of gratitude to our Project Mentor, Dr. Saurabh Dhyani
(Assistant Professor), and to Prof. (Dr.) Sonal Sharma (Director, USCS)
for their valuable guidance in preparing the project and assembling the
project material. I am very thankful for their blessings and for providing
the necessary facilities required for my computer project file. Lastly, I
also want to thank all those who directly or indirectly took an interest
in completing my project file.
Prasoon Giri
UU2310000021
DECLARATION
I hereby declare that the summer internship report titled “Python For
Mastering Machine Learning And Data Science” is submitted by
Prasoon Giri to Uttaranchal School of Computing Sciences. The
internship was done under the guidance of Dr. Saurabh Dhyani. I
further declare that the work reported in this internship has not been
submitted, and will not be submitted, either in part or in full, for the
award of any other degree or diploma in this university or any other
university or institute.
Prasoon Giri
UU2310000021
CERTIFICATE OF INTERNSHIP
TABLE OF CONTENTS
1. Company Profile
2. Introduction
3. Week 1
4. Week 2
5. Week 3
6. Week 4
7. Week 5
8. Week 6
9. Week 7
10. Week 8
11. Week 9
12. Conclusion
Company Profile
Udemy is a leading global online learning and teaching marketplace that
operates on a two-sided business model, connecting instructors with over
75 million learners worldwide. Founded in 2010 and headquartered in
San Francisco, the company pursues its core mission to "transform lives
through learning" by providing flexible, on-demand access to over
250,000 courses on a wide variety of subjects. A significant and growing portion
of its business is the subscription-based Udemy Business segment,
which provides a curated course library to corporate customers for
employee training and development, and has been the primary driver of
the company's financial growth. As a publicly traded company on the
NASDAQ (UDMY), Udemy has consistently invested in technology,
including artificial intelligence, to enhance its platform and maintain its
competitive edge in the global e-learning market. The company’s culture
is deeply aligned with its mission, valuing continuous learning,
inclusivity, and an agile, results-oriented approach to business.
INTRODUCTION
An internship at Udemy is a unique opportunity to gain practical
experience at a leading global online learning marketplace. The
company's mission is to transform lives through learning by
empowering individuals and organizations with essential skills. My
internship will focus on the "Python for Data Science and Machine
Learning" course, providing a direct connection to the company's core
business and mission. This specific course, widely popular on the
platform, serves as an excellent foundation for mastering the in-demand
fields of machine learning and data science. Through this experience, I
aim to apply theoretical knowledge from the course to real-world
challenges, such as analyzing learner engagement data, optimizing
course content for better learning outcomes, or developing new features
for the platform using Python. The internship will not only enhance my
technical skills in data science but also provide invaluable insights into
the operations of an agile, mission-driven tech company.
Week 1
Foundations in Python and Data Environment Setup
Objective: To build a robust and reproducible foundation in Python
and its core data science ecosystem.
Detailed Activities: The first week was dedicated to mastering the
foundational tools. We started by configuring our environment
using Anaconda, which allowed us to create and manage isolated
virtual environments for different projects, ensuring package
compatibility and reproducibility. We then became proficient in
Jupyter Notebooks, not just for running code but for creating rich,
executable documents that combined live code, equations (using
LaTeX), visualizations, and narrative text. Our deep dive into the
Pandas library was extensive. We learned how to load data from
diverse sources like CSV, JSON, and even SQL databases. We
mastered core data manipulation tasks, including filtering data
using boolean masks, handling multi-index dataframes, and
performing complex aggregations with the groupby() function.
This week also included a thorough review of NumPy, focusing on
advanced array indexing and slicing, broadcasting, and optimizing
for speed by using vectorized operations instead of traditional Python
loops.
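To make this concrete, below is a minimal sketch of the kind of Pandas
and NumPy work practiced this week; the dataframe and its column names
are hypothetical, invented purely for illustration:

import numpy as np
import pandas as pd

# Hypothetical course-ratings data for illustration.
df = pd.DataFrame({
    "course": ["python", "python", "sql", "sql", "ml"],
    "rating": [4.5, 4.0, 3.5, 4.8, 4.9],
    "students": [120, 80, 60, 200, 150],
})

# Filter rows with a boolean mask.
popular = df[df["students"] > 100]

# Aggregate with groupby(): mean rating and total students per course.
summary = df.groupby("course").agg(
    mean_rating=("rating", "mean"),
    total_students=("students", "sum"),
)
print(summary)

# Vectorized NumPy operation instead of a Python loop.
arr = df["rating"].to_numpy()
normalized = (arr - arr.mean()) / arr.std()  # broadcasting over the array
print(normalized)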
Week 2
Comprehensive Data Preprocessing and Advanced EDA
Objective: To transform raw, messy data into a clean, structured
format and to uncover hidden insights through in-depth exploration.
Detailed Activities: This week was all about the essential
groundwork for any data science project. We addressed missing
values using a variety of techniques; besides simple mean or
median imputation, we implemented K-Nearest Neighbors (KNN)
imputation, which uses a machine learning algorithm to predict
missing values. We also handled categorical data by going beyond
simple one-hot encoding to use Target Encoding and Ordinal
Encoding when appropriate. A significant part of the week was
spent on Exploratory Data Analysis (EDA). We used Seaborn to
create complex plots like pair plots to visualize relationships
between all features and violin plots to compare the distributions of
features across different categories. We performed statistical tests
to understand feature correlations and used techniques like the
Interquartile Range (IQR) to detect and handle outliers, which
can skew model performance.
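As an illustrative sketch of two techniques from this week, the snippet
below applies scikit-learn's KNNImputer to a small made-up dataset and
then flags outliers with the IQR rule; the data values and the 1.5
multiplier are conventional assumptions, not figures from the internship:

import numpy as np
import pandas as pd
from sklearn.impute import KNNImputer

# Made-up numeric data with missing values.
df = pd.DataFrame({
    "age": [25, 32, np.nan, 41, 29, 120],   # 120 is a deliberate outlier
    "income": [40_000, 52_000, 48_000, np.nan, 45_000, 47_000],
})

# KNN imputation: each missing value is estimated from its nearest neighbors.
imputer = KNNImputer(n_neighbors=2)
filled = pd.DataFrame(imputer.fit_transform(df), columns=df.columns)

# IQR rule: flag values outside [Q1 - 1.5*IQR, Q3 + 1.5*IQR].
q1, q3 = filled["age"].quantile([0.25, 0.75])
iqr = q3 - q1
outliers = filled[(filled["age"] < q1 - 1.5 * iqr) | (filled["age"] > q3 + 1.5 * iqr)]
print(outliers)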
Week 3
Supervised Learning (Regression) and Model Evaluation
Objective: To implement and evaluate a wide range of regression
models and understand the principles of model optimization.
Detailed Activities: With our data clean, we moved into the core of
supervised learning, focusing on regression. We implemented
Linear Regression and then learned about regularization
techniques by building Ridge Regression and Lasso Regression
models to handle multicollinearity and prevent overfitting. We also
explored powerful, non-linear models like Decision Trees and
Random Forest Regressors. To evaluate our models, we went
beyond simple metrics. We learned when to use Mean Absolute
Error (MAE) for interpretability and Root Mean Squared Error
(RMSE) for penalizing larger errors. A key part of the week was
performing k-fold cross-validation to ensure our models were
robust and could generalize to new data, and we used
sklearn.model_selection.GridSearchCV to automate the
hyperparameter tuning process.
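A minimal sketch of this workflow on synthetic data, assuming an
illustrative alpha grid: Ridge regression tuned with GridSearchCV over
5-fold cross-validation, then scored with MAE and RMSE:

import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import Ridge
from sklearn.metrics import mean_absolute_error, mean_squared_error
from sklearn.model_selection import GridSearchCV, train_test_split

X, y = make_regression(n_samples=200, n_features=5, noise=10.0, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# 5-fold cross-validated grid search over the regularization strength.
grid = GridSearchCV(Ridge(), param_grid={"alpha": [0.01, 0.1, 1.0, 10.0]}, cv=5)
grid.fit(X_train, y_train)

pred = grid.predict(X_test)
mae = mean_absolute_error(y_test, pred)
rmse = np.sqrt(mean_squared_error(y_test, pred))  # RMSE penalizes large errors
print(grid.best_params_, mae, rmse)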
Week 4
Supervised Learning (Classification) and Nuanced
Evaluation
Objective: To master classification algorithms and perform a
nuanced, comprehensive evaluation of their performance.
Detailed Activities: This week was dedicated to classification, a
cornerstone of machine learning. We implemented algorithms like
Logistic Regression and Support Vector Machines (SVMs). We
learned that simple accuracy can be misleading, especially with
imbalanced datasets. We focused on a more holistic evaluation
using the Confusion Matrix to understand false positives and false
negatives. We calculated and interpreted Precision, Recall, and
F1-Score, metrics that are essential for tasks like fraud detection.
We also spent significant time creating and interpreting the ROC
(Receiver Operating Characteristic) curve and calculating the
AUC (Area Under the Curve), which are crucial for evaluating a
model's ability to distinguish between classes at various thresholds.
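The sketch below reproduces this evaluation pipeline on a synthetic,
deliberately imbalanced dataset; the class weights and model settings
are illustrative assumptions:

from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import classification_report, confusion_matrix, roc_auc_score
from sklearn.model_selection import train_test_split

# Synthetic binary problem with an 80/20 class imbalance.
X, y = make_classification(n_samples=500, weights=[0.8, 0.2], random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, stratify=y, random_state=0)

clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)
pred = clf.predict(X_test)

# Confusion matrix exposes false positives/negatives that accuracy hides.
print(confusion_matrix(y_test, pred))
# Precision, recall, and F1-score per class.
print(classification_report(y_test, pred))
# ROC-AUC uses predicted probabilities across all thresholds.
print(roc_auc_score(y_test, clf.predict_proba(X_test)[:, 1]))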
Week 5
Unsupervised Learning and Dimensionality Reduction
Objective: To explore unsupervised learning algorithms and
methods for simplifying and finding patterns in complex datasets.
Detailed Activities: We shifted our focus to unsupervised learning,
where the goal is to find hidden patterns in unlabeled data. We
spent a significant amount of time on K-Means Clustering,
learning how to apply it for customer segmentation and other
business problems. We explored advanced techniques for
determining the optimal number of clusters, such as the "Elbow
Method" and silhouette analysis. We also worked with more
complex clustering algorithms like DBSCAN, which is effective at
finding clusters of varying shapes and densities. A key topic this
week was dimensionality reduction using Principal Component
Analysis (PCA). We learned how to use PCA not only to speed up
model training but also as a powerful tool for data visualization in a
lower-dimensional space. We also briefly touched on using t-SNE
for visualizing high-dimensional data.
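As a small sketch of these ideas, the snippet below runs silhouette
analysis over several values of k and then projects the data with PCA;
the synthetic blobs stand in for real data:

from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.decomposition import PCA
from sklearn.metrics import silhouette_score

# Synthetic 8-dimensional data with four natural clusters.
X, _ = make_blobs(n_samples=300, centers=4, n_features=8, random_state=0)

# Silhouette analysis to choose k (higher scores indicate better-separated clusters).
for k in range(2, 6):
    labels = KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(X)
    print(k, silhouette_score(X, labels))

# PCA projects the 8-dimensional data down to 2 components for plotting.
X_2d = PCA(n_components=2).fit_transform(X)
print(X_2d[:3])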
Week 6
Ensemble Methods and Introduction to Specialized Topics
Objective: To build highly accurate and robust models using
ensemble techniques and to get an introduction to specialized areas.
Detailed Activities: We delved into powerful ensemble methods
that combine the predictions of multiple models to improve
performance. We built and fine-tuned Random Forest and
Gradient Boosting models, understanding the underlying
principles of bagging and boosting that make them so effective. We
explored advanced topics beyond traditional machine learning. We
had a comprehensive introduction to Natural Language
Processing (NLP), learning about text preprocessing techniques
like tokenization and lemmatization, and feature extraction with
TF-IDF. We also delved into Time Series Analysis, learning to
handle and forecast time-dependent data using classic models like
ARIMA. We spent time understanding the different components of
a time series, such as trend, seasonality, and noise.
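To contrast bagging and boosting in code, here is a minimal sketch
comparing the two ensemble families under cross-validation on synthetic
data (the hyperparameters are illustrative defaults):

from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=500, random_state=0)

# Bagging: many deep trees trained on bootstrap samples, predictions averaged.
forest = RandomForestClassifier(n_estimators=200, random_state=0)
# Boosting: shallow trees added sequentially, each correcting earlier errors.
boost = GradientBoostingClassifier(n_estimators=200, learning_rate=0.1, random_state=0)

for name, model in [("random forest", forest), ("gradient boosting", boost)]:
    scores = cross_val_score(model, X, y, cv=5)
    print(name, scores.mean())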
Week 7
Natural Language Processing (NLP) Fundamentals
Objective: To understand the fundamentals of working with text
data and to apply machine learning to it.
Detailed Activities: This week served as a bridge between
foundational machine learning and a specialized field. We had an
in-depth introduction to Natural Language Processing (NLP). We
learned about text preprocessing techniques like tokenization and
lemmatization, and how to clean raw text data. We explored
different ways to convert text into numerical features, including
Bag-of-Words and TF-IDF, and built a simple sentiment analysis
model. We also learned about text vectorization using word
embeddings, which allows us to capture the semantic meaning of
words.
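A toy sentiment-analysis sketch along these lines, with a tiny made-up
corpus (the texts and labels are invented purely for illustration):

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Tiny, made-up labeled corpus.
texts = ["great course, loved it", "terrible pacing, very boring",
         "clear and helpful lectures", "waste of time, poor audio",
         "excellent examples", "boring and confusing"]
labels = [1, 0, 1, 0, 1, 0]  # 1 = positive, 0 = negative

# TF-IDF turns raw text into weighted numerical features for the classifier.
model = make_pipeline(TfidfVectorizer(), LogisticRegression())
model.fit(texts, labels)
print(model.predict(["helpful and clear", "boring lectures"]))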
Week 8
Time Series Analysis and Forecasting
Objective: To learn how to analyze and forecast time-dependent
data.
Detailed Activities: We dedicated this week to Time Series
Analysis. We learned to handle time-dependent data in Pandas,
understanding the importance of proper indexing. We explored how
to decompose a time series into its different components: trend,
seasonality, and residuals. We then built a forecasting model using
ARIMA (Autoregressive Integrated Moving Average) and its
variants. We also explored using traditional machine learning
models for time-series forecasting by creating time-based features
and evaluating our models using specialized metrics for
forecasting. This week provided a strong foundation for tackling a
variety of forecasting problems.
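A minimal forecasting sketch on a synthetic monthly series, assuming an
ARIMA(1, 1, 1) order picked purely for illustration (in practice the
order would come from ACF/PACF plots or information criteria):

import numpy as np
import pandas as pd
from statsmodels.tsa.arima.model import ARIMA

# Synthetic monthly series: linear trend plus noise, on a DatetimeIndex.
idx = pd.date_range("2022-01-01", periods=48, freq="MS")
series = pd.Series(
    np.linspace(100, 150, 48) + np.random.default_rng(0).normal(0, 2, 48),
    index=idx,
)

# Fit the ARIMA model and forecast the next 6 months.
fit = ARIMA(series, order=(1, 1, 1)).fit()
print(fit.forecast(steps=6))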
Week 9
Final Project and Professional Reporting
Objective: To synthesize all skills into a comprehensive, end-to-
end project and create a professional report.
Detailed Activities: This final week was the culmination of all our
learning. We worked on our selected project from start to finish.
We applied all the skills acquired—from extensive data cleaning
and feature engineering to training multiple models, selecting the
best one, and optimizing its performance. We meticulously
documented every step of our process, including the code, data
analysis, and visualizations. The final submission included a
detailed technical report outlining the problem statement, our
methodology, the challenges we faced, and our final results. We
also created a professional presentation to showcase our work,
which demonstrated our ability to not only solve a problem but also
to effectively communicate the process and findings to both
technical and non-technical audiences. This project solidified our
practical skills and provided a complete example of the data
science lifecycle.
Conclusion
The nine-week internship on Python for machine learning and data science was a comprehensive and hands-
on experience that successfully met its objectives. The structured
curriculum provided a progressive learning path, beginning with a strong
foundation in Python and its data science ecosystem and steadily
advancing to complex machine learning concepts.
Over the course of the internship, I gained a deep understanding of the
entire data science workflow. This included data collection and
preprocessing, exploratory data analysis, and the implementation of
various supervised and unsupervised learning algorithms. I also learned
crucial skills in model evaluation and optimization, ensuring that the
models were not only accurate but also robust and reliable.
The project-based approach in the final weeks was particularly valuable,
as it allowed me to apply theoretical knowledge to a practical, real-world
problem. This demonstrated my ability to execute a complete data
science lifecycle, from initial data cleaning to final model selection
and reporting.
This internship has significantly enhanced my technical skills in Python
and its data science libraries. The practical experience gained has
prepared me for future career opportunities in the field of machine
learning and data science.