ML Assignment 1

Machine learning

Uploaded by

927623mca060

DEPARTMENT OF

MASTER OF COMPUTER APPLICATIONS

SELF STUDY ASSIGNMENT – 1

MCB1724– MACHINE LEARNING USING

Name : KAVIN N

Register No.: 927623MCA022

Year / Sec : I / -MCA

Marks Awarded:

Criterion                      Max. marks
Objective Headline             (2)
Problem Analysis               (3)
Solution and Results           (4)
Technology and Methodology     (4)
Descriptive Statistics         (4)
Conclusion and References      (3)
Total                          (20)
Assignment Topic: INTRODUCTION TO POPULAR PYTHON LIBRARIES FOR ML: NUMPY, PANDAS, MATPLOTLIB
CO: CO1, CO2
PO addressed: PO1, PO2, PO3, PO5, PO6, PO7, PO8, PO9, PO11, PO12
BTL Level: BTL 4

TITLE: INTRODUCTION TO POPULAR PYTHON LIBRARIES FOR ML (NUMPY, PANDAS, MATPLOTLIB)

Objective:
NumPy:

NumPy aims to provide efficient numerical operations on large arrays and matrices. It
facilitates mathematical and logical operations on arrays, along with a vast collection of high-
level mathematical functions to operate on these arrays.

pandas:

The main objective of pandas is to provide easy-to-use data structures and data analysis tools
for Python. It simplifies the process of working with structured data, such as tabular data and
time series data, by offering powerful data manipulation and analysis capabilities.

Matplotlib:

Matplotlib's primary objective is to create high-quality static, animated, and interactive
visualizations in Python. It enables users to generate a wide variety of plots and charts to
explore and communicate data effectively, facilitating data visualization tasks in scientific
computing and data analysis.

Problem Analysis:

NumPy:

Before NumPy, numerical computations on large datasets in Python were inefficient and
slow. Standard Python lists lack the ability to perform vectorized operations, leading to
lengthy for-loops or list comprehensions for even basic operations. This inefficiency was a
significant bottleneck for scientific computing tasks, such as linear algebra operations, signal
processing, and statistical analysis.
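
The contrast described above can be sketched as follows. This is a minimal illustration, not a benchmark; the variable names are chosen here for the example only.

```python
import numpy as np

values = list(range(10))

# Plain Python: an explicit loop (here a list comprehension) visits each element.
squared_list = [v * v for v in values]

# NumPy: a single vectorized expression operates on the whole array at once.
arr = np.arange(10)
squared_arr = arr * arr

print(squared_list)
print(squared_arr.tolist())
```

Both approaches give the same numbers; the difference is that the NumPy version expresses the operation on the array as a whole.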

pandas:

Working with structured data, such as CSV files or database tables, in Python often required
writing complex code and using multiple libraries. Without pandas, tasks like loading,
cleaning, transforming, and analyzing tabular data were cumbersome and error-prone.
Standard Python data structures like lists or dictionaries lacked the functionality and
expressiveness needed for efficient data manipulation and analysis.

Matplotlib:

Before Matplotlib, creating high-quality visualizations in Python was challenging and
required stitching together various low-level plotting functions and libraries. There was no
comprehensive plotting library that provided a wide range of plotting options and
customization features. As a result, data scientists and researchers spent significant time and
effort on creating and fine-tuning plots, hindering the exploration and communication of data
insights.

Solution and Results:

NumPy:

Solution: NumPy addresses the inefficiency of numerical computations in Python by
introducing the ndarray, a powerful n-dimensional array object. It provides vectorized
operations, allowing mathematical operations to be performed on entire arrays at once,
eliminating the need for explicit looping. Additionally, NumPy offers a vast collection of
mathematical functions optimized for array operations, including linear algebra, Fourier
transforms, and random number generation.
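
A short sketch of the features named above (ndarray, vectorized arithmetic, linear algebra, Fourier transforms, random numbers) might look like this. The small matrices and the seed value are assumptions made up for the illustration.

```python
import numpy as np

# A 2-D ndarray and a vectorized operation over all of its elements.
a = np.array([[2.0, 1.0], [1.0, 3.0]])
scaled = a * 10  # no explicit loop

# Linear algebra: solve the system a @ x = b.
b = np.array([3.0, 5.0])
x = np.linalg.solve(a, b)

# Fourier transform of a short constant signal.
signal = np.ones(4)
spectrum = np.fft.fft(signal)

# Random number generation with a seeded generator for reproducibility.
rng = np.random.default_rng(seed=0)
samples = rng.normal(size=5)

print(x)
print(spectrum)
print(samples.shape)
```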

Results: With NumPy, developers can write concise and efficient code for numerical
computations, significantly speeding up the execution of scientific computing tasks and
machine learning algorithms. The ability to perform vectorized operations on large arrays
enables faster data processing and analysis, leading to improved productivity and
performance in data-driven applications.

pandas:

Solution: pandas simplifies working with structured data in Python by introducing two main
data structures: DataFrame and Series. DataFrame represents tabular data with rows and
columns, similar to a spreadsheet or SQL table, while Series represents a one-dimensional
labeled array. pandas provides a wide range of functions for data manipulation, including
indexing, filtering, grouping, and aggregation, as well as handling missing data and
time series data.
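
The operations listed above can be illustrated with a small sketch. The city names and temperatures below are hypothetical values invented for this example.

```python
import numpy as np
import pandas as pd

# Hypothetical data, invented only for this illustration.
df = pd.DataFrame({
    "city": ["A", "A", "B", "B"],
    "temp": [20.0, np.nan, 25.0, 27.0],
})

col = df["temp"]                                  # a Series (one labeled column)
warm = df[df["temp"] > 22]                        # filtering with a boolean mask
filled = df["temp"].fillna(df["temp"].mean())     # handling missing data
mean_by_city = df.groupby("city")["temp"].mean()  # grouping and aggregation

print(warm)
print(filled.tolist())
print(mean_by_city)
```

Note that mean() skips missing values by default, so the gap in city "A" does not distort the group average.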

Results: Using pandas, developers can load, clean, transform, and analyze datasets with ease,
streamlining the data preprocessing and exploration process in machine learning workflows.
The intuitive API and powerful functionality of pandas enable faster iteration and
experimentation, which in turn supports building more robust and accurate machine learning
models.

Matplotlib:

Solution: Matplotlib offers a comprehensive plotting toolkit for creating static, animated, and
interactive visualizations in Python. It provides a MATLAB-like interface for generating a
wide variety of plots and charts, including line plots, scatter plots, bar plots, histograms, and
more. Matplotlib allows fine-grained control over every aspect of the plot, such as colors,
labels, axes, and annotations, enabling users to create publication-quality visualizations
tailored to their specific needs.
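
As a sketch of this fine-grained control, the example below sets the color, marker, labels, title, an annotation, and a legend on one line plot. The data and label texts are made up for illustration, and the "Agg" backend is selected only so that the code runs without a display.

```python
import io

import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import matplotlib.pyplot as plt

x = [0, 1, 2, 3, 4]
y = [0, 1, 4, 9, 16]

fig, ax = plt.subplots()
ax.plot(x, y, color="tab:blue", marker="o", label="y = x squared")  # line plot with markers
ax.set_xlabel("x")
ax.set_ylabel("y")
ax.set_title("Customized line plot")
ax.annotate("largest value", xy=(4, 16), xytext=(2, 14),
            arrowprops={"arrowstyle": "->"})
ax.legend()

# Render to an in-memory PNG instead of a file on disk.
buffer = io.BytesIO()
fig.savefig(buffer, format="png")
```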

Results: By using Matplotlib, developers can effectively explore and communicate data
insights through visualizations, enhancing understanding and interpretation. Whether it's
exploring data distributions, comparing trends, or presenting model performance, Matplotlib's
flexibility and customization options empower users to create informative and compelling
visualizations that drive decision-making and insight generation.

Technology and methodology:

NumPy:

Technology: NumPy exposes a Python interface, but its core is implemented in C. For linear
algebra it relies on optimized, low-level libraries written in languages like C and Fortran,
such as BLAS (Basic Linear Algebra Subprograms) and LAPACK (Linear Algebra Package).
These libraries provide efficient implementations of mathematical functions and operations.

Methodology: NumPy follows an array-oriented computing methodology, where
mathematical operations are applied to entire arrays rather than individual elements. This
approach enables efficient vectorized computation and facilitates numerical analysis and
manipulation of large datasets.
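
A minimal sketch of array-oriented computing, using a hypothetical table of scores invented for the example: reductions along an axis, broadcasting, and elementwise comparison all act on whole arrays.

```python
import numpy as np

# Hypothetical scores: 3 students (rows) x 2 tests (columns).
scores = np.array([[50, 80], [65, 90], [35, 70]])

col_means = scores.mean(axis=0)   # one mean per column
centered = scores - col_means     # broadcasting: shape (3, 2) minus shape (2,)
passed = scores >= 60             # elementwise comparison gives a boolean array
n_passed = passed.sum(axis=1)     # number of passed tests per student

print(col_means)
print(centered)
print(n_passed)
```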

pandas:

Technology: pandas is also written largely in Python and builds upon the NumPy library. It
leverages the fast and efficient array operations provided by NumPy, along with additional
features implemented in Python. Performance-critical parts of pandas are written in Cython,
which helps when dealing with large datasets.
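
The relationship between pandas and NumPy can be seen in a small sketch. The Series values and index labels are arbitrary choices for the illustration.

```python
import numpy as np
import pandas as pd

s = pd.Series([1.0, 4.0, 9.0], index=["a", "b", "c"])

values = s.to_numpy()   # the Series data as a NumPy array
roots = np.sqrt(s)      # a NumPy function applied to a Series keeps the index

print(type(values))
print(roots)
```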

Methodology: pandas follows a methodology centered around data manipulation and
analysis. It provides data structures like DataFrame and Series, which allow users to work
with structured data in a tabular format. pandas emphasizes ease of use and productivity,
offering high-level functions for data cleaning, transformation, and analysis.

Matplotlib:

Technology: Matplotlib is implemented mainly in Python, with some compiled extension code,
and makes extensive use of NumPy for numerical computation. Basic plotting works with the
rendering backends that ship with the library. Interactive windows and certain advanced
features rely on optional GUI toolkits or other external tools.

Methodology: Matplotlib follows a methodology of data visualization, providing a flexible
and customizable plotting toolkit. It allows users to create a wide range of static, animated,
and interactive visualizations to explore and communicate data insights effectively.
Matplotlib supports both procedural and object-oriented approaches to plotting, catering to
different user preferences and requirements.
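
The two approaches can be compared side by side in a short sketch. The data and titles are placeholders, and the "Agg" backend is chosen only so the code runs without a display.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import matplotlib.pyplot as plt

x = [1, 2, 3]
y = [2, 4, 6]

# Procedural (pyplot) style: functions act on an implicit "current" figure.
plt.figure()
plt.plot(x, y)
plt.title("pyplot style")
pyplot_title = plt.gca().get_title()
plt.close()

# Object-oriented style: methods are called on explicit Figure and Axes objects.
fig, ax = plt.subplots()
ax.plot(x, y)
ax.set_title("object-oriented style")
oo_title = ax.get_title()
plt.close(fig)

print(pyplot_title)
print(oo_title)
```

The pyplot style is convenient for quick, single plots, while the object-oriented style tends to be easier to manage once a figure contains several axes.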

Statistical data:

NumPy provides functions for various statistical calculations, such as computing measures
of central tendency (mean, median), dispersion (standard deviation, variance), and
percentiles. NumPy has no built-in mode function, but the mode can be derived from the
value counts returned by np.unique. These functions operate efficiently on NumPy arrays,
making them suitable for large datasets.

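import numpy as np

data = np.array([1, 2, 3, 4, 5])
mean = np.mean(data)      # arithmetic mean
median = np.median(data)  # middle value of the sorted data
std_dev = np.std(data)    # population standard deviation (ddof=0 by default)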
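
Variance, percentiles, and the mode can be sketched in the same way. The mode computation via np.unique is one possible approach, shown here as an assumption rather than a dedicated NumPy function; on ties it returns the smallest of the most frequent values.

```python
import numpy as np

data = np.array([1, 2, 2, 3, 3, 3, 4, 4, 5])

variance = np.var(data)                   # population variance (ddof=0 by default)
p25, p75 = np.percentile(data, [25, 75])  # first and third quartiles

# Mode via np.unique: pick the value with the highest count.
values, counts = np.unique(data, return_counts=True)
mode = values[np.argmax(counts)]

print(variance, p25, p75, mode)
```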

pandas:

import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
summary_stats = df.describe()     # count, mean, std, min, quartiles, max
quantile = df['A'].quantile(0.5)  # 0.5 quantile, i.e. the median

Matplotlib:

import matplotlib.pyplot as plt

data = [1, 2, 2, 3, 3, 3, 4, 4, 5]
plt.hist(data, bins=5)  # split the value range into 5 equal-width bins
plt.xlabel('Value')
plt.ylabel('Frequency')
plt.title('Histogram of Data')
plt.show()

Conclusion:

NumPy facilitates efficient numerical operations with its array-oriented computing. pandas
simplifies data manipulation and analysis through its DataFrame and Series structures.
Matplotlib provides a flexible platform for creating a wide range of visualizations to explore
and communicate data insights effectively. Together, these libraries form a powerful trio that
empowers users to tackle data-driven challenges with ease and clarity in Python.

References:

1. https://www.almabetter.com/bytes/tutorials/python/popular-python-libraries
2. https://towardsdatascience.com/top-5-machine-learning-libraries-in-python-e36e3e0e02af
