Machine Learning Operations

Uploaded by

sayantani 11


This introduction covers what exactly MLOps is, along with its
auxiliaries.

Definition:
Machine learning operations (MLOps) is the development and use of
machine learning models by development operations (DevOps) teams.
MLOps involves a set of processes, or rather a sequence of steps,
implemented to deploy an ML model to the production environment.
Several steps must be completed before an ML model is production
ready. These processes ensure that your model can scale to a large
user base and still perform accurately.

What is the use of MLOps?

So far, we have built and trained a lot of models, tested them, and
covered the other aspects of machine learning. But what is the use of
all that, and where is it applied? This is where MLOps comes into
play.
Creating an ML model that predicts what you want from the data you
have fed it is easy, but creating a model that is reliable, fast,
accurate, and usable by many users is difficult, isn't it? MLOps
helps because of the following challenges:
• Models that rely on large amounts of data are very difficult for
a single person to handle, and so is tracking their development
and usage.
• With so much data, even a small tweak in the parameters can
cause an enormous difference in the results and accuracy.
• Feature engineering is another demanding task with a large
dataset, because we need to keep track of the features the model
is working with.
• Monitoring a model is not as easy as monitoring software
performance.
• Debugging an ML model is extremely painful.
• Now comes the major problem. We work with real-world data for
predictions, and real-world data keeps changing, so the model
should keep updating itself too. This means we need to keep track
of changes in the new data and make sure that the model learns
from them.
To take a funny example: as developers, we always give the excuse,
"it's working on my end...".
That excuse does not hold in MLOps.

DevOps vs MLOps
The definition itself mentioned DevOps. So what exactly do we mean by
this term?

Discussion on DevOps

DevOps is the process of building and deploying a software
application simultaneously. You might now wonder how MLOps differs
from this, because as described the two seem pretty much the same!
a) What is DevOps?
It is a combination of development and operations that increases the
efficiency, speed and security of software development and delivery
compared to traditional processes.
Dev: Plan, Create, Verify
Ops: Package, Release, Configure, Monitor
DevOps can be best explained as people working together to conceive,
build and deliver secure software at top speed. DevOps practices
enable software development (dev) and operations (ops) teams to
accelerate delivery through automation, collaboration, fast feedback,
and iterative improvement.

Once you dive deeper into the concept of DevOps, you will also come
across the term Agile Development.

Stages in DevOps

The DevOps stages are aimed at developing a software application.
You plan the features you want to release, write the code, build it,
test it, create a deployment plan and deploy it. After that, you
monitor the infrastructure where the application has been deployed.
This cycle keeps repeating until the application is fully developed.

Before going on to deployment, the code goes through multiple
procedures: plan, code, build, test, release, deploy, operate, monitor.

The code stage covers version control and source code.
The build stage covers development and automation.
The test stage covers quality analysis and control.
The release stage brings in one of the most important aspects of
DevOps, CI/CD (Continuous Integration and Continuous Delivery).
The deploy stage covers IaaS, provisioning and configuration
management.
The operate stage covers virtualization and containerization.
The monitor stage covers logging and visualization.

In Machine Learning Operations, however, things work a little
differently. We implement the following stages:

1. Scoping: Here we define the project, which means checking whether
the problem requires machine learning to solve it. While performing
the requirement analysis, we check that the necessary data is
available and verify whether it is biased. Based on that, we
formulate the POC (Proof of Concept). We also check whether the
project reflects the objective of the program and its real-world
use cases.

2. Data Engineering: We are all familiar with this stage, which can
be as easy or as complex as the data makes it. Here we collect the
data, establish the relationships within it, format it, label it
and organize it. This makes it the most crucial stage in the
entire Machine Learning Operations process.

3. Modelling: Now comes the most interesting part, the creation of
the machine learning model. We train the model with the processed
data, make predictions, define the error measurement, assess the
errors and track the performance of the model.

4. Deployment: In this stage, we package the model, just like we
wrap an item before gifting it to someone. The packaged model is
then deployed to the cloud or to edge devices as per the
requirements. Packaging can take several forms: the model can be
wrapped in an API server exposing REST or gRPC endpoints through
which users access it, shipped as a Docker container deployed on
cloud infrastructure, deployed on a serverless cloud platform, or
bundled into a mobile application for edge-based models.

5. Monitoring: Once your gift has been delivered, what comes next?
You try to capture the reaction of the person who received it. The
same happens here: once the application is deployed, we monitor the
infrastructure in order to maintain and update the model. This stage
has components like:

Process of Building DevOps:

1. Code: Here, we use a version control system to collaborate with
the other team members.
2. Build: Here we write the code in a high-level language, making
sure that the code performs the required tasks and then gets

3. Monitoring the space / infrastructure where we have deployed:
We monitor the infrastructure for load, utilization, storage and
health. This tells us about the environment where the ML model is
deployed.

4. Monitoring the model's performance, accuracy, errors and bias:
This tells us whether the model is performing well and as expected,
and whether it remains valid for real-world scenarios.

This will not be much beneficial for some of the particular

as some models might require learning from the user inputs and
the predictions they make. This lifecycle is valid for most ML
use cases.
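
The two kinds of monitoring described above can be sketched in Python
as follows. This is only a minimal illustration, not a production
monitor: the names infra_alerts and AccuracyMonitor, the metric names,
the limits and the window size are all assumptions made for this
example.

```python
from collections import deque


def infra_alerts(metrics, limits):
    """Return the names of infrastructure metrics that exceed their limits.

    Both arguments are plain dicts such as {"cpu_load": 0.93}. The metric
    names and limit values are hypothetical placeholders.
    """
    return sorted(name for name, value in metrics.items()
                  if name in limits and value > limits[name])


class AccuracyMonitor:
    """Track model accuracy over the most recent `window` predictions."""

    def __init__(self, window=100, min_accuracy=0.8):
        # Each entry is True where the prediction matched the real outcome.
        self.outcomes = deque(maxlen=window)
        self.min_accuracy = min_accuracy

    def record(self, predicted, actual):
        self.outcomes.append(predicted == actual)

    def accuracy(self):
        if not self.outcomes:
            return None  # nothing observed yet
        return sum(self.outcomes) / len(self.outcomes)

    def is_degraded(self):
        acc = self.accuracy()
        return acc is not None and acc < self.min_accuracy
```

The sliding window is a design choice: it makes the monitor react to
recent behaviour instead of averaging over the whole lifetime of the
model.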

UNDERSTANDING THE CI/CD PIPELINE

In development, whenever we update the code, we want the update to
reach everywhere the code is used, so that every user has the same
functionality on their device. This sounds easy but is complicated
in practice.

CI/CD ensures the integration and delivery of incremental changes
to a live application. It is triggered by a new update to the version
control system. This integration helps the changes pass through all
the stages until they safely reach the production environment.

The Integration pipeline focuses on the initial stages of software
delivery, encompassing tasks like building, testing, and packaging the
application. On the other hand, the Deployment pipeline ensures the
smooth deployment of new software packages in both testing and
production environments.
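
The split between the two pipelines can be sketched as below. It is a
rough illustration under assumptions: run_pipeline and the stage
functions are hypothetical names, and every stage body is a placeholder
for a real build tool, test runner or deployment tool.

```python
def run_pipeline(stages):
    """Run (name, step) pairs in order and stop at the first failure.

    Each step is a zero-argument callable that returns True on success.
    Returns the names of the stages that completed and an overall flag.
    """
    completed = []
    for name, step in stages:
        if not step():
            return completed, False  # later stages never run
        completed.append(name)
    return completed, True


# Hypothetical stage implementations that always succeed.
def build():
    return True


def run_tests():
    return True


def package():
    return True


def deploy_to_testing():
    return True


def deploy_to_production():
    return True


INTEGRATION = [("build", build), ("test", run_tests), ("package", package)]
DEPLOYMENT = [("deploy-testing", deploy_to_testing),
              ("deploy-production", deploy_to_production)]

# A new commit would trigger the integration stages first, then deployment.
completed, ok = run_pipeline(INTEGRATION + DEPLOYMENT)
```

Stopping at the first failing stage is what keeps a broken build or a
failing test from ever reaching the production environment.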

Why are we using this concept in Machine Learning?
Imagine you're baking a cake. The traditional way involves tasting the
batter as you go, adjusting ingredients, and hoping the final cake turns
out right. This can be messy and unpredictable, especially if different
people bake it with slightly different methods.

The "immutable" way is like following a strict recipe without any
changes. You measure all the ingredients precisely, mix them in a
specific order, and bake for the exact time. This ensures the cake will
always turn out the same, regardless of who bakes it.

Applying this to machine learning:

Traditional Approach: Data scientists work on their own laptops, using
their preferred tools and versions. This can lead to unexpected changes
in model behaviour when someone else tries to run the same analysis.

Immutable / Newer Approach: Everyone follows a standardized process
with pre-defined tools and versions. This removes the risk of
inconsistencies and makes it easier to understand and fix problems.

The benefits of following an "immutable" process:

Reproducible results: You can be confident that any changes in model
behaviour are due to actual data or code changes, not accidental
differences in setups.
Easier troubleshooting: It's simpler to pinpoint the source of issues
when everyone is using the same tools and steps.
Improved collaboration: Different data scientists can easily share and
understand each other's work.

Think of it like building a Lego set. Each person gets the same
instructions and pieces, resulting in the same finished product every
time. This makes teamwork and consistency much easier.
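
One small piece of the "immutable" approach, checking that a machine
really has the pre-defined versions installed, can be sketched like
this. The function names are assumptions made for this example;
importlib.metadata is part of the Python standard library (3.8+).

```python
from importlib import metadata


def find_mismatches(pinned, installed):
    """Compare pinned versions against installed ones.

    Both arguments map package name -> version string. Returns a dict of
    package -> (expected, found) for every package that differs; `found`
    is None when the package is missing.
    """
    return {name: (want, installed.get(name))
            for name, want in pinned.items()
            if installed.get(name) != want}


def installed_versions(names):
    """Look up the installed version of each named package."""
    found = {}
    for name in names:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            pass  # missing packages are simply left out of the result
    return found
```

With a team-wide dict of pins (any package names and versions you put
in it are your own), find_mismatches(pins, installed_versions(pins))
returns an empty dict only when the local setup matches the standard
one.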

Continuous Integration / Continuous Delivery (CI/CD), originating from
and gaining prominence in Software Development, is centred around
the idea of regularly delivering incremental changes to a live
application via an automated process of rebuilding, retesting, and
redeploying.

In contrast to traditional CI/CD pipelines for standard software
applications, Machine Learning introduces two additional dimensions:
Model and Data. While conventional software engineering practices
revolve around code, ML involves extensive codebases alongside the
management of substantial datasets and models to extract actionable
insights.

Designing an ML system involves grappling with challenges like:

• Storing model artifacts and enabling teams to track experiments
with their metadata for result comparability and reproducibility.
• Handling often large and rapidly changing datasets. In addition to
monitoring model performance from an application standpoint, ML
demands vigilance in tracking data changes and adjusting models
accordingly.
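
The first challenge, tracking experiments with their metadata, can be
sketched with nothing more than a list of records. This is a toy
tracker under assumptions: fingerprint, log_run and best_run are
hypothetical helper names, and a real team would normally use a
dedicated tracking tool instead.

```python
import hashlib
import json


def fingerprint(rows):
    """Return a short, stable hash of a dataset given as a list of rows."""
    payload = json.dumps(rows, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()[:12]


def log_run(log, params, metrics, rows):
    """Append one experiment record to `log` (a list) and return it."""
    record = {
        "run_id": len(log) + 1,
        "params": params,
        "metrics": metrics,
        "data_hash": fingerprint(rows),  # ties the result to its data
    }
    log.append(record)
    return record


def best_run(log, metric):
    """Return the logged run with the highest value of `metric`."""
    return max(log, key=lambda record: record["metrics"][metric])
```

Storing the data hash next to the parameters and metrics is what makes
two runs comparable: if the hashes differ, a change in the metric may
come from the data and not from the parameters.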

ML systems demand consistent monitoring for performance and data
drift. When model accuracy dips below a set baseline or data
experiences concept drift, the entire system must undergo another
cycle. This means replicating all steps, from data validation to model
training and evaluation, testing, and deployment. This underscores why
ML systems stand to gain significantly from automated pipelines,
especially in the context of CI/CD.
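
The retraining trigger described above can be sketched as follows.
This is a simplified illustration under assumptions: the function
names and the max_shift threshold are made up for this example, and
the drift check only compares the mean of one input feature, which
detects a shift in the data and is a crude stand-in for a proper drift
test (detecting concept drift would also need the true labels).

```python
from statistics import mean, stdev


def mean_shift(reference, current):
    """Distance between the current mean and the reference mean,
    measured in reference standard deviations."""
    spread = stdev(reference)
    if spread == 0:
        return 0.0 if mean(current) == mean(reference) else float("inf")
    return abs(mean(current) - mean(reference)) / spread


def should_retrain(accuracy, baseline, reference, current, max_shift=2.0):
    """Start another cycle when accuracy dips below the baseline or the
    incoming feature values have drifted away from the training data."""
    return accuracy < baseline or mean_shift(reference, current) > max_shift
```

In an automated pipeline, a True result would kick off the whole cycle
again, from data validation through training, evaluation, testing and
deployment.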

Exploring the Machine Learning Lifecycle

Now we look at the infrastructure setup needed to deploy a model in
production. As the above picture shows, ML code is only a small part
of it. Let us go through the components one by one.

Data Collection: This step involves collecting data from various
sources. ML models require a lot of data to learn. Data collection
involves consolidating all kinds of raw data related to the problem.
For example, image classification might require you to collect all
available images or scrape the web for images. Voice recognition may
require you to collect tons of audio samples.

Data Verification: In this step we check the validity of the data:
whether the collected data is up to date, reliable, and reflective of
the real world, whether it is in a properly consumable format, and
whether it is structured properly.
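
A few of these checks (structure, format and freshness) can be
sketched as below. The field names, the collected_on column and the
age limit are assumptions made for this example, not part of any
particular dataset.

```python
from datetime import date, timedelta


def verify_rows(rows, required, max_age_days, today):
    """Split rows into (valid, problems).

    rows      list of dicts, one per record
    required  dict mapping field name -> expected Python type
    Each row must also carry a `collected_on` date for the freshness check.
    """
    valid, problems = [], []
    for index, row in enumerate(rows):
        # Structure and format: every required field has the right type.
        errors = [f"{field}: expected {kind.__name__}"
                  for field, kind in required.items()
                  if not isinstance(row.get(field), kind)]
        # Freshness: the record must be recent enough.
        collected = row.get("collected_on")
        if not isinstance(collected, date):
            errors.append("collected_on: missing date")
        elif today - collected > timedelta(days=max_age_days):
            errors.append("collected_on: stale")
        if errors:
            problems.append((index, errors))
        else:
            valid.append(row)
    return valid, problems
```

Returning the problems together with the row index, instead of
silently dropping bad rows, makes it possible to trace a data issue
back to its source.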

Feature Extraction: Here, we select the best features for the
model to predict from. In other words, your model may not require all
the data in its entirety to discover patterns; some columns or parts
of the data might not be used at all. Some models perform well when a
few columns are dropped. We usually rank the features by importance:
features with high importance are included, and those with low or
near-zero importance are dropped.
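
The rank-and-drop idea can be sketched as follows. Note the
assumption: the notes do not say how importance is measured, so this
example uses the absolute Pearson correlation with the target as a
simple stand-in score. The function names and the threshold are also
made up for this example.

```python
from math import sqrt


def abs_correlation(xs, ys):
    """Absolute Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no signal
    return abs(cov / sqrt(var_x * var_y))


def select_features(columns, target, threshold=0.1):
    """Rank columns by importance score and drop those below `threshold`.

    columns maps feature name -> list of values. Returns (name, score)
    pairs, highest score first.
    """
    scored = [(name, abs_correlation(values, target))
              for name, values in columns.items()]
    kept = [(name, score) for name, score in scored if score >= threshold]
    return sorted(kept, key=lambda pair: pair[1], reverse=True)
```

Correlation only captures linear, one-feature-at-a-time relationships,
so treat this as a first filter and not as a final judgement on a
feature.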

Configuration: This step involves setting up the protocols for
communications, system integrations, and how the various components
in the pipeline are supposed to talk to each other. You want your
data pipeline to be connected to the database, your ML model to
connect to the database with proper access, your model to expose
prediction endpoints in a certain way, and your model inputs to be
formatted in a certain way. All the configurations required for the
system need to be properly finalized and documented.
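
One way to keep such decisions finalized and documented is to hold
them in a single configuration object and validate every request
against it. The sketch below assumes this approach; ServiceConfig,
handle_request, the database URL, the endpoint path and the input
field names are all hypothetical placeholders.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class ServiceConfig:
    """Settings agreed on before deployment (all values are placeholders)."""
    database_url: str = "postgresql://db.example.internal/features"
    predict_path: str = "/predict"
    input_fields: tuple = ("age", "income")


def handle_request(config, path, body, model):
    """Validate one request and return (status_code, response_dict).

    `model` is any callable that takes a list of feature values.
    """
    if path != config.predict_path:
        return 404, {"error": "unknown endpoint"}
    try:
        payload = json.loads(body)
    except json.JSONDecodeError:
        return 400, {"error": "body is not valid JSON"}
    if not isinstance(payload, dict):
        return 400, {"error": "body must be a JSON object"}
    missing = [f for f in config.input_fields if f not in payload]
    if missing:
        return 400, {"error": "missing fields", "fields": missing}
    # Feature order always follows the configuration, not the request.
    features = [payload[f] for f in config.input_fields]
    return 200, {"prediction": model(features)}
```

The function is deliberately independent of any web framework, so the
same validation can sit behind a REST or gRPC server.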

ML Code: Now we come to the actual coding part. In this stage, we
develop a base model, which can learn from the data and predict.
There are tons of ML libraries out there with support for multiple
languages, for example TensorFlow, PyTorch, scikit-learn, Keras,
fastai and many more. Once we have a model, we start improving its
performance by tweaking the hyper-parameters and testing different
learning approaches until we are satisfied that the model performs
better than its previous version.
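
The loop of building a base model and then tweaking hyper-parameters
can be sketched without any library. This is a toy example under
assumptions: the model is a one-feature linear regression trained by
gradient descent, the only hyper-parameter tried is the learning rate,
and the function names and candidate values are made up for this
example.

```python
def train(xs, ys, learning_rate, epochs=200):
    """Fit y = w * x + b by batch gradient descent on mean squared error."""
    w, b, n = 0.0, 0.0, len(xs)
    for _ in range(epochs):
        errors = [w * x + b - y for x, y in zip(xs, ys)]
        w -= learning_rate * 2 / n * sum(e * x for e, x in zip(errors, xs))
        b -= learning_rate * 2 / n * sum(errors)
    return w, b


def mse(model, xs, ys):
    """Mean squared error of a (w, b) model on the given data."""
    w, b = model
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def tune(train_set, valid_set, learning_rates):
    """Train once per learning rate and keep the model with the lowest
    validation error. Returns (score, learning_rate, model)."""
    best = None
    for rate in learning_rates:
        model = train(*train_set, learning_rate=rate)
        score = mse(model, *valid_set)
        if best is None or score < best[0]:
            best = (score, rate, model)
    return best


# Toy data generated from y = 2x + 1; the rates are small enough that
# gradient descent stays stable on this data.
train_set = ([0, 1, 2, 3, 4], [1, 3, 5, 7, 9])
valid_set = ([5, 6], [11, 13])
score, rate, (w, b) = tune(train_set, valid_set, [0.001, 0.01, 0.05])
```

Scoring each candidate on a separate validation set, and not on the
training data, is what lets us say that one version of the model is
better than the previous one.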
