How To Use Apache Ignite For Machine Learning

This article explains how to use Apache Ignite, an in-memory database with a machine learning framework, for machine learning tasks like linear regression. It highlights the advantages of having ETL processes and machine learning on the same system to improve efficiency and scalability. A simple Java code example demonstrates how to implement linear regression using Ignite, although the documentation is noted to be lacking in detail.

Uploaded by

Srinath Pitta

HOW TO USE APACHE IGNITE FOR MACHINE LEARNING

In this article, we show how to do machine learning with Apache Ignite.

What is Apache Ignite?

Apache Ignite is an in-memory database that includes a machine learning framework. (If you wonder
why it has an ML framework, consider that Apache Spark has one too, probably for the same
reason.)
Ignite is written for Java programmers. That means you can also use it from Scala, since Scala
runs on the Java Virtual Machine. Ignite also offers connectors for other languages, including:

Python
C++
C#

Why does the Ignite database have an ML framework?

The Apache Ignite project argues that ETL (extract, transform, and load) and machine learning
should take place on the same system. They point out that machine learning pipelines often involve
more than one person and more than one system, so the data scientist might be waiting around for
the data engineers to provide data. It would be better to put it all onto one system, they say. That
would reduce the number of times you convert the data from one format to another in order to do
machine learning.
They also make the very valid point that certain other ML frameworks, like scikit-learn, don't scale
out. That matters because your data could be too big to fit into the memory of one machine. For
example, Python pandas and NumPy data structures live in the memory of a single process and are
not distributed across a cluster on their own.
So, what do you do when your data is too big? In that case you would probably have to use
something that does run on a cluster, like Spark ML or TensorFlow. That said, Ignite works with
Spark, TensorFlow, Hadoop, Kafka, and other systems too.

Simple example: Linear Regression

Here, we show a simple example of how to use Apache Ignite to do linear regression. The idea is to
show how Ignite runs in place, not so much to explain linear regression. What is quite remarkable is
that you don't even need to install Apache Ignite to run this example, and you don't need to start a
server yourself. The Java code does that when it calls Ignition.start().
Instead, you just download the framework using a Maven pom.xml file. Then, when you run the Java
code that does the regression analysis, Ignite launches an instance of itself.
You can download the pom.xml here and code here.
The Java code is pretty simple, as the data is mocked up.
However, I can say that the Ignite documentation is not very thorough, which makes things less than
simple. For documentation, they provide only Javadocs and sample code. I would welcome a much
more detailed user guide. For example, it's not clear what the third argument in this call does:

trainer.fit(ignite, dataCache, vectorizer);

Here is the full code. Below, I explain certain sections.

package com.bmc.ml;

import java.io.IOException;
import java.util.UUID;

import org.apache.ignite.Ignite;
import org.apache.ignite.IgniteCache;
import org.apache.ignite.Ignition;
import org.apache.ignite.cache.affinity.rendezvous.RendezvousAffinityFunction;
import org.apache.ignite.configuration.CacheConfiguration;
import org.apache.ignite.configuration.IgniteConfiguration;
import org.apache.ignite.ml.dataset.feature.extractor.Vectorizer;
import org.apache.ignite.ml.dataset.feature.extractor.impl.DummyVectorizer;
import org.apache.ignite.ml.math.primitives.vector.Vector;
import org.apache.ignite.ml.math.primitives.vector.VectorUtils;
import org.apache.ignite.ml.regressions.linear.LinearRegressionLSQRTrainer;
import org.apache.ignite.ml.regressions.linear.LinearRegressionModel;
import org.apache.ignite.ml.selection.scoring.evaluator.Evaluator;
import org.apache.ignite.ml.selection.scoring.metric.MetricName;

public class LRExample {

    public static void main(String[] args) throws IOException {

        System.out.println();
        System.out.println(">>> Linear regression model over cache based dataset usage example started.");

        // Start the Ignite grid inside this JVM.
        IgniteConfiguration igniteCfg = new IgniteConfiguration();
        igniteCfg.setWorkDirectory("/Users/walkerrowe/Downloads");
        Ignite ignite = Ignition.start(igniteCfg);

        System.out.println(">>> Ignite grid started.");

        IgniteCache<Integer, Vector> dataCache = getCache(ignite);

        try {
            // dataCache = new SandboxMLCache(ignite).fillCacheWith();

            System.out.println(">>> Create new linear regression trainer object.");
            LinearRegressionLSQRTrainer trainer = new LinearRegressionLSQRTrainer();

            // Mocked-up data: one vector per cache entry.
            dataCache.put(1, VectorUtils.of(1, 1.8));
            dataCache.put(2, VectorUtils.of(2, 4.3));
            dataCache.put(3, VectorUtils.of(3, 6.2));
            dataCache.put(4, VectorUtils.of(4, 5));
            dataCache.put(5, VectorUtils.of(5, 11));
            dataCache.put(6, VectorUtils.of(6, 11));
            dataCache.put(7, VectorUtils.of(7, 15));

            // Treat the first coordinate of each vector as the label.
            Vectorizer<Integer, Vector, Integer, Double> vectorizer =
                new DummyVectorizer<Integer>().labeled(Vectorizer.LabelCoordinate.FIRST);

            System.out.println(">>> Perform the training to get the model.");
            LinearRegressionModel mdl = trainer.fit(ignite, dataCache, vectorizer);

            // Score the model on the same data it was trained on.
            double rmse = Evaluator.evaluate(dataCache, mdl, vectorizer, MetricName.RMSE);

            System.out.println("rmse = " + rmse);
            System.out.println("intercept = " + mdl.getIntercept());
            System.out.println("Weights = ");

            Vector weights = mdl.getWeights();
            for (double v : weights.asArray()) {
                System.out.println(v);
            }

            System.out.println("==================");
        } finally {
            if (dataCache != null)
                dataCache.destroy();
            // Stop the embedded node so the JVM can exit.
            ignite.close();
        }
    }

    private static IgniteCache<Integer, Vector> getCache(Ignite ignite) {
        CacheConfiguration<Integer, Vector> cacheConfiguration = new CacheConfiguration<>();
        cacheConfiguration.setName("ML_EXAMPLE_" + UUID.randomUUID());
        cacheConfiguration.setAffinity(new RendezvousAffinityFunction(false, 10));

        return ignite.createCache(cacheConfiguration);
    }
}

Here are the results:

rmse = 0.5963051208050183
intercept = 0.5762626813570405
Weights = 0.4413657685174977
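As a sanity check on these numbers, the following plain-Java sketch plugs the reported intercept and weight into the linear formula and recomputes the RMSE over the seven mocked-up rows. It does not use Ignite at all; the class name ReportedModelCheck and its methods are hypothetical, and it assumes that the first value of each row is the label and the second is the feature, as the LabelCoordinate.FIRST setting in the listing suggests.

```java
// Plain-Java sketch, no Ignite involved. Names here are hypothetical.
final class ReportedModelCheck {

    /** Linear model with a single feature: prediction = intercept + weight * feature. */
    static double predict(double intercept, double weight, double feature) {
        return intercept + weight * feature;
    }

    /** Root mean squared error of the model over the given rows. */
    static double rmse(double intercept, double weight, double[] features, double[] labels) {
        double sumSquaredError = 0.0;
        for (int i = 0; i < features.length; i++) {
            double error = predict(intercept, weight, features[i]) - labels[i];
            sumSquaredError += error * error;
        }
        return Math.sqrt(sumSquaredError / features.length);
    }

    public static void main(String[] args) {
        // Assumption: first coordinate of each cached vector is the label,
        // second coordinate is the feature.
        double[] labels = {1, 2, 3, 4, 5, 6, 7};
        double[] features = {1.8, 4.3, 6.2, 5, 11, 11, 15};

        // Values copied from the results printed by the article's run.
        double intercept = 0.5762626813570405;
        double weight = 0.4413657685174977;

        System.out.println("rmse = " + rmse(intercept, weight, features, labels));
    }
}
```

The value printed should agree with the reported rmse of roughly 0.596, which suggests the metric was computed on the training rows themselves.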

When you start the code, it starts the server, as you can see:

__________ ________________
/ _/ ___/ |/ / _/_ __/ __/
_/ // (7 7 // / / / / _/
/___/\___/_/|_/___/ /_/ /___/

Ignite documentation: http://ignite.apache.org

Quiet mode.
^-- Logging by 'JavaLogger '
^-- To see **FULL** console log here add -DIGNITE_QUIET=false or "-v" to
ignite.{sh|bat}

OS: Mac OS X 10.15.5 x86_64

The code, explained

Create the configuration with the no-argument IgniteConfiguration() constructor, give it a working
directory, and start Ignite:

IgniteConfiguration igniteCfg = new IgniteConfiguration();

igniteCfg.setWorkDirectory("/Users/walkerrowe/Downloads");
Ignite ignite = Ignition.start(igniteCfg);

This creates a vector and writes it to the Ignite database (which Ignite calls a cache):

dataCache.put(1,VectorUtils.of(1,1.8));
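To make the row layout concrete, here is a minimal plain-Java sketch of the "label first" convention that the example selects with Vectorizer.LabelCoordinate.FIRST. It is only an illustration and not Ignite code: the class LabelFirstRow and its methods are hypothetical names, and it assumes the first coordinate is the label and everything after it is a feature.

```java
import java.util.Arrays;

// Hypothetical helper, not part of Ignite: mimics a "label first" row layout.
final class LabelFirstRow {

    /** Returns the label, assumed to be stored in the first coordinate. */
    static double label(double[] row) {
        return row[0];
    }

    /** Returns the remaining coordinates, which are treated as features. */
    static double[] features(double[] row) {
        return Arrays.copyOfRange(row, 1, row.length);
    }

    public static void main(String[] args) {
        double[] row = {1, 1.8}; // same values as VectorUtils.of(1, 1.8)
        System.out.println("label = " + label(row));
        System.out.println("features = " + Arrays.toString(features(row)));
    }
}
```

Under that assumption, the row (1, 1.8) contributes the label 1 and the single feature 1.8 to the training set.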

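Here you pass the data cache to the linear regression fit() method, to calculate the weights and
intercept. Judging from the sample code, the third argument, the vectorizer, tells the trainer how to
turn each cache entry into features and a label; with LabelCoordinate.FIRST, the first coordinate of
each vector is the label.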

LinearRegressionModel mdl = trainer.fit(ignite, dataCache, vectorizer);
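For intuition about what the fit produces, the sketch below computes an ordinary least-squares line for the same seven rows in plain Java. It is not how Ignite works internally: the trainer's name suggests it uses the iterative LSQR method on distributed data, whereas this sketch swaps in the textbook closed-form formulas for a single feature. The class SimpleLeastSquares is a hypothetical name, and the split into label (first value) and feature (second value) is an assumption based on LabelCoordinate.FIRST.

```java
// Plain-Java sketch, no Ignite involved: closed-form least squares for one feature.
final class SimpleLeastSquares {

    /** Returns {intercept, slope} minimizing the squared error of y ~ intercept + slope * x. */
    static double[] fit(double[] x, double[] y) {
        int n = x.length;

        double meanX = 0.0;
        double meanY = 0.0;
        for (int i = 0; i < n; i++) {
            meanX += x[i];
            meanY += y[i];
        }
        meanX /= n;
        meanY /= n;

        double sxy = 0.0; // sum of (x - meanX) * (y - meanY)
        double sxx = 0.0; // sum of (x - meanX)^2
        for (int i = 0; i < n; i++) {
            sxy += (x[i] - meanX) * (y[i] - meanY);
            sxx += (x[i] - meanX) * (x[i] - meanX);
        }

        double slope = sxy / sxx;
        double intercept = meanY - slope * meanX;
        return new double[] {intercept, slope};
    }

    public static void main(String[] args) {
        // Assumption: first coordinate of each cached vector is the label,
        // second coordinate is the feature.
        double[] labels = {1, 2, 3, 4, 5, 6, 7};
        double[] features = {1.8, 4.3, 6.2, 5, 11, 11, 15};

        double[] model = fit(features, labels);
        System.out.println("intercept = " + model[0]);
        System.out.println("weight = " + model[1]);
    }
}
```

On this data the closed-form result comes out at an intercept of about 0.576 and a weight of about 0.441, in line with the results shown earlier.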

Additional resources
For more on this topic, check out our BMC Machine Learning & Big Data Blog or browse these
articles:

Apache Spark Guide, with 15+ articles

Hadoop Guide, with 20+ articles
Machine Learning with TensorFlow and Keras
Enabling the Citizen Data Scientists

