Data and Analytics for IoT
MODULE 4
As more and more devices are added to IoT networks, the data generated by these systems becomes overwhelming.
Traditional data management systems are simply unprepared for the demands of what has come to be known as “big data.”
The real value of IoT is not just in connecting things but
rather in the data produced by those things, the new services you
can enable via those connected things, and the business
insights that the data can reveal.
However, to be useful, the data needs to be handled in a
way that is organized and controlled.
Thus, a new approach to data analytics is needed for IoT.
An Introduction to Data Analytics for IoT
In the world of IoT, the creation of massive amounts of data from sensors is common and one of the biggest challenges, not only from a transport perspective but also from a data management standpoint.
Modern jet engines are fitted with thousands of sensors that generate a whopping 10 GB of data per second.
Analyzing this amount of data in the most efficient manner possible falls under the umbrella of data analytics.
Not all data is the same; it can be categorized and thus
analyzed in different ways.
Depending on how data is categorized, various data analytics
tools and processing methods can be applied.
Two important categorizations from an IoT
perspective are whether the data is structured or unstructured
and whether it is in motion or at rest.
Structured Versus Unstructured Data
Structured data and unstructured data are important classifications, as they typically require different toolsets from a data analytics perspective.
Structured data means that the data follows a model or
schema that defines how the data is represented or organized,
meaning it fits well with a traditional relational database
management system (RDBMS).
In many cases you will find structured data in a simple tabular form, for example, a spreadsheet where data occupies a specific cell and can be explicitly defined and referenced.
Structured data can be found in most computing systems and includes everything from banking transactions and invoices to computer log files and router configurations.
IoT sensor data often uses structured values, such as
temperature, pressure, humidity, and so on, which are
all sent in a known format.
Structured data is easily formatted, stored, queried, and processed.
Because of the highly organized format of structured data, a wide array of data analytics tools are readily available for processing this type of data, from custom scripts to commercial software like Microsoft Excel and Tableau.
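To make this concrete, here is a minimal Python sketch that stores a few sensor readings in an SQLite table and queries them; the table layout and values are invented for illustration, but they show how naturally structured IoT data maps onto an RDBMS.

    import sqlite3

    # In-memory database for the sketch; a real deployment would use a
    # persistent RDBMS.
    conn = sqlite3.connect(":memory:")
    conn.execute("""CREATE TABLE readings (
        sensor_id TEXT,     -- which smart object sent the value
        ts        INTEGER,  -- Unix timestamp of the reading
        temp_c    REAL      -- temperature in degrees Celsius
    )""")

    rows = [("engine-1", 1700000000, 88.5),
            ("engine-1", 1700000060, 89.1),
            ("engine-2", 1700000000, 72.3)]
    conn.executemany("INSERT INTO readings VALUES (?, ?, ?)", rows)

    # Because the schema is fixed, queries are simple and explicit.
    for sensor, avg in conn.execute(
            "SELECT sensor_id, AVG(temp_c) FROM readings GROUP BY sensor_id"):
        print(sensor, round(avg, 2))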
Unstructured data lacks a logical schema for understanding and decoding the data through traditional programming means.
Examples of this data type include text, speech, images,
and video.
As a general rule, any data that does not fit neatly into a predefined data model is classified as unstructured data.
According to some estimates, around 80% of a business’s
data is unstructured.
Because of this fact, data analytics methods that can be
applied to unstructured data, such as cognitive
computing and machine learning, are deservedly garnering
a lot of attention.
With machine learning applications, such as natural
language processing (NLP), you can decode
speech.
With image/facial recognition applications, you can extract critical information from still images and video.
Smart objects in IoT networks generate both
structured and unstructured data.
Structured data is more easily managed and processed due
to its well-defined organization.
On the other hand, unstructured data can be harder to deal with and typically requires very different analytics tools for processing the data.
Data in Motion Versus Data at Rest
Data in IoT networks is either in transit (“data in motion”)
or being held or stored (“data at rest”).
Examples of data in motion include traditional client/server exchanges, such as web browsing, file transfers, and email.
Data saved to a hard drive, storage array, or USB drive is
data at rest.
From an IoT perspective, the data from smart objects is
considered data in motion as it passes through the network en
route to its final destination.
This is often processed at the edge, using fog computing.
When data is processed at the edge, it may be filtered and deleted
or forwarded on for further processing and possible storage at a
fog node or in the data center.
Data does not come to rest at the edge.
When data arrives at the data center, it is possible to process it in real time, just as at the edge, while it is still in motion.
Tools with this sort of capability include Spark, Storm, and Flink.
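As a sketch of the edge filtering described above (a plain Python generator, not a real Spark/Storm/Flink job), the following code decides, reading by reading, whether data in motion is dropped at the edge or forwarded on; the threshold and record format are assumptions.

    # Minimal edge-filtering sketch: drop routine readings, forward anomalies.
    # The 90 C threshold and the record layout are illustrative assumptions.
    THRESHOLD_C = 90.0

    def edge_filter(stream):
        """Yield only readings worth forwarding to a fog node or data center."""
        for reading in stream:
            if reading["temp_c"] >= THRESHOLD_C:
                yield reading  # forward for further processing/storage
            # otherwise the reading is filtered (deleted) at the edge

    sensor_stream = [{"sensor_id": "engine-1", "temp_c": 88.0},
                     {"sensor_id": "engine-1", "temp_c": 93.2}]
    for r in edge_filter(sensor_stream):
        print("forwarding:", r)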
Data at rest in IoT networks can typically be found in IoT brokers or in some sort of storage array at the data center.
Hadoop not only helps with data processing but also with data storage.
IoT Data Analytics Overview
The true importance of IoT data from smart objects
is realized only when the analysis of the data leads to
actionable business intelligence and insights.
Data analysis is typically broken down by the types of results that are produced.
Types of Data Analysis Results
There are four types of data analysis results:
Descriptive:
Descriptive data analysis tells you what is happening,
either now or in the past.
For example, a thermometer in a truck engine
reports temperature values every second.
From a descriptive analysis perspective, you can pull this data at
any moment to gain insight into the current operating
condition of the truck engine.
If the temperature value is too high, then there may
be a cooling problem or the engine may be experiencing
too much load.
Diagnostic:
When you are interested in the “why,” diagnostic data
analysis
can provide the answer.
Continuing with the example of the temperature sensor in the
truck engine, you might wonder why the truck engine
failed.
Diagnostic analysis might show that the temperature
of the engine was too high, and the engine
overheated.
Applying diagnostic analysis across the data generated by a wide range of smart objects can provide a clear picture of why a problem or an event occurred.
Predictive:
Predictive analysis aims to foretell problems or
issues
before they occur.
For example, with historical values of temperatures for the
truck engine, predictive analysis could provide an
estimate on the remaining life of certain components
in the engine.
These components could then be proactively replaced before
failure occurs.
Or perhaps if temperature values of the truck engine start to
rise slowly over time, this could indicate the need for an oil
change or some other sort of engine cooling
maintenance.
Prescriptive:
Prescriptive analysis goes a step beyond predictive and
recommends
solutions for upcoming problems.
A prescriptive analysis of the temperature data from a truck engine might calculate various alternatives to cost-effectively maintain the truck.
These calculations could range from the cost necessary for more frequent
oil
changes and cooling maintenance to installing new cooling equipment on the
engine or upgrading to a lease on a model with a more powerful
engine.
Prescriptive analysis looks at a variety of factors and makes the appropriate recommendation.
Both predictive and prescriptive analyses are more resource intensive and increase complexity, but the value they provide is much greater than the value from descriptive and diagnostic analysis.
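To make the distinction between these result types concrete, here is a minimal Python sketch that performs a descriptive analysis (current and average temperature) and a naive predictive analysis (a straight-line trend extrapolated to a failure threshold) on the truck-engine example; the readings and the 105 C threshold are invented for illustration.

    # Invented readings, one sample per minute (illustration only).
    temps = [88.0, 88.4, 89.1, 89.9, 90.6, 91.5]

    # Descriptive: what is happening now (or happened in the past).
    print("current temp:", temps[-1])
    print("average temp:", round(sum(temps) / len(temps), 1))

    # Predictive (naive): fit a straight line through the readings and
    # extrapolate to a hypothetical 105 C failure threshold.
    n = len(temps)
    x_mean = (n - 1) / 2
    y_mean = sum(temps) / n
    slope = (sum((x - x_mean) * (y - y_mean) for x, y in enumerate(temps))
             / sum((x - x_mean) ** 2 for x in range(n)))
    minutes_left = (105.0 - temps[-1]) / slope
    print("estimated minutes until 105 C:", round(minutes_left))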
IoT Data Analytics Challenges
Problems with using an RDBMS in IoT:
1. Scaling problems (performance issues, costly to resolve, require more hardware, architecture changes)
2. Volatility of data (changes in schema)
Machine Learning
ML is central to IoT.
Data collected by smart objects needs to be analyzed, and
intelligent actions need to be taken based on these
analyses.
Performing this kind of operation manually is almost impossible
(or very, very slow and inefficient).
Machines are needed to process information quickly and react instantly when thresholds are met.
Examples include advances in self-driving vehicles, abnormal pattern recognition in a crowd, and automated intelligent and machine-assisted decision systems.
Machine Learning Overview
Machine learning is, in fact, part of a larger set of technologies
commonly grouped under the term artificial intelligence
(AI).
AI includes any technology that allows a computing system to
mimic human intelligence using any technique, from
very advanced logic to basic “if-then-else” decision loops.
Any computer that uses rules to make decisions belongs to this group.
A simple example is an app that can help you
find your parked car.
A GPS reading of your position at regular intervals calculates
your speed.
A basic threshold system determines whether you are driving (for example, “if speed > 20 mph or 30 km/h, then the user is driving”).
When you park and disconnect from the car
Bluetooth system, the app simply records the location
when the disconnection happens.
This is where your car is parked.
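A minimal sketch of this rule-based (non-ML) logic follows; the callback names, the threshold, and the coordinates are hypothetical, and a real app would receive GPS and Bluetooth events from the phone's APIs.

    # Simple if-then-else rules: no learning involved.
    DRIVING_SPEED_KMH = 30  # threshold from the example above

    parked_location = None
    driving = False

    def on_gps_update(speed_kmh):
        """Hypothetical callback fired for each periodic GPS reading."""
        global driving
        if speed_kmh > DRIVING_SPEED_KMH:
            driving = True  # speed above threshold: the user is in the car

    def on_bluetooth_disconnect(current_location):
        """Hypothetical callback: the car's Bluetooth dropped, so we just parked."""
        global parked_location, driving
        if driving:
            parked_location = current_location
            driving = False

    on_gps_update(55)
    on_bluetooth_disconnect((48.8566, 2.3522))  # made-up coordinates
    print("your car is parked at:", parked_location)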
In more complex cases, static rules cannot simply be inserted into the program because they require parameters that can change or that are imperfectly understood.
A typical example is a dictation program that runs on a
computer.
The program is configured to recognize the audio pattern of each word in a dictionary, but it does not know your voice’s specifics: your accent, tone, speed, and so on.
You need to record a set of predetermined sentences to
help the tool match well-known words to the sounds
you make when you say the words.
This process is called machine learning.
ML is concerned with any process where the
computer needs to receive a set of data that is
processed to help perform a task with more
efficiency.
ML is a vast field but can be simply divided into two main categories: supervised and unsupervised learning.
Supervised Learning
In supervised learning, the machine is trained with input for
which there is a known correct answer.
For example, suppose that you are training a system to recognize
when there is a human in a mine tunnel.
A sensor equipped with a basic camera can capture shapes
and return them to a computing system that is responsible
for determining whether the shape is a human or
something else (such as a vehicle, a pile of ore, a rock, a piece of wood, and so on).
With supervised learning techniques, hundreds or thousands of images are fed into the machine, and each image is labeled (human or nonhuman in this case).
This is called the training set.
An algorithm is used to determine common parameters
and common differences between the images.
The comparison is usually done at the scale of the entire
image, or pixel by pixel.
Images are resized to have the same characteristics
(resolution, color depth, position of the central figure, and
so on), and each point is analyzed.
Each new image is compared to the set of known “good images,” and a deviation is calculated to determine how different the new image is from the average human image and, therefore, the probability that what is shown is a human figure. This process is called classification.
After training, the machine should be able to recognize human shapes.
Before real field deployments, the machine is usually tested with unlabeled pictures (this is called the validation or the test set, depending on the ML system used) to verify that the recognition level is at acceptable thresholds. If the machine does not reach the expected level of success, more training is needed.
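A minimal supervised-classification sketch follows, using scikit-learn; the two-number feature vectors stand in for real image features, and the values, labels, and classifier choice (LogisticRegression) are all assumptions for illustration.

    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import train_test_split

    # Made-up two-number stand-ins for image features (e.g., shape height
    # and width after resizing); a real system would work at pixel scale.
    X = [[1.8, 0.4], [1.7, 0.5], [1.6, 0.4],   # human-like shapes
         [0.9, 2.0], [1.0, 2.2], [0.8, 1.9]]   # vehicle-like shapes
    y = [1, 1, 1, 0, 0, 0]                     # labels: 1 = human, 0 = other

    # Hold out part of the labeled data to validate the trained model.
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.33, stratify=y, random_state=0)

    clf = LogisticRegression().fit(X_train, y_train)
    print("validation accuracy:", clf.score(X_test, y_test))
    print("new shape is human?", bool(clf.predict([[1.75, 0.45]])[0]))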
In other cases, the learning process is not about classifying in two
or more categories but about finding a correct value.
For example, the speed of the flow of oil in a pipe is a
function of the size of the pipe, the viscosity of the oil, pressure, and a
few other factors.
When you train the machine with measured values, the machine
can predict the speed of the flow for a new, and unmeasured,
viscosity.
This process is called regression; regression predicts numeric values, whereas classification predicts categories.
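A matching regression sketch, again with scikit-learn; the pipe measurements and the choice of a plain linear model are invented for illustration.

    from sklearn.linear_model import LinearRegression

    # Invented training measurements: [pipe diameter (cm), viscosity (cP),
    # pressure (bar)] -> measured flow speed (m/s).
    X = [[10, 50, 2.0], [10, 80, 2.0], [15, 50, 2.5], [15, 80, 2.5]]
    y = [1.9, 1.4, 2.6, 2.1]

    model = LinearRegression().fit(X, y)

    # Predict the flow speed for a new, unmeasured viscosity (65 cP).
    print("predicted flow speed:", round(model.predict([[12, 65, 2.2]])[0], 2))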
Unsupervised Learning
In some cases, supervised learning is not the best method for a
machine to help with a human decision.
Suppose that you are processing IoT data from a
factory
manufacturing small engines.
You know that about 0.1% of the produced engines on average
need adjustments to prevent later defects, and your task is to
identify them before they get mounted into machines and shipped
away from the factory.
With hundreds of parts, it may be very difficult to detect the potential defects, and it is almost impossible to train a machine to recognize issues that may not be visible.
However, you can test each engine and record multiple
parameters, such as sound, pressure, temperature of key
parts, and so on.
Once data is recorded, you can graph these elements in relation to one another (for example, temperature as a function of pressure, or sound versus rotating speed over time).
You can then input this data into a computer and use
mathematical functions to find groups.
For example, you may decide to group the engines by the
sound they make at a given temperature.
A standard function to operate this grouping, K-means clustering,
finds the mean values for a group of engines (for example,
mean value for temperature, mean frequency for sound).
Grouping the engines this way can quickly reveal several types of
engines that all belong to the same category (for example, small
engine of chainsaw type, medium engine of lawnmower type).
All engines of the same type produce sounds and temperatures in
the same range as the other members of the same group.
There will occasionally be an engine in the group that
displays unusual characteristics (slightly out of
expected temperature or sound range).
This is the engine that you send for manual evaluation.
The computing process associated with this determination is
called unsupervised learning.
This type of learning is unsupervised because there is not a
“good” or “bad” answer known in advance.
It is the variation from a group behavior that allows the computer to learn that something is different.
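To close, here is a minimal unsupervised sketch of the engine-grouping idea, using scikit-learn's K-means; the engine measurements, the two-cluster choice, and the outlier rule (distance more than three standard deviations above the mean) are all invented for illustration.

    import numpy as np
    from sklearn.cluster import KMeans

    # Invented engine test data: [mean temperature (C), sound frequency (Hz)].
    rng = np.random.default_rng(0)
    small = rng.normal([60, 300], [2, 10], size=(50, 2))    # chainsaw-type
    medium = rng.normal([80, 150], [2, 10], size=(50, 2))   # lawnmower-type
    unusual = np.array([[80.0, 230.0]])                     # one odd engine
    engines = np.vstack([small, medium, unusual])

    # No labels are given: K-means finds the natural groups by itself.
    km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(engines)

    # Flag engines unusually far from their group's mean for manual checks.
    centers = km.cluster_centers_[km.labels_]
    dist = np.linalg.norm(engines - centers, axis=1)
    threshold = dist.mean() + 3 * dist.std()
    print("engines to inspect:", np.where(dist > threshold)[0])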