7-Data and Analytics For IoT

The document discusses data analytics for the Internet of Things (IoT), highlighting the challenges of managing massive data generated by sensors. It covers various analytics types, tools like Hadoop and Apache Spark, and the importance of edge and network analytics for real-time processing and insights. Key concepts include structured vs. unstructured data, data in motion vs. data at rest, and the benefits of edge streaming analytics.

Uploaded by

studytutor2022

7 Data and Analytics for IoT

Yuemin Ding
Tecnun School of Engineering
University of Navarra

Outlines
• An Introduction to Data Analytics for IoT
• Big Data Analytics Tools and Technology
• Edge Streaming Analytics
• Network Analytics

Data Analytics for IoT
• In IoT, the massive amount of data created by
sensors is one of the biggest challenges.
▪ Modern jet engines are fitted with thousands of sensors
that generate 10 GB of data per second.
▪ A twin-engine commercial aircraft with these engines
operating on average 8 hours a day will generate over
500 TB of data daily.
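
The daily figure can be checked with simple arithmetic. The sketch below assumes the 10 GB per second rate applies to each engine and uses decimal units (1 TB = 1000 GB); the names are illustrative only.

```python
# Back-of-the-envelope check of the aircraft data volume quoted above.
# Assumptions: 10 GB/s is per engine, and 1 TB = 1000 GB (decimal units).
GB_PER_SECOND_PER_ENGINE = 10
ENGINES = 2
HOURS_PER_DAY = 8


def daily_volume_tb(gb_per_s, engines, hours):
    """Return the data generated per day, in terabytes."""
    seconds = hours * 3600
    total_gb = gb_per_s * engines * seconds
    return total_gb / 1000


daily_tb = daily_volume_tb(GB_PER_SECOND_PER_ENGINE, ENGINES, HOURS_PER_DAY)
print(f"{daily_tb:.0f} TB per day")  # 576 TB per day
```

Under these assumptions the result is 576 TB, consistent with "over 500 TB of data daily".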

Structured vs. Unstructured Data
• Structured data means that the data follows a
model or schema that defines how the data is
represented or organized.
• Unstructured data lacks a logical schema for
understanding and decoding the data through
traditional programming means.

Structured vs. Unstructured Data
• Structured data and unstructured data require
different toolsets for analysis.
• Around 80% of a business’s data is unstructured.
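
A minimal sketch of why the two kinds of data need different toolsets. The `SensorReading` schema and the free-text maintenance note are invented for illustration; a regular expression stands in for the much heavier text-analysis tooling that unstructured data usually requires.

```python
import re
from dataclasses import dataclass


@dataclass
class SensorReading:
    """Hypothetical schema: every record has the same named, typed fields."""
    sensor_id: str
    temperature_c: float


# Structured: the schema makes a query a simple field comparison.
readings = [SensorReading("t-1", 71.5), SensorReading("t-2", 93.0)]
hot = [r.sensor_id for r in readings if r.temperature_c > 90]

# Unstructured: free text has no schema, so the value must first be
# extracted, here with a pattern that looks for a number followed by "C".
note = "Operator reported pump 7 running hot, roughly 93 C, around noon."
match = re.search(r"(\d+(?:\.\d+)?)\s*C\b", note)
extracted_temp = float(match.group(1)) if match else None

print(hot, extracted_temp)
```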

Data in Motion Versus Data at Rest
• Data in IoT networks is either in transit (“data in
motion”) or being held or stored (“data at rest”).
▪ Data in motion includes traditional client/server
exchanges, such as web browsing, file transfers, and
email.
▪ Data saved to a hard drive, storage array, or USB drive
is data at rest.

• Data in motion → real-time processing and
analysis → Spark, Storm, Flink, etc.
• Data at rest → huge volume → Hadoop.

IoT Data Analytics Overview
• Data analysis is typically broken down by the
types of results that are produced:
▪ Descriptive: tells what is happening, either now or in the
past.
▪ Diagnostic: provides the answer when you are
interested in the “why”.
▪ Predictive: foretells problems or issues before they occur.
▪ Prescriptive: recommends solutions for upcoming
problems.
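
The four result types can be illustrated on one small data set. This is a deliberately simplistic sketch: the temperature and load values, the 95 °C limit, and the 6-hour action threshold are all made up, and a straight-line fit stands in for real diagnostic and predictive models.

```python
from statistics import mean

# Hypothetical hourly engine temperatures (deg C) and engine load (%).
temps = [80.0, 82.0, 84.0, 86.0, 88.0]
loads = [50.0, 55.0, 60.0, 65.0, 70.0]
LIMIT_C = 95.0  # made-up alarm limit


def slope_intercept(xs, ys):
    """Ordinary least-squares fit of ys = slope * xs + intercept."""
    mx, my = mean(xs), mean(ys)
    num = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    den = sum((x - mx) ** 2 for x in xs)
    slope = num / den
    return slope, my - slope * mx


# Descriptive: what is happening now and in the past?
current, average = temps[-1], mean(temps)

# Diagnostic: why? Here, how strongly temperature moves with load.
temp_per_load, _ = slope_intercept(loads, temps)

# Predictive: when is the limit crossed if the trend continues?
hours = list(range(len(temps)))
rate, _ = slope_intercept(hours, temps)
hours_to_limit = (LIMIT_C - current) / rate if rate > 0 else None

# Prescriptive: recommend an action based on the prediction.
if hours_to_limit is not None and hours_to_limit < 6:
    advice = "reduce load and schedule inspection"
else:
    advice = "no action needed"

print(current, average, temp_per_load, hours_to_limit, advice)
```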

Outlines
• An Introduction to Data Analytics for IoT
• Big Data Analytics Tools and Technology
• Edge Streaming Analytics
• Network Analytics

Big Data Analytics Tools and Technology
• ‘Three Vs’ to categorize big data:
▪ Velocity refers to how quickly data is being collected
and analyzed.
▪ Variety refers to different types of data.
▪ Volume refers to the scale of the data.
• Over time, other Vs have been added to big data.

Massively Parallel Processing Databases
• Massively parallel processing (MPP) databases
are built on the concept of relational data
warehouses.
• MPP databases are designed to be much faster and
more efficient, and to support reduced query times.

Hadoop
• Hadoop was originally developed as a result of
projects at Google and Yahoo!
• The project had two key elements:
▪ Hadoop Distributed File System (HDFS): A system
for storing data across multiple nodes
▪ MapReduce: A distributed processing engine that splits
a large task into smaller ones that can be run in parallel
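
The split-and-merge idea behind MapReduce can be sketched in plain Python. This is not Hadoop's API: the sensor data is hypothetical, the chunks stand in for data blocks on different nodes, and a thread pool stands in for the cluster.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical sensor log split into chunks, as if stored on different nodes.
chunks = [
    [("s1", 20.5), ("s2", 30.1)],
    [("s1", 22.0), ("s2", 29.4)],
    [("s1", 21.2)],
]


def map_chunk(chunk):
    """Map step (with local combining): partial max per sensor for one chunk."""
    partial = {}
    for sensor, value in chunk:
        partial[sensor] = max(value, partial.get(sensor, value))
    return partial


def reduce_partials(partials):
    """Reduce step: merge the partial results into the final answer."""
    result = {}
    for partial in partials:
        for sensor, value in partial.items():
            result[sensor] = max(value, result.get(sensor, value))
    return result


# Each chunk is an independent small task, so the tasks can run in parallel.
with ThreadPoolExecutor() as pool:
    partials = list(pool.map(map_chunk, chunks))

max_per_sensor = reduce_partials(partials)
print(max_per_sensor)  # {'s1': 22.0, 's2': 30.1}
```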

Hadoop
• Hadoop takes advantage of a distributed
architecture to store and process massive
amounts of data and can leverage resources
from all nodes in the cluster.

Hadoop
• NameNodes: They coordinate where the data is stored
and maintain a map of where each block of data is stored
and where it is replicated.
• DataNodes: These are the servers where the data is
stored.
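
A toy model of the map a NameNode maintains. The node names and block ids are hypothetical, and the round-robin placement is an assumption chosen for simplicity; real HDFS placement also takes rack location into account. The replication factor of 3 matches the HDFS default.

```python
DATA_NODES = ["dn1", "dn2", "dn3", "dn4"]  # hypothetical server names
REPLICATION = 3                            # HDFS default replication factor


def place_blocks(block_ids, nodes, replication):
    """Return a NameNode-style map: block id -> DataNodes holding a copy."""
    block_map = {}
    for i, block in enumerate(block_ids):
        # Round-robin placement, for illustration only.
        block_map[block] = [nodes[(i + k) % len(nodes)] for k in range(replication)]
    return block_map


def blocks_on(node, block_map):
    """List the blocks that would need re-replication if this node failed."""
    return [block for block, holders in block_map.items() if node in holders]


block_map = place_blocks(["blk_0", "blk_1"], DATA_NODES, REPLICATION)
print(block_map)
print(blocks_on("dn4", block_map))
```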

Hadoop
• Drawback:
▪ MapReduce breaks down a query into smaller tasks,
which is useful for the analysis of historical data.
▪ Depending on how much data is being queried and the
complexity of the query, the result could take seconds
or minutes to return.
▪ If you have a real-time process running, MapReduce is
not the right data processing engine for that.

Apache Spark
• Apache Spark is an in-memory distributed data analytics
platform designed to accelerate processes in the Hadoop
ecosystem.
• At each stage of a MapReduce operation, the data is read
from and written back to disk → added latency and slower
processing
• With Spark, the processing of this data is moved into
high-speed memory → near-real-time processing of
events

[Figure: Hadoop vs. Spark. Source: analyticsvidhya.com]

Apache Spark
• Real-time processing is done by a component of the
Apache Spark project called Spark Streaming.
• Spark Streaming is responsible for taking live-streamed
data from a messaging system and dividing it into smaller
micro-batches.
• The Spark processing engine operates on these smaller
pieces of data, allowing rapid insights into the data and
subsequent actions.
• Similar platforms include Apache Storm and Flink.
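
The micro-batch idea can be sketched without Spark itself. The code below is plain Python, not the Spark Streaming API, and it batches by item count for determinism, whereas Spark Streaming groups data by a time interval. The readings are hypothetical.

```python
from itertools import islice


def micro_batches(stream, batch_size):
    """Yield successive lists of up to batch_size items from a stream."""
    iterator = iter(stream)
    while True:
        batch = list(islice(iterator, batch_size))
        if not batch:
            return
        yield batch


# Hypothetical live readings arriving one at a time.
live_readings = iter([21.0, 21.4, 35.9, 22.1, 21.8, 22.3, 22.0])

# The processing engine works on each small batch as soon as it is complete.
batch_means = [sum(b) / len(b) for b in micro_batches(live_readings, 3)]
print(batch_means)
```

Each batch is processed as a small, bounded job, which is why results arrive with a delay of roughly one batch rather than after the whole data set has been collected.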

[Figure omitted. Source: analyticsvidhya.com]

Apache Kafka
• Apache Kafka is a messaging system designed
to accept data, or messages, from where the
data is generated and deliver the data to
stream-processing engines such as Spark
Streaming or Storm.
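
A toy stand-in for this role, assuming an append-only log in which each consumer tracks its own read position. `TopicLog` and its methods are hypothetical names for illustration and are not the Kafka client API.

```python
class TopicLog:
    """Toy stand-in for a message topic: an append-only list with offsets."""

    def __init__(self):
        self._messages = []

    def publish(self, message):
        """Append a message and return its offset in the log."""
        self._messages.append(message)
        return len(self._messages) - 1

    def read_from(self, offset):
        """Return all messages at or after the given offset."""
        return self._messages[offset:]


topic = TopicLog()
for reading in ({"sensor": "s1", "temp": 20.5}, {"sensor": "s2", "temp": 30.1}):
    topic.publish(reading)

# Two independent consumers, each at its own position in the log.
for_stream_engine = topic.read_from(0)
for_archiver = topic.read_from(1)
print(len(for_stream_engine), len(for_archiver))
```

Decoupling producers from consumers this way lets sensors keep publishing even when a downstream engine is slow or temporarily offline.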

Lambda Architecture
• Ultimately, the key elements of IoT use cases
involve the collection, processing, and storage of
data using multiple technologies.
• Querying both data in motion (streaming) and
data at rest (batch processing) requires a
combination of different projects.
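
A minimal sketch of how the two kinds of query can be combined. The slide does not name the layers; the batch, speed, and serving layers used here follow the common description of the Lambda Architecture. The alarm events are hypothetical.

```python
from collections import Counter

# Hypothetical alarm events: (sensor_id, alarm_count).
master_dataset = [("s1", 1), ("s2", 1), ("s1", 1)]  # data at rest
recent_events = [("s1", 1), ("s3", 1)]              # data in motion


def batch_view(events):
    """Batch layer: recompute totals from the full stored data set."""
    view = Counter()
    for sensor, n in events:
        view[sensor] += n
    return view


class SpeedView:
    """Speed layer: update totals incrementally as each event arrives."""

    def __init__(self):
        self.view = Counter()

    def on_event(self, sensor, n):
        self.view[sensor] += n


def query(sensor, batch, speed):
    """Serving layer: merge both views to answer a query."""
    return batch[sensor] + speed.view[sensor]


batch = batch_view(master_dataset)
speed = SpeedView()
for sensor, n in recent_events:
    speed.on_event(sensor, n)

print(query("s1", batch, speed), query("s3", batch, speed))  # 3 1
```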

Outlines
• An Introduction to Data Analytics for IoT
• Big Data Analytics Tools and Technology
• Edge Streaming Analytics
• Network Analytics

Edge Streaming Analytics
• Key values of edge streaming analytics
▪ Reducing data at the edge → Passing all the
IoT data to the cloud is inefficient and is
expensive in terms of bandwidth and network
infrastructure
▪ Analysis and response at the edge →
Some data is useful only at the edge (such as
the control within a local factory)
▪ Time sensitivity → Edge analytics allows
timely analysis and immediate responses to
changing conditions
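
One simple way to reduce data at the edge is to forward a reading only when it has changed noticeably, sometimes called a deadband or report-by-exception filter. This is one possible technique, not one the slides prescribe; the readings and the 1.0 threshold are hypothetical.

```python
def deadband_filter(readings, threshold):
    """Keep a reading only if it differs from the last kept one by > threshold."""
    kept = []
    for value in readings:
        if not kept or abs(value - kept[-1]) > threshold:
            kept.append(value)
    return kept


raw = [20.0, 20.1, 20.1, 20.2, 23.5, 23.6, 23.4, 20.0]
to_cloud = deadband_filter(raw, threshold=1.0)
print(to_cloud)  # [20.0, 23.5, 20.0]
```

Here only 3 of 8 readings leave the edge, while the significant changes are still reported.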

Edge Streaming Analytics
• Three stages of streaming analytics at the
edge:
▪ Raw input data → data coming from the sensors into
the analytics processing unit
▪ Analytics processing unit (APU) → filters and
combines data streams, organizes them by time
windows, and performs various analytical functions
▪ Output streams → data that is organized into
insightful streams and passed on for storage and further
processing in the cloud

Edge Streaming Analytics
• Core functions of the Analytics Processing
Unit (APU):
▪ Filter → identifies the information that is considered
important to be processed on the edge
▪ Transform → manipulates the data structure into a form
required for further processing
▪ Time → establishes a timing context for real-time
streaming data flows

Edge Streaming Analytics
• Core functions of the Analytics Processing
Unit (APU):
▪ Correlate → combines multiple data streams from
different types of sensors, such as the body temperature,
heart rate, and blood pressure of a patient
▪ Match patterns → gains deeper insights into the data,
such as a sudden change in heart rate
▪ Improve business intelligence → enables quicker and
more timely responses
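
The filter, transform, time, correlate, and match-pattern functions can be chained as in the sketch below. The patient data, the 10-second tumbling window, and the jump threshold of 20 beats per minute are all hypothetical choices for illustration.

```python
from collections import defaultdict

WINDOW_S = 10  # assumed tumbling-window length in seconds

# Hypothetical raw streams: (timestamp_s, value). None marks a dropped sample.
heart_rate = [(1, 72), (4, 74), (12, None), (15, 75), (22, 110), (27, 114)]
body_temp_f = [(2, 98.6), (13, 98.8), (24, 100.4)]


def filter_valid(stream):
    """Filter: keep only samples that carry a value."""
    return [(t, v) for t, v in stream if v is not None]


def to_celsius(stream):
    """Transform: convert Fahrenheit samples to Celsius."""
    return [(t, (v - 32) * 5 / 9) for t, v in stream]


def window_means(stream):
    """Time: group samples into tumbling windows and average each window."""
    buckets = defaultdict(list)
    for t, v in stream:
        buckets[t // WINDOW_S].append(v)
    return {w: sum(vs) / len(vs) for w, vs in sorted(buckets.items())}


def correlate(a, b):
    """Correlate: join two windowed streams on the windows they share."""
    return {w: (a[w], b[w]) for w in a if w in b}


def sudden_jumps(windowed, min_jump):
    """Match patterns: flag windows whose mean rose sharply over the previous one."""
    windows = sorted(windowed)
    return [w for prev, w in zip(windows, windows[1:])
            if windowed[w] - windowed[prev] >= min_jump]


hr = window_means(filter_valid(heart_rate))
temp = window_means(to_celsius(filter_valid(body_temp_f)))
combined = correlate(hr, temp)
alerts = sudden_jumps(hr, min_jump=20)
print(hr, alerts)
```

With this data the heart-rate means per window are 73, 75, and 112, so only the third window (index 2) is flagged.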

Edge Streaming Analytics
• Depending on the application, analytics can
happen at any point throughout the IoT system
• An example of pressure and temperature
measurement on an oil rig:
▪ Analytics directly on the edge
▪ A fog node located on the same oil rig performs
streaming analytics on data from several edge devices
▪ After fog analysis, the results are forwarded to the cloud for
deeper historical analysis

Outlines
• An Introduction to Data Analytics for IoT
• Big Data Analytics Tools and Technology
• Edge Streaming Analytics
• Network Analytics

Network Analytics
• Data analytics → finding patterns in the data
generated by endpoints
• Network analytics → discovering patterns in the
communication flows from a network perspective

Network Analytics
• For wireless IoT networks, a packet sniffer can be
used for flow analytics

[Figure: CatSniffer and Wireshark. Source: test-and-measurement-world.com]

Network Analytics
• The benefits of network flow analytics:
▪ Network traffic monitoring and profiling →
IPv4/IPv6 network-wide traffic volume and pattern
analysis
▪ Application traffic monitoring and profiling → gain
a detailed time-based view of IoT access services, such
as MQTT and CoAP
▪ Capacity planning → track and anticipate IoT traffic
growth and help in the planning of upgrades
▪ Security analysis → a change in network traffic behavior
may indicate a cybersecurity event, such as a denial of
service (DoS) attack
▪ Accounting → analyze and optimize billing
▪ Data warehousing and data mining → flow data can
be warehoused for later retrieval and analysis
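
Two of these benefits, application profiling and security analysis, can be sketched on a handful of flow records. The records, the baseline, and the factor of 5 are hypothetical; the port mapping uses the default ports for MQTT (TCP 1883) and CoAP (UDP 5683).

```python
from collections import Counter

# Default ports: MQTT uses TCP 1883, CoAP uses UDP 5683.
APP_BY_PORT = {("tcp", 1883): "MQTT", ("udp", 5683): "CoAP"}

# Hypothetical flow records: (protocol, destination port, bytes).
flows = [
    ("tcp", 1883, 1200), ("udp", 5683, 300), ("tcp", 1883, 800),
    ("udp", 5683, 250), ("tcp", 8080, 5000),
]


def bytes_per_application(records):
    """Application profiling: total bytes per recognised application."""
    totals = Counter()
    for proto, port, size in records:
        totals[APP_BY_PORT.get((proto, port), "other")] += size
    return dict(totals)


def is_anomalous(current_bytes, baseline_bytes, factor=5):
    """Security analysis: flag volume far above baseline, e.g. a possible DoS."""
    return current_bytes > factor * baseline_bytes


profile = bytes_per_application(flows)
print(profile)  # {'MQTT': 2000, 'CoAP': 550, 'other': 5000}
print(is_anomalous(12000, profile["MQTT"]))  # True
```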

Summary
• An Introduction to Data Analytics for IoT
• Big Data Analytics Tools and Technology
• Edge Streaming Analytics
• Network Analytics

Thank you!
Q&A
