100% found this document useful (1 vote)

283 views102 pages

Understanding NoSQL and Its Benefits

The document discusses NoSQL databases as an alternative to traditional relational databases. It covers why NoSQL databases were developed, including the need to handle large volumes of data across clusters of servers and the "impedance mismatch" between object-oriented programs and relational databases. It also summarizes several common data models used in NoSQL databases, including key-value, document, column family, and graph models.

Uploaded by

Amina Sultana

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

283 views102 pages

Understanding NoSQL and Its Benefits

Uploaded by

Amina Sultana

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 102

NOSQL

MODULE-1

Why NOSQL

Aggregate Data Models

More Details on Data Models

WHY NOSQL
NoSQL database provides much more
flexibility when it comes to handling
data. There is no requirement to specify
the schema to start working with the
application. Also, the NoSQL database
doesn't put a restriction on the types of
data you can store together. It allows you
to add more new types as your needs
change
THE VALUE OF RELATIONAL
DATABASES

 Getting at Persistent Data

 Concurrency

 Integration

 A (Mostly) Standard Model

GETTING AT PERSISTENT DATA

 Need to Store Large Data.

 Two Ways storing Data
 Main Memory – Limited in Space – loss of Data due to
power failures
 Backing Data - Large in Size – Slower

 Productivity Apps – Word Processor – File System

 Enterprise Applications – Database
CONCURRENCY

 Multiple Users Accessing at a Time

 Majorly Modifying Data

 Transaction Handling (Deadlock)

 Transactions should be Rolled Back if Needed

Hotel Room Booking

INTEGRATION

 Applications Written by Multiple Teams

 Collaboration

 Shared Database Integration

 Concurrency Control of Database handles Multiple

Applications
A (MOSTLY) STANDARD MODEL

 Relational databases have succeeded because they provide

the core benefits we outlined earlier in a (mostly) standard

way

 Vendors Might Differ but not the Benefits

 Note:
 Every RDBMS system must follow the same model or same structure or

same definition, only difference is queries may change.

IMPEDANCE MISMATCH

Though RDBMS provides many advantages still it is not

perfect. One of the dissatisfaction for developers is
“Impedance Mismatch”

Impedance Mismatch

The difference between the relational model and the in-

memory data structures
IMPEDANCE MISMATCH

 The relational data model organizes data into a structure

of tables and rows, or more properly, relations and tuples
 The values in a relational tuple have to be simple—they
cannot contain any structure, such as a nested record or a
list
 if you want to use a richer in-memory data structure, you
have to translate it to a relational representation to store
it on disk
IMPEDANCE MISMATCH
IMPEDANCE MISMATCH

 The Solution in earl 2000’s is OOP (object oriented

programming) and OOD (object oriented Database) .
 OOD given solution to Impedance Mismatch
 Major issue is Integration with RDBMS
 Frame Works for Integrations like HIBERNATE
 Solution is not Feasible
APPLICATION AND INTEGRATION
DATABASES
 Integration Database
with multiple applications, usually developed by
separate teams, storing their data in a common
database. This improves communication because all
the applications are operating on a consistent set of
persistent data
 Complexity has been Increased

 Number of Applications is a Tedious Task

 In 2000’s the Paradigm Shift is “WEB SERVICES”

APPLICATION AND INTEGRATION
DATABASES

 HTTP
 Flexibility in Exchanging the Data through HTTP REQ/RESP

 XML or JSON
 Application Specific Database instead of Integrated
Database
 Eg: flipkart website- working as Application Specific.
ATTACK OF THE CLUSTERS
 Growth in Millenium in the Name of Applications and
Databases
 Y2K Problem

 Traffic on Websites Increased

 Social Media

 Log Data

 Mapping of Data

To handle this kind of increase, you have two choices: up or

out
SCALE UP or GO OUT OF THE MARKET
Eg: orkut
ATTACK OF THE CLUSTERS
 Scaling up implies bigger machines, more processors,
disk storage, and memory. But bigger machines get more
and more expensive, not to mention that there are real
limits as your size increases. The alternative is to use lots
of small machines in a cluster.
 A cluster of small machines can use commodity
hardware and ends up being cheaper at these kinds of
scales. It can also be more resilient—while individual
machine failures are common, the overall cluster can be
built to keep going despite such failures, providing high
reliability.
ATTACK OF THE CLUSTERS
 Relational databases are not designed to be run on
clusters
 Clustered relational databases, such as the Oracle RAC
or Microsoft SQL Server, work on the concept of a
shared disk subsystem
 This mismatch between relational databases and clusters
led some organization to consider an alternative route to
data storage. Two companies in particular—Google and
Amazon
 BigTable from Google and Dynamo from Amazon.
THE EMERGENCE OF NOSQL
 Late 90’s
 Open Source

 Carlo Strozzi

 This database stores its tables as ASCII files, each tuple

represented by a line with fields separated by tabs
 The name comes from the fact that the database doesn’t
use SQL as a query language
 The database is manipulated through shell scripts that
can be combined into the usual UNIX pipelines
THE EMERGENCE OF NOSQL
 Relational databases use ACID transactions to handle
consistency across the whole database.
 NoSQL databases offer a range of options for
consistency and distribution
 Graph databases are one style of NoSQL databases that
uses a distribution model similar to relational databases
but offers a different data model that makes it better at
handling data with complex relationships.
 NoSQL databases operate without a schema
 Useful when dealing with nonuniform data
KEY POINTS
 Relational databases have been a successful technology for twenty
years, providing persistence, concurrency control, and an integration
mechanism.
 Application developers have been frustrated with the impedance
mismatch between the relational model and the in-memory data
structures.
 There is a movement away from using databases as integration points
towards encapsulating databases within applications and integrating
through services.
 The vital factor for a change in data storage was the need to support
large volumes of data by running on clusters. Relational databases are
not designed to run efficiently on clusters.
 NoSQL is an accidental neologism. There is no prescriptive definition
—all you can make is an observation of common characteristics.
KEY POINTS
 The common characteristics of NoSQL databases are
 Not using the relational model

 Running well on clusters

 Open-source

 Built for the 21st century web estates

 Schemaless

 The most important result of the rise of NoSQL is

Polyglot Persistence – Various Data Storage options are
available
AGGREGATE DATA MODELS
 A data model is the model through which we perceive
and manipulate our data
 Data Model describes how we interact with the data in
the database
 Distinct from a storage model, which describes how the
database stores and manipulates the data internally
 Developer might point to an entity-relationship diagram
of their database and refer to that as their data model
containing customers, orders, products, and the like
AGGREGATE DATA MODELS
 Relational Model
 Consists of Rows and Columns in the form of Tables
 NoSQL solution has a different model that it uses, which
we put into four categories widely used in the NoSQL
ecosystem:
 Key-Value
 Document
 Column-Family
 Graph
AGGREGATES
 Relational model takes the information that we want to
store and divides it into tuples (rows)
 A tuple is a limited data structure

 Cannot nest one tuple within another to get nested

records, nor can you put a list of values or tuples within
another.
aggregate is a collection of related objects that we wish to
treat as a unit. Aggregate will write with JSON or XML.
Eg: kaggle where you can get datasets
DATA MODEL ORIENTED AROUND A
RELATIONAL DATABASE(USING UML)
 A column store database is a type of database
that stores data using a column oriented model.
 A column store database can also be referred to
as a:
• Column database

• Column family database

• Column oriented database

• Wide column store database

• Wide column store

• Columnar database

• Columnar store
The Structure of a Column Store Database
Columns store databases use a concept called a keyspace. A
keyspace is kind of like a schema in the relational model. The
keyspace contains all the column families (kind of like tables in
the relational model), which contain rows, which contain
columns.
GRAPH DATABASES
UPDATING MV

NoSQL Database Replication Models
No ratings yet
NoSQL Database Replication Models
32 pages
Big Data Analytics Unit-2
No ratings yet
Big Data Analytics Unit-2
30 pages
UML Tools
No ratings yet
UML Tools
13 pages
Nosql Module 2
100% (1)
Nosql Module 2
87 pages
CC Module 5
No ratings yet
CC Module 5
26 pages
NoSQL Database Guide for Big Data
No ratings yet
NoSQL Database Guide for Big Data
5 pages
Module 4 Nosql
No ratings yet
Module 4 Nosql
8 pages
NoSQL vs. Relational Databases
No ratings yet
NoSQL vs. Relational Databases
20 pages
Lecture 3 Multiprocessor Vs Multicomputer Vs DS
No ratings yet
Lecture 3 Multiprocessor Vs Multicomputer Vs DS
55 pages
Bca Vi Sem Bi - Unit II
No ratings yet
Bca Vi Sem Bi - Unit II
84 pages
NOSQL
No ratings yet
NOSQL
55 pages
Unit V
No ratings yet
Unit V
67 pages
System Models For Distributed and Cloud Computing
No ratings yet
System Models For Distributed and Cloud Computing
15 pages
Chapter 1 Introduction To Big Data
No ratings yet
Chapter 1 Introduction To Big Data
19 pages
HBase Overview: Data Model & Clients
No ratings yet
HBase Overview: Data Model & Clients
34 pages
Cloud Computing Unit 4 Notes
No ratings yet
Cloud Computing Unit 4 Notes
41 pages
MST Unit 5
No ratings yet
MST Unit 5
6 pages
Stack Implementation with Linked List in Java
No ratings yet
Stack Implementation with Linked List in Java
20 pages
B+ Tree and B Tree Indexing in DBMS
No ratings yet
B+ Tree and B Tree Indexing in DBMS
27 pages
Introduction to Hadoop HDFS
No ratings yet
Introduction to Hadoop HDFS
9 pages
NoSQL Scaling and Consistency
No ratings yet
NoSQL Scaling and Consistency
76 pages
Document Database Data Modeling
No ratings yet
Document Database Data Modeling
27 pages
Unit Iii
100% (1)
Unit Iii
43 pages
Unit 1 Bda Complete Notes
No ratings yet
Unit 1 Bda Complete Notes
15 pages
Object-Oriented Database Concepts
No ratings yet
Object-Oriented Database Concepts
42 pages
NoSQL Module1 PPT
No ratings yet
NoSQL Module1 PPT
64 pages
Bda Unit 1
No ratings yet
Bda Unit 1
32 pages
Understanding NoSQL Databases and CAP Theorem
No ratings yet
Understanding NoSQL Databases and CAP Theorem
23 pages
Understanding Cloud Foundations
No ratings yet
Understanding Cloud Foundations
68 pages
Object-Oriented Databases Guide
No ratings yet
Object-Oriented Databases Guide
31 pages
Digital Marketing and Non - Line World: Business Intelligence The Intelligence Platforms
No ratings yet
Digital Marketing and Non - Line World: Business Intelligence The Intelligence Platforms
11 pages
Understanding Object-Oriented Databases
No ratings yet
Understanding Object-Oriented Databases
13 pages
OS - Module 5 - Memory Management
No ratings yet
OS - Module 5 - Memory Management
81 pages
BDA - Chapter-1-Components of Hadoop Ecosystem - Lecture 3
0% (1)
BDA - Chapter-1-Components of Hadoop Ecosystem - Lecture 3
38 pages
Redis Presentation
80% (5)
Redis Presentation
43 pages
Data Structures for Beginners
100% (1)
Data Structures for Beginners
31 pages
Cassandra: Decentralized Storage System
No ratings yet
Cassandra: Decentralized Storage System
37 pages
RMI & JDBC for Java Developers
No ratings yet
RMI & JDBC for Java Developers
30 pages
Bda U-5
No ratings yet
Bda U-5
30 pages
Unit V Cloud Technologies and Advancements
100% (1)
Unit V Cloud Technologies and Advancements
33 pages
Unit-3 (Mongo DB)
No ratings yet
Unit-3 (Mongo DB)
47 pages
BCA Project Report Format
No ratings yet
BCA Project Report Format
3 pages
Message Queues (ActiveMQs and Kafka)
No ratings yet
Message Queues (ActiveMQs and Kafka)
7 pages
Unit V Case Studies
No ratings yet
Unit V Case Studies
37 pages
Big Data Stream Processing Guide
No ratings yet
Big Data Stream Processing Guide
22 pages
Java Persistence for Developers
No ratings yet
Java Persistence for Developers
23 pages
SQL vs NoSQL: Key Differences Explained
No ratings yet
SQL vs NoSQL: Key Differences Explained
4 pages
Key-Value Database Characteristics
No ratings yet
Key-Value Database Characteristics
13 pages
NoSQL Databases: CAP Theorem & MongoDB Guide
No ratings yet
NoSQL Databases: CAP Theorem & MongoDB Guide
34 pages
Understanding Hadoop MapReduce Workflow
No ratings yet
Understanding Hadoop MapReduce Workflow
11 pages
Understanding NoSQL Databases Explained
No ratings yet
Understanding NoSQL Databases Explained
54 pages
Unit 5-Key - Value Store Database
No ratings yet
Unit 5-Key - Value Store Database
16 pages
Unit Iv Multithreading and Generic Programming
No ratings yet
Unit Iv Multithreading and Generic Programming
24 pages
NoSQL Databases: A Developer's Guide
No ratings yet
NoSQL Databases: A Developer's Guide
36 pages
Apache Spark Architecture Overview
No ratings yet
Apache Spark Architecture Overview
4 pages
Cloud Computing Architecture Module III
No ratings yet
Cloud Computing Architecture Module III
14 pages
MC5303 Web Programming Essentials
100% (1)
MC5303 Web Programming Essentials
115 pages
Module 1
No ratings yet
Module 1
52 pages
DMND Module 1
No ratings yet
DMND Module 1
51 pages
NoSQL vs SQL: Key Differences Explained
No ratings yet
NoSQL vs SQL: Key Differences Explained
70 pages
Api For Odfpy
No ratings yet
Api For Odfpy
89 pages
Unit 4: Results Recording: Closing Inspection Characteristics
No ratings yet
Unit 4: Results Recording: Closing Inspection Characteristics
2 pages
The ABC of Cybersecurity How To Prevent Phishing Social Engineering Attacks, Incident Management Best Practices And... (Mike Miller)
No ratings yet
The ABC of Cybersecurity How To Prevent Phishing Social Engineering Attacks, Incident Management Best Practices And... (Mike Miller)
335 pages
2022 Scheme and Syllabus 1
No ratings yet
2022 Scheme and Syllabus 1
131 pages
1
No ratings yet
1
14 pages
Test Monitoring and Control Overview
No ratings yet
Test Monitoring and Control Overview
20 pages
Allplan FT V17 Manual: Getting Started
No ratings yet
Allplan FT V17 Manual: Getting Started
418 pages
Lego Multiplication Flashcards
No ratings yet
Lego Multiplication Flashcards
17 pages
Lecture 2: Data Structures
No ratings yet
Lecture 2: Data Structures
5 pages
6 Series Mill Controller Operation Manual: 6 系列銑床操作手冊 Date: 2015/11/13
No ratings yet
6 Series Mill Controller Operation Manual: 6 系列銑床操作手冊 Date: 2015/11/13
154 pages
Da1 Scribd
No ratings yet
Da1 Scribd
7 pages
C++ CH 3
No ratings yet
C++ CH 3
7 pages
FRONTEX Best Practice Technical Guidelines For ABC
No ratings yet
FRONTEX Best Practice Technical Guidelines For ABC
63 pages
SIC Assembler Language Program Example
100% (1)
SIC Assembler Language Program Example
66 pages
Key Components of Infrastructure Planning For A Growing Business
No ratings yet
Key Components of Infrastructure Planning For A Growing Business
3 pages
Salesforce Spring 24 Release Notes
No ratings yet
Salesforce Spring 24 Release Notes
14 pages
Passive Income for Creatives
No ratings yet
Passive Income for Creatives
11 pages
Vedant Rathore Resume N.
No ratings yet
Vedant Rathore Resume N.
1 page
FortiAnalyzer
No ratings yet
FortiAnalyzer
43 pages
ICS 103: Computer Programming in C Handout-2 Topic: Overview of C. Objective
No ratings yet
ICS 103: Computer Programming in C Handout-2 Topic: Overview of C. Objective
3 pages
E Catalog Platinous PDF
No ratings yet
E Catalog Platinous PDF
8 pages
RADAN 2020.0 - Installation Document
No ratings yet
RADAN 2020.0 - Installation Document
24 pages
Assignment One Building Robots
No ratings yet
Assignment One Building Robots
3 pages
EndevorSCM Scenarios ENU
No ratings yet
EndevorSCM Scenarios ENU
260 pages
To ICT: By: Alliah Vance O. Capuz 11 Abm A-St. Sebastian
No ratings yet
To ICT: By: Alliah Vance O. Capuz 11 Abm A-St. Sebastian
16 pages
Advt Tor Applnform Sms-Itda
No ratings yet
Advt Tor Applnform Sms-Itda
8 pages
3D Printer Design & Manufacturing
No ratings yet
3D Printer Design & Manufacturing
52 pages
Focusrite IEEE 1394 Compatibility
No ratings yet
Focusrite IEEE 1394 Compatibility
5 pages
Amdgpu Help
No ratings yet
Amdgpu Help
1 page
Corsa C Cruise Control Installation Guide
100% (3)
Corsa C Cruise Control Installation Guide
29 pages

Understanding NoSQL and Its Benefits

Uploaded by

Understanding NoSQL and Its Benefits

Uploaded by

NOSQL

Aggregate Data Models

More Details on Data Models

 Getting at Persistent Data

 A (Mostly) Standard Model

 Need to Store Large Data.

 Productivity Apps – Word Processor – File System

 Multiple Users Accessing at a Time

 Majorly Modifying Data

 Transaction Handling (Deadlock)

 Transactions should be Rolled Back if Needed

Hotel Room Booking

 Applications Written by Multiple Teams

 Shared Database Integration

 Concurrency Control of Database handles Multiple

 Relational databases have succeeded because they provide

the core benefits we outlined earlier in a (mostly) standard

 Vendors Might Differ but not the Benefits

same definition, only difference is queries may change.

Though RDBMS provides many advantages still it is not

The difference between the relational model and the in-

 The relational data model organizes data into a structure

 The Solution in earl 2000’s is OOP (object oriented

 Number of Applications is a Tedious Task

 In 2000’s the Paradigm Shift is “WEB SERVICES”

 Traffic on Websites Increased

To handle this kind of increase, you have two choices: up or

 This database stores its tables as ASCII files, each tuple

 Running well on clusters

 Built for the 21st century web estates

 The most important result of the rise of NoSQL is

 Cannot nest one tuple within another to get nested

• Column family database

• Column oriented database

• Wide column store database

• Wide column store

You might also like