0% found this document useful (0 votes)

13 views49 pages

No SQL

The document discusses the evolution and advantages of NoSQL databases, highlighting their ability to handle large datasets and the limitations of traditional RDBMS. It covers key concepts such as the CAP theorem, types of NoSQL databases, and specific implementations like Cassandra. The document also provides insights into data modeling, consistency models, and practical examples of using NoSQL in applications.

Uploaded by

gamedneek7

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views49 pages

No SQL

Uploaded by

gamedneek7

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 49

Why this topic?

< Client’s Application Roadmap

– “Reduction of cycle time for the document
intake process. Currently, it can take anywhere
from a few days to a few weeks from the time
the documents are received to when they are
available to the client.”
< New York Times used Hadoop/MapReduce to
convert pre-1980 articles that were TIFF
images to PDF.

2
Agenda

< Some history

< What is NoSQL
< CAP Theorem
< What is lost
< Types of NoSQL
< Data Model
< Frameworks
< Demo
< Wrapup

3
History of the World, Part 1

< Relational
Databases – mainstay of business
< Web-based applications caused spikes
– Especially true for public-facing e-Commerce sites
< Developers begin to front RDBMS with memcache or
integrate other caching mechanisms within the
application (ie. Ehcache)

4
Scaling Up

< Issues with scaling up when the dataset is just too

big
< RDBMS were not designed to be distributed
< Began to look at multi-node database solutions
< Known as ‘scaling out’ or ‘horizontal scaling’
< Different approaches include:
– Master-slave
– Sharding

5
Scaling RDBMS – Master/Slave

< Master-Slave
– All writes are written to the master. All reads
performed against the replicated slave databases
– Critical reads may be incorrect as writes may not
have been propagated down
– Large data sets can pose problems as master needs
to duplicate data to slaves

6
Scaling RDBMS - Sharding

< Partition or sharding

– Scales well for both reads and writes
– Not transparent, application needs to be partition-
aware
– Can no longer have relationships/joins across
partitions
– Loss of referential integrity across shards

7
Other ways to scale RDBMS

< Multi-Masterreplication
< INSERT only, not UPDATES/DELETES
< No JOINs, thereby reducing query time
– This involves de-normalizing data
< In-memory databases

8
What is NoSQL?

< Stands for Not Only SQL

< Class of non-relational data storage systems
< Usually do not require a fixed table schema nor do
they use the concept of joins
< All NoSQL offerings relax one or more of the ACID
properties (will talk about the CAP theorem)

9
Why NoSQL?

< For data storage, an RDBMS cannot be the be-

all/end-all
< Just as there are different programming languages,
need to have other data storage tools in the toolbox
< A NoSQL solution is more acceptable to a client now
than even a year ago
– Think about proposing a Ruby/Rails or Groovy/Grails
solution now versus a couple of years ago

10
How did we get here?

< Explosion of social media sites (Facebook,

Twitter) with large data needs
< Rise of cloud-based solutions such as Amazon
S3 (simple storage solution)
< Just as moving to dynamically-typed
languages (Ruby/Groovy), a shift to
dynamically-typed data with frequent schema
changes
< Open-source community

11
Dynamo and BigTable

< Three
major papers were the seeds of the NoSQL
movement
– BigTable (Google)
– Dynamo (Amazon)
• Gossip protocol (discovery and error detection)
• Distributed key-value data store
• Eventual consistency
– CAP Theorem (discuss in a sec ..)

12
The Perfect Storm

< Large datasets, acceptance of alternatives, and

dynamically-typed data has come together in a
perfect storm
< Not a backlash/rebellion against RDBMS
< SQL is a rich query language that cannot be rivaled
by the current list of NoSQL offerings

13
CAP Theorem

< Three properties of a system: consistency,

availability and partitions
< You can have at most two of these three properties
for any shared-data system
< To scale out, you have to partition. That leaves
either consistency or availability to choose from
– In almost all cases, you would choose availability over
consistency

14
Availability

< Traditionally, thought of as the server/process

available five 9’s (99.999 %).
< However, for large node system, at almost any point
in time there’s a good chance that a node is either
down or there is a network disruption among the
nodes.
– Want a system that is resilient in the face of network
disruption

15
Consistency Model

<A consistency model determines rules for visibility

and apparent order of updates.
< For example:
– Row X is replicated on nodes M and N
– Client A writes row X to node N
– Some period of time t elapses.
– Client B reads row X from node M
– Does client B see the write from client A?
– Consistency is a continuum with tradeoffs
– For NoSQL, the answer would be: maybe
– CAP Theorem states: Strict Consistency can't be
achieved at the same time as availability and partition-
tolerance.
16
Eventual Consistency

< When no updates occur for a long period of time,

eventually all updates will propagate through the
system and all the nodes will be consistent
< For a given accepted update and a given node,
eventually either the update reaches the node or the
node is removed from service
< Known as BASE (Basically Available, Soft state,
Eventual consistency), as opposed to ACID

17
What kinds of NoSQL

< NoSQL solutions fall into two major areas:

– Key/Value or ‘the big hash table’.
• Amazon S3 (Dynamo)
• Voldemort
• Scalaris
– Schema-less which comes in multiple flavors,
column-based, document-based or graph-based.
• Cassandra (column-based)
• CouchDB (document-based)
• Neo4J (graph-based)
• HBase (column-based)

18
Key/Value

Pros:
– very fast
– very scalable
– simple model
– able to distribute horizontally

Cons:
- many data structures (objects) can't be easily
modeled as key value pairs

19
Schema-Less

Pros:
- Schema-less data model is richer than key/value pairs
- eventual consistency
- many are distributed
- still provide excellent performance and scalability

Cons:
- typically no ACID transactions or joins

20
Common Advantages

< Cheap, easy to implement (open source)

< Data are replicated to multiple nodes (therefore identical
and fault-tolerant) and can be partitioned
– Down nodes easily replaced
– No single point of failure
< Easy to distribute
< Don't require a schema
< Can scale up and down
< Relax the data consistency requirement (CAP)

21
What am I giving up?

< joins
< group by
< order by
< ACID transactions
< SQL as a sometimes frustrating but still powerful
query language
< easy integration with other applications that support
SQL

22
Cassandra

< Originallydeveloped at Facebook

< Follows the BigTable data model: column-oriented
< Uses the Dynamo Eventual Consistency model
< Written in Java
< Open-sourced and exists within the Apache family
< Uses Apache Thrift as it’s API

23
Thrift

< Created at Facebook along with Cassandra

< Is a cross-language, service-generation framework
< Binary Protocol (like Google Protocol Buffers)
< Compiles to: C++, Java, PHP, Ruby, Erlang, Perl, ...

24
Searching

< Relational
– SELECT `column` FROM `database`,`table` WHERE
`id` = key;
– SELECT product_name FROM rockets WHERE id =
123;
< Cassandra (standard)
– keyspace.getSlice(key, “column_family”, "column")
– keyspace.getSlice(123, new ColumnParent(“rockets”),
getSlicePredicate());

25
Typical NoSQL API

< Basic API access:

– get(key) -- Extract the value given a key
– put(key, value) -- Create or update the value given its
key
– delete(key) -- Remove the key and its associated
value
– execute(key, operation, parameters) -- Invoke an
operation to the value (given its key) which is a
special data structure (e.g. List, Set, Map .... etc).

26
Data Model

< Within Cassandra, you will refer to data this

way:
– Column: smallest data element, a tuple with
a name and a value
:Rockets, '1' might return:
{'name' => ‘Rocket-Powered Roller Skates',
‘toon' => ‘Ready Set Zoom',
‘inventoryQty' => ‘5‘,
‘productUrl’ => ‘rockets\1.gif’}

27
Data Model Continued

– ColumnFamily: There’s a single structure used to group

both the Columns and SuperColumns. Called a
ColumnFamily (think table), it has two types, Standard &
Super.
• Column families must be defined at startup

– Key: the permanent name of the record

– Keyspace: the outer-most level of organization. This
is usually the name of the application. For example,
‘Acme' (think database name).

28
Cassandra and Consistency

< Talked previous about eventual consistency

< Cassandra has programmable read/writable
consistency
– One: Return from the first node that responds
– Quorom: Query from all nodes and respond with the
one that has latest timestamp once a majority of
nodes responded
– All: Query from all nodes and respond with the one
that has latest timestamp once all nodes responded.
An unresponsive node will fail the node

29
Cassandra and Consistency

– Zero: Ensure nothing. Asynchronous write done in

background
– Any: Ensure that the write is written to at least 1
node
– One: Ensure that the write is written to at least 1
node’s commit log and memory table before receipt to
client
– Quorom: Ensure that the write goes to node/2 + 1
– All: Ensure that writes go to all nodes. An
unresponsive node would fail the write

30
Consistent Hashing

< Partition using consistent hashing

– Keys hash to a point on a
fixed circular space
– Ring is partitioned into a set of
ordered slots and servers and
keys hashed over these slots
< Nodes take positions on the circle.
< A, B, and D exists.
– B responsible for AB range.
– D responsible for BD range.
– A responsible for DA range.
< C joins.
– B, D split ranges.
– C gets BC from D.

31
Domain Model

< Design your domain model first

< Create your Cassandra data store to fit your domain
model

32
Data Model

ColumnFamily: Rockets
Key Value

1 Name Value

name Rocket-Powered Roller Skates

toon Ready, Set, Zoom
inventoryQty 5
brakes false

2 Name Value

name Little Giant Do-It-Yourself Rocket-Sled Kit

toon Beep Prepared
inventoryQty 4
brakes false

3 Name Value

name Acme Jet Propelled Unicycle

toon Hot Rod and Reel
inventoryQty 1
wheels 1

33
Data Model Continued

– Optional super column: a named list. A super

column contains standard columns, stored in recent
order
• Say the OtherProducts has inventory in categories. Querying
(:OtherProducts, '174927') might return:
{‘OtherProducts' => {'name' => ‘Acme Instant Girl', ..},
‘foods': {...}, ‘martian': {...}, ‘animals': {...}}
• In the example, foods, martian, and animals are all super
column names. They are defined on the fly, and there can be
any number of them per row. :OtherProducts would be the
name of the super column family.
– Columns and SuperColumns are both tuples with a
name & value. The key difference is that a standard
Column’s value is a “string” and in a SuperColumn the
value is a Map of Columns.

34
Data Model Continued

< Columns are always sorted by their name. Sorting

supports:
– BytesType
– UTF8Type
– LexicalUUIDType
– TimeUUIDType
– AsciiType
– LongType
< Each of these options treats the Columns' name as a
different data type

35
Hector

< Leading Java API for Cassandra

< Sits on top of Thrift
< Adds following capabilities
– Load balancing
– JMX monitoring
– Connection-pooling
– Failover
– JNDI integration with application servers
– Additional methods on top of the standard get, update,
delete methods.
< Under discussion
– hooks into Spring declarative transactions

36
Hector and JMX

37
Code Examples: Tomcat Configuration

Tomcat context.xml

J2EE web.xml

<resource-env-ref>
<description>Object factory for Cassandra clients.</description>
<resource-env-ref-name>cassandra/CassandraClientFactory</resource-
env-ref-name>
<resource-env-ref-
type>org.apache.naming.factory.BeanFactory</resource-env-ref-type>
</resource-env-ref>

38
Code Examples: Spring Configuration

Spring applicationContext.xml

<bean id="cassandraHostConfigurator“
class="org.springframework.jndi.JndiObjectFactoryBean">
<property name="jndiName">
<value>cassandra/CassandraClientFactory</value></property>
<property name="resourceRef"><value>true</value></property>
</bean>

39
Code Examples: Cassandra Get Operation

try {
cassandraClient = cassandraClientPool.borrowClient();

// keyspace is Acme
Keyspace keyspace = cassandraClient.getKeyspace(getKeyspace());

// inventoryType is Rockets
List<Column> result = keyspace.getSlice(Long.toString(inventoryId), new
ColumnParent(inventoryType), getSlicePredicate());

inventoryItem.setInventoryItemId(inventoryId);
inventoryItem.setInventoryType(inventoryType);

loadInventory(inventoryItem, result);
} catch (Exception exception) {
logger.error("An Exception occurred retrieving an inventory item", exception);
} finally {
try {
cassandraClientPool.releaseClient(cassandraClient);
} catch (Exception exception) {
logger.warn("An Exception occurred returning a Cassandra client to the pool", exception);
}
}

40
Code Examples: Cassandra Update Operation

try {
cassandraClient = cassandraClientPool.borrowClient();

Map<String, List<ColumnOrSuperColumn>> data = new HashMap<String,

List<ColumnOrSuperColumn>>();
List<ColumnOrSuperColumn> columns = new ArrayList<ColumnOrSuperColumn>();

// Create the inventoryId column.

ColumnOrSuperColumn column = new ColumnOrSuperColumn();
columns.add(column.setColumn(new Column("inventoryItemId".getBytes("utf-8"),
Long.toString(inventoryItem.getInventoryItemId()).getBytes("utf-8"), timestamp)));

column = new ColumnOrSuperColumn();

columns.add(column.setColumn(new Column("inventoryType".getBytes("utf-8"),
inventoryItem.getInventoryType().getBytes("utf-8"), timestamp)));
….
data.put(inventoryItem.getInventoryType(), columns);
cassandraClient.getCassandra().batch_insert(getKeyspace(),
Long.toString(inventoryItem.getInventoryItemId()), data, ConsistencyLevel.ANY);
} catch (Exception exception) {
…
}

41
Some Statistics

< FacebookSearch
< MySQL > 50 GB Data
– Writes Average : ~300 ms
– Reads Average : ~350 ms
< Rewritten with Cassandra > 50 GB Data
– Writes Average : 0.12 ms
– Reads Average : 15 ms

42
Some things to think about

< Ruby on Rails and Grails have ORM baked in. Would
have to build your own ORM framework to work with
NoSQL.
– Some plugins exist.
< Same would go for Java/C#, no Hibernate-like
framework.
– A simple JDO framework does exist.
< Support for basic languages like Ruby.

43
Some more things to think about

< Troubleshooting performance problems

< Concurrency on non-key accesses
< Are the replicas working?
< No TOAD for Cassandra
– though some NoSQL offerings have GUI tools
– have SQLPlus-like capabilities using Ruby IRB
interpreter.

44
Don’t forget about the DBA

< Itdoes not matter if the data is deployed on a

NoSQL platform instead of an RDBMS.
< Still need to address:
– Backups & recovery
– Capacity planning
– Performance monitoring
– Data integration
– Tuning & optimization
< What happens when things don’t work as
expected and nodes are out of sync or you
have a data corruption occurring at 2am?
< Who you gonna call?
– DBA and SysAdmin need to be on board
45
Where would I use it?

< For most of us, we work in corporate IT and a

LinkedIn or Twitter is not in our future
< Where would I use a NoSQL database?
< Do you have somewhere a large set of uncontrolled,
unstructured, data that you are trying to fit into a
RDBMS?
– Log Analysis
– Social Networking Feeds (many firms hooked in
through Facebook or Twitter)
– External feeds from partners (EAI)
– Data that is not easily analyzed in a RDBMS such as
time-based data
– Large data feeds that need to be massaged before
entry into an RDBMS

46
Summary

< Leading users of NoSQL datastores are social

networking sites such as Twitter, Facebook,
LinkedIn, and Digg.
< To implement a single feature in Cassandra, Digg
has a dataset that is 3 terabytes and 76 billion
columns.
< Not every problem is a nail and not every solution is
a hammer.

47
Questions

48
Resources

< Cassandra
– http://cassandra.apache.org
< Hector
– http://wiki.github.com/rantav/hector
– http://prettyprint.me
< NoSQL News websites
– http://nosql.mypopescu.com
– http://www.nosqldatabases.com
< High Scalability
– http://highscalability.com
< Video
– http://www.infoq.com/presentations/Project-
Voldemort-at-Gilt-Groupe

BigData NoSQL
No ratings yet
BigData NoSQL
30 pages
Introduction to NoSQL Databases
No ratings yet
Introduction to NoSQL Databases
43 pages
No SQL
No ratings yet
No SQL
109 pages
Module 1
No ratings yet
Module 1
69 pages
Understanding NoSQL Databases and CAP Theorem
No ratings yet
Understanding NoSQL Databases and CAP Theorem
23 pages
CIS - 468 - 04 - NOSQL Databases and Big Data Storage Systems
No ratings yet
CIS - 468 - 04 - NOSQL Databases and Big Data Storage Systems
102 pages
Riak CS Latency in NoSQL Systems
No ratings yet
Riak CS Latency in NoSQL Systems
22 pages
BDS Session 5 - NoSQL DB
No ratings yet
BDS Session 5 - NoSQL DB
51 pages
NoSQL Databases: Features and Limitations
No ratings yet
NoSQL Databases: Features and Limitations
13 pages
NoSQL D
No ratings yet
NoSQL D
26 pages
NoSQL for Tech Professionals
No ratings yet
NoSQL for Tech Professionals
29 pages
NoSQL Databases
No ratings yet
NoSQL Databases
52 pages
4.NoSQL 1
No ratings yet
4.NoSQL 1
69 pages
BDS Session 10
No ratings yet
BDS Session 10
70 pages
Lec 24
No ratings yet
Lec 24
16 pages
Intro No SQL
No ratings yet
Intro No SQL
44 pages
Lecture 6 - NoSQL
No ratings yet
Lecture 6 - NoSQL
28 pages
NoSQL vs. Cloud Data Storage Systems
No ratings yet
NoSQL vs. Cloud Data Storage Systems
17 pages
Integrating NoSQL with Ruby on Rails
No ratings yet
Integrating NoSQL with Ruby on Rails
15 pages
Seminar Topic Nosql
No ratings yet
Seminar Topic Nosql
73 pages
NoSQL for Tech Professionals
No ratings yet
NoSQL for Tech Professionals
40 pages
NoSQL for Tech Professionals
No ratings yet
NoSQL for Tech Professionals
30 pages
NoSQL Database Technologies Overview
No ratings yet
NoSQL Database Technologies Overview
44 pages
NoSQL Data Management Overview Guide
No ratings yet
NoSQL Data Management Overview Guide
62 pages
BIG - DATA - Unit 4
No ratings yet
BIG - DATA - Unit 4
99 pages
Overview of NoSQL Databases and Concepts
No ratings yet
Overview of NoSQL Databases and Concepts
26 pages
2.1 Nosql
No ratings yet
2.1 Nosql
25 pages
SQL Server Key-Value Store Insights
No ratings yet
SQL Server Key-Value Store Insights
109 pages
NoSQL
No ratings yet
NoSQL
18 pages
Riak CS Latency in NoSQL Systems
No ratings yet
Riak CS Latency in NoSQL Systems
49 pages
Unit VI - 1
No ratings yet
Unit VI - 1
31 pages
NoSQL Database
No ratings yet
NoSQL Database
64 pages
Module 3
No ratings yet
Module 3
37 pages
Visual Guide To NoSQL Systems - Nathan Hurst's Blog
No ratings yet
Visual Guide To NoSQL Systems - Nathan Hurst's Blog
10 pages
Nosql
No ratings yet
Nosql
64 pages
Big Data Analytics Unit-2
No ratings yet
Big Data Analytics Unit-2
30 pages
NoSQL Databases Explained
No ratings yet
NoSQL Databases Explained
13 pages
Understanding NoSQL Databases
No ratings yet
Understanding NoSQL Databases
31 pages
Nosql KK
No ratings yet
Nosql KK
23 pages
Big Data Analysis
No ratings yet
Big Data Analysis
9 pages
Unit 4: Big Data Tehnology Landscape Two Inportant Technologies
No ratings yet
Unit 4: Big Data Tehnology Landscape Two Inportant Technologies
42 pages
NGD Unit 1-4
No ratings yet
NGD Unit 1-4
43 pages
Understanding NoSQL Databases
No ratings yet
Understanding NoSQL Databases
15 pages
NoSQL for Data Engineers
No ratings yet
NoSQL for Data Engineers
144 pages
9 TH
No ratings yet
9 TH
33 pages
Bcse302l Dbms Module-7 Nosql
No ratings yet
Bcse302l Dbms Module-7 Nosql
30 pages
NoSQL & MongoDB Overview
No ratings yet
NoSQL & MongoDB Overview
47 pages
Overview of NoSQL Database Systems
No ratings yet
Overview of NoSQL Database Systems
9 pages
Lecture 1
No ratings yet
Lecture 1
31 pages
NoSQL Databases: Types, Features, and CAP Theorem
No ratings yet
NoSQL Databases: Types, Features, and CAP Theorem
112 pages
Bda Mod 3
No ratings yet
Bda Mod 3
70 pages
21bcs9882 Yuvraj Dbms 9
No ratings yet
21bcs9882 Yuvraj Dbms 9
6 pages
NoSQL, Cloud Computing, and IOT
No ratings yet
NoSQL, Cloud Computing, and IOT
3 pages
Module 5 - NoSQL Databases
No ratings yet
Module 5 - NoSQL Databases
33 pages
41 NoSQL Introduction
No ratings yet
41 NoSQL Introduction
18 pages
Lecture 1 - NoSQL
No ratings yet
Lecture 1 - NoSQL
31 pages
Unitw 12 W 2
No ratings yet
Unitw 12 W 2
18 pages
R Programming: Flat & Cross Tables
No ratings yet
R Programming: Flat & Cross Tables
3 pages
Practical Document For Information Technology Class 10
No ratings yet
Practical Document For Information Technology Class 10
4 pages
Mainframe VSAM Essentials
No ratings yet
Mainframe VSAM Essentials
94 pages
Spool Respcal - SQL: 11 1 Rlwrap Sqlplus System/admonbd2 As Sysdba @respcal - SQL
No ratings yet
Spool Respcal - SQL: 11 1 Rlwrap Sqlplus System/admonbd2 As Sysdba @respcal - SQL
11 pages
Operate Database Application
No ratings yet
Operate Database Application
14 pages
Map - Ethiopia
100% (10)
Map - Ethiopia
1 page
DBMS Assignment 3
No ratings yet
DBMS Assignment 3
9 pages
M.tech - Data Analytics
No ratings yet
M.tech - Data Analytics
3 pages
Quiz (Instana L2) - Attempt Review
No ratings yet
Quiz (Instana L2) - Attempt Review
21 pages
Wonderware Historian Product Overview
No ratings yet
Wonderware Historian Product Overview
8 pages
Architect
100% (1)
Architect
11 pages
Arduino Entegreli Yapay Zeka Destekli Hijack Furby Asistan
No ratings yet
Arduino Entegreli Yapay Zeka Destekli Hijack Furby Asistan
6 pages
Distributed SQL Mariadb Xpand Architecture - Whitepaper - 1106
No ratings yet
Distributed SQL Mariadb Xpand Architecture - Whitepaper - 1106
19 pages
Crop 4679 Stics of The Agricultural Environment Using Various Feature Selection Techniques and Classifiers
No ratings yet
Crop 4679 Stics of The Agricultural Environment Using Various Feature Selection Techniques and Classifiers
6 pages
Data Mining in Business Analytics
No ratings yet
Data Mining in Business Analytics
11 pages
Upload A Document To Access Your Download: Samkim Who Are You
No ratings yet
Upload A Document To Access Your Download: Samkim Who Are You
3 pages
An Agent Framework For Real-Time Financial Information Searching With Large Language Models
No ratings yet
An Agent Framework For Real-Time Financial Information Searching With Large Language Models
7 pages
M3 Concept Map Assignment & Instructions
No ratings yet
M3 Concept Map Assignment & Instructions
2 pages
Database Administrator Roles & Functions
No ratings yet
Database Administrator Roles & Functions
9 pages
Data, Info, and Knowledge Basics
No ratings yet
Data, Info, and Knowledge Basics
6 pages
Quiz For Unit 1 - Vocabulary
0% (2)
Quiz For Unit 1 - Vocabulary
5 pages
Unit - 3 Study Material
No ratings yet
Unit - 3 Study Material
98 pages
VPLEX - VPLEX Customer Procedures-Manage
No ratings yet
VPLEX - VPLEX Customer Procedures-Manage
16 pages
Bro Liberty Chapter One by (Xero)
No ratings yet
Bro Liberty Chapter One by (Xero)
4 pages
Final Ems Project Proposal
No ratings yet
Final Ems Project Proposal
14 pages
Crack Data Analyst Interviews Complete Syllabus
No ratings yet
Crack Data Analyst Interviews Complete Syllabus
10 pages
E-Commerce Product Recommendation System
No ratings yet
E-Commerce Product Recommendation System
14 pages
Intrusion Detection System (IDS)
No ratings yet
Intrusion Detection System (IDS)
15 pages
Tracking Cookies and Especially Third-Party Tracking Cookies Are Commonly Used As
100% (1)
Tracking Cookies and Especially Third-Party Tracking Cookies Are Commonly Used As
9 pages
DBMS 01
No ratings yet
DBMS 01
10 pages

No SQL

Uploaded by

No SQL

Uploaded by

Why this topic?

< Client’s Application Roadmap

< Some history

< Issues with scaling up when the dataset is just too

< Partition or sharding

< Stands for Not Only SQL

< For data storage, an RDBMS cannot be the be-

< Explosion of social media sites (Facebook,

< Large datasets, acceptance of alternatives, and

< Three properties of a system: consistency,

< Traditionally, thought of as the server/process

<A consistency model determines rules for visibility

< When no updates occur for a long period of time,

< NoSQL solutions fall into two major areas:

< Cheap, easy to implement (open source)

< Originallydeveloped at Facebook

< Created at Facebook along with Cassandra

< Basic API access:

< Within Cassandra, you will refer to data this

– ColumnFamily: There’s a single structure used to group

– Key: the permanent name of the record

< Talked previous about eventual consistency

– Zero: Ensure nothing. Asynchronous write done in

< Partition using consistent hashing

< Design your domain model first

name Rocket-Powered Roller Skates

name Little Giant Do-It-Yourself Rocket-Sled Kit

name Acme Jet Propelled Unicycle

– Optional super column: a named list. A super

< Columns are always sorted by their name. Sorting

< Leading Java API for Cassandra

Map<String, List<ColumnOrSuperColumn>> data = new HashMap<String,

// Create the inventoryId column.

column = new ColumnOrSuperColumn();

< Troubleshooting performance problems

< Itdoes not matter if the data is deployed on a

< For most of us, we work in corporate IT and a

< Leading users of NoSQL datastores are social

You might also like