0% found this document useful (0 votes)

97 views19 pages

Dynamo

Dynamo is Amazon's highly available key-value storage system that provides simple operations to access uniquely identified data items. It guarantees availability and durability through techniques like consistent hashing for partitioning, vector clocks for data versioning, and sloppy quorums for writes and hinted handoffs. The system is designed to scale incrementally through symmetry and decentralization while meeting service level agreements for response times under heavy loads.

Uploaded by

b_sisco

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

97 views19 pages

Dynamo

Uploaded by

b_sisco

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

Dynamo: Amazons Highly Available Key-value Store

Giuseppe DeCandia, Deniz Hastorun, Madan Jampani, Gunavardhan Kakulapati, Avinash Lakshman, Alex Pilchin, Swaminathan Sivasubramanian, Peter Vosshall and Werner Vogels

Motivation
Build a distributed storage system:

Scale Simple: key-value Highly available Guarantee Service Level Agreements (SLA)

System Assumptions and Requirements

Query Model:
simple read and write operations to a data

item that is uniquely identified by a key.

ACID Properties:
Durability.

Atomicity, Consistency, Isolation,

Efficiency:

latency requirements which are in general measured at the 99.9th percentile of the distribution. operation environment is assumed to be non-hostile and there are no security related requirements such as authentication and authorization.

Other Assumptions:

Service Level Agreements (SLA)

Application can deliver its

functionality in abounded time: Every dependency in the

platform needs to deliver its functionality with even tighter bounds.

Example: service guaranteeing

that it will provide a response within 300ms for 99.9% of its requests for a peak client load of 500 requests per second.

Service-oriented architecture of Amazons platform

Design Consideration
Sacrifice strong consistency for availability

Conflict resolution is executed during read

instead of write, i.e. always writeable. Other principles:

Incremental scalability. Symmetry. Decentralization. Heterogeneity.

Summary of techniques used in Dynamo and their advantages

Problem
Partitioning High Availability for writes

Technique
Consistent Hashing Vector clocks with reconciliation during reads

Advantage
Incremental Scalability Version size is decoupled from update rates. Provides high availability and durability guarantee when some of the replicas are not available. Synchronizes divergent replicas in the background. Preserves symmetry and avoids having a centralized registry for storing membership and node liveness information.

Handling temporary failures

Sloppy Quorum and hinted handoff

Recovering from permanent failures

Anti-entropy using Merkle trees

Membership and failure detection

Gossip-based membership protocol and failure detection.

Partition Algorithm
Consistent hashing: the output
range of a hash function is treated as a fixed circular space or ring.
Virtual

Nodes: Each node can

be responsible for more than one virtual node.

Advantages of using virtual nodes

If a node becomes unavailable the

load handled by this node is evenly dispersed across the remaining available nodes. When a node becomes available again, the newly available node accepts a roughly equivalent amount of load from each of the other available nodes. The number of virtual nodes that a node is responsible can decided based on its capacity, accounting for heterogeneity in the physical infrastructure.

Replication
Each data item is

replicated at N hosts. preference list: The list of nodes that is responsible for storing a particular key.

Data Versioning
A put() call may return to its caller before the

update has been applied at all the replicas A get() call may return many versions of the same object. Challenge: an object having distinct version sub-histories,
which the system will need to reconcile in the future.

Solution:

uses vector clocks in order to capture causality between different versions of the same object.

Vector Clock
A vector clock is a list of (node, counter)

pairs. Every version of every object is associated with one vector clock. If the counters on the first objects clock are less-than-or-equal to all of the nodes in the second clock, then the first is an ancestor of the second and can be forgotten.

Vector clock example

Execution of get () and put () operations

1. Route its request through a generic load

balancer that will select a node based on load information. 2. Use a partition-aware client library that routes requests directly to the appropriate coordinator nodes.

Sloppy Quorum
R/W is the minimum number of nodes that

must participate in a successful read/write operation. Setting R + W > N yields a quorum-like system. In this model, the latency of a get (or put) operation is dictated by the slowest of the R (or W) replicas. For this reason, R and W are usually configured to be less than N, to provide better latency.

Hinted handoff
Assume N = 3. When A

is temporarily down or unreachable during a write, send replica to D. D is hinted that the replica is belong to A and it will deliver to A when A is recovered. Again: always writeable

Other techniques
Replica synchronization:

Merkle hash tree.

Membership and Failure Detection:

Gossip

Implementation
Java

Local persistence component allows for

different storage engines to be plugged in:

Berkeley Database (BDB) Transactional Data Store: object of tens of kilobytes MySQL: object of > tens of kilobytes BDB Java Edition, etc.

Evaluation

4.3 Amazon Dynamodb
No ratings yet
4.3 Amazon Dynamodb
16 pages
Amazon Dynamo: A Key-Value Store Overview
No ratings yet
Amazon Dynamo: A Key-Value Store Overview
23 pages
Dynamo: Amazon's Key-Value Store Overview
No ratings yet
Dynamo: Amazon's Key-Value Store Overview
26 pages
Dynamo: Amazon's Highly Available Key-Value Store
No ratings yet
Dynamo: Amazon's Highly Available Key-Value Store
21 pages
Dynamo: Amazon's Highly Available Key-Value Store
No ratings yet
Dynamo: Amazon's Highly Available Key-Value Store
21 pages
Replication and Consistency in Distributed Systems (Cont'd)
No ratings yet
Replication and Consistency in Distributed Systems (Cont'd)
17 pages
Distributed Systems Overview
No ratings yet
Distributed Systems Overview
48 pages
Dynamodb Part 2
No ratings yet
Dynamodb Part 2
4 pages
Key-Value Databases Explained
No ratings yet
Key-Value Databases Explained
75 pages
Elixir vs C++ for Dynamo Project Choice
No ratings yet
Elixir vs C++ for Dynamo Project Choice
50 pages
CC Unit 3
No ratings yet
CC Unit 3
19 pages
4 - Key-Value Stores
No ratings yet
4 - Key-Value Stores
47 pages
Amazon Dynamo DB - Presentation
100% (1)
Amazon Dynamo DB - Presentation
30 pages
Amazon's Dynamo: High Availability Focus
No ratings yet
Amazon's Dynamo: High Availability Focus
6 pages
REPLICATION
No ratings yet
REPLICATION
20 pages
Nosql 1
No ratings yet
Nosql 1
40 pages
DS Unit5
No ratings yet
DS Unit5
13 pages
Midterm Cheatsheet
No ratings yet
Midterm Cheatsheet
2 pages
Grokking The Advanced System Design Interview
91% (11)
Grokking The Advanced System Design Interview
397 pages
Unit 5
No ratings yet
Unit 5
29 pages
Assignment Systems2023
100% (3)
Assignment Systems2023
11 pages
Distributed Sys 6thsem
No ratings yet
Distributed Sys 6thsem
11 pages
Distributed Systems Practitioners Dimos Raptis Raspoznan
No ratings yet
Distributed Systems Practitioners Dimos Raptis Raspoznan
259 pages
Big Data Architecture Overview
No ratings yet
Big Data Architecture Overview
16 pages
DC UT1 CompsA
No ratings yet
DC UT1 CompsA
23 pages
NoSQL Data Management Techniques
No ratings yet
NoSQL Data Management Techniques
27 pages
Introduction To Distributed Systems
No ratings yet
Introduction To Distributed Systems
9 pages
Ds
No ratings yet
Ds
32 pages
Consensus
No ratings yet
Consensus
77 pages
Lecture 11A - Replication Control
No ratings yet
Lecture 11A - Replication Control
15 pages
A Case Study On Different Applications and Security Issues in Distributed Systems
No ratings yet
A Case Study On Different Applications and Security Issues in Distributed Systems
10 pages
Replication: Distributed Computing
No ratings yet
Replication: Distributed Computing
43 pages
Understanding Dynamo's Design and Use Cases
No ratings yet
Understanding Dynamo's Design and Use Cases
3 pages
Replication Module7 Selfstudy
No ratings yet
Replication Module7 Selfstudy
32 pages
Google File System Architecture Overview
No ratings yet
Google File System Architecture Overview
40 pages
ICS 408 Exam A
No ratings yet
ICS 408 Exam A
5 pages
Distributed Computing: Beakal Gizachew Assefa
No ratings yet
Distributed Computing: Beakal Gizachew Assefa
54 pages
Replication Control in Distributed Systems
No ratings yet
Replication Control in Distributed Systems
29 pages
07 Replication
No ratings yet
07 Replication
14 pages
CSC 403 Net-Centric Computing
No ratings yet
CSC 403 Net-Centric Computing
47 pages
Lect26 After
No ratings yet
Lect26 After
28 pages
NoSQL - Unit 2
No ratings yet
NoSQL - Unit 2
11 pages
Fault Tolerance Unit 3-4
No ratings yet
Fault Tolerance Unit 3-4
32 pages
DS Mod 1
No ratings yet
DS Mod 1
44 pages
Distributed Computing Overview
No ratings yet
Distributed Computing Overview
7 pages
Unit I
No ratings yet
Unit I
17 pages
NoSQL Sharding and Replication Guide
No ratings yet
NoSQL Sharding and Replication Guide
28 pages
Ch10 Replication
No ratings yet
Ch10 Replication
27 pages
SDA Presentation
No ratings yet
SDA Presentation
12 pages
Module 2
No ratings yet
Module 2
40 pages
Consistency in Distributed Systems
No ratings yet
Consistency in Distributed Systems
21 pages
DS Syllabus Introduction (Reference)
No ratings yet
DS Syllabus Introduction (Reference)
44 pages
Distributed Ledger Technology: The Science of The Blockchain
No ratings yet
Distributed Ledger Technology: The Science of The Blockchain
168 pages
Ch02 - Big Data Storage Concepts
No ratings yet
Ch02 - Big Data Storage Concepts
23 pages
System Design - ML Design 1 PDF
100% (1)
System Design - ML Design 1 PDF
24 pages
Big Data Management and Nosql Databases: Doc. Rndr. Irena Holubova, PH.D
No ratings yet
Big Data Management and Nosql Databases: Doc. Rndr. Irena Holubova, PH.D
27 pages
Dos 6
No ratings yet
Dos 6
22 pages
Sigma Delta Modulation
No ratings yet
Sigma Delta Modulation
8 pages
Example ENG
No ratings yet
Example ENG
1 page
Lekcija09 - 04 NoSQL Redis
No ratings yet
Lekcija09 - 04 NoSQL Redis
40 pages
IEEE Citation Style Guide Overview
No ratings yet
IEEE Citation Style Guide Overview
8 pages
Getting Started Labview For NXT
No ratings yet
Getting Started Labview For NXT
7 pages
LabVIEW Programming Basics
No ratings yet
LabVIEW Programming Basics
25 pages
Visual Navigation for Robots
No ratings yet
Visual Navigation for Robots
26 pages
DBMS Sample Questions Guide
No ratings yet
DBMS Sample Questions Guide
6 pages
From Zero To Hero
No ratings yet
From Zero To Hero
21 pages
MBE1323 Information Technology in TVET
No ratings yet
MBE1323 Information Technology in TVET
46 pages
Student Result System Overview
0% (1)
Student Result System Overview
6 pages
Java Notes
No ratings yet
Java Notes
4 pages
Creating Soft Partitions on Solaris 10
No ratings yet
Creating Soft Partitions on Solaris 10
7 pages
Course s0 English
No ratings yet
Course s0 English
66 pages
ECO1104, Connect, Purchase Options & Registration Instructions
No ratings yet
ECO1104, Connect, Purchase Options & Registration Instructions
11 pages
Java
No ratings yet
Java
10 pages
.NET Framework Guide for Developers
No ratings yet
.NET Framework Guide for Developers
62 pages
FANUC Series 16i 18i 160i 180i-WB Programmable Parameter Input Specifications
No ratings yet
FANUC Series 16i 18i 160i 180i-WB Programmable Parameter Input Specifications
3 pages
WANem 1.1 Setup Guide
No ratings yet
WANem 1.1 Setup Guide
12 pages
ABW2011IQM15
No ratings yet
ABW2011IQM15
21 pages
1 Introduction To Computing Technology
No ratings yet
1 Introduction To Computing Technology
46 pages
Synchronous Circuit Design & Timing
No ratings yet
Synchronous Circuit Design & Timing
21 pages
Gujarat Technological University
No ratings yet
Gujarat Technological University
2 pages
6.1 - Faults Alarms G120 CU240
No ratings yet
6.1 - Faults Alarms G120 CU240
100 pages
Mastermind Monitoring 6 22 01 PDF
No ratings yet
Mastermind Monitoring 6 22 01 PDF
209 pages
IBM Total Storage DS6000 Command-Line Interface User's Guide
No ratings yet
IBM Total Storage DS6000 Command-Line Interface User's Guide
636 pages
Miracle Advance Android Tool V1.2
100% (1)
Miracle Advance Android Tool V1.2
2 pages
Micro C Programming
No ratings yet
Micro C Programming
21 pages
Poweredge R220 Rack Server: The Dell Online Store: Build Your System
No ratings yet
Poweredge R220 Rack Server: The Dell Online Store: Build Your System
3 pages
Secret Codes For Phone
No ratings yet
Secret Codes For Phone
13 pages
Java Pattern
No ratings yet
Java Pattern
12 pages
Codename One: A Lightweight Mobile Framework
No ratings yet
Codename One: A Lightweight Mobile Framework
18 pages
Chapter 8 - Introduction To FreeRTOS
No ratings yet
Chapter 8 - Introduction To FreeRTOS
8 pages
PC Hardware Price List
No ratings yet
PC Hardware Price List
2 pages
Synopsis: Project: NGO Management System
No ratings yet
Synopsis: Project: NGO Management System
7 pages
ARM An ARMv8.1-M Performance Monitoring User Guide
No ratings yet
ARM An ARMv8.1-M Performance Monitoring User Guide
58 pages
Vulcan Introduction Tutorial
100% (6)
Vulcan Introduction Tutorial
24 pages
Akarshan Gupta: Work Experience Skills
No ratings yet
Akarshan Gupta: Work Experience Skills
1 page

Dynamo

Uploaded by

Dynamo

Uploaded by

Dynamo: Amazons Highly Available Key-value Store

System Assumptions and Requirements

item that is uniquely identified by a key.

Atomicity, Consistency, Isolation,

Service Level Agreements (SLA)

functionality in abounded time: Every dependency in the

Example: service guaranteeing

Service-oriented architecture of Amazons platform

Conflict resolution is executed during read

instead of write, i.e. always writeable. Other principles:

Incremental scalability. Symmetry. Decentralization. Heterogeneity.

Summary of techniques used in Dynamo and their advantages

Handling temporary failures

Sloppy Quorum and hinted handoff

Recovering from permanent failures

Anti-entropy using Merkle trees

Membership and failure detection

Gossip-based membership protocol and failure detection.

Nodes: Each node can

be responsible for more than one virtual node.

Advantages of using virtual nodes

Vector clock example

Execution of get () and put () operations

Merkle hash tree.

Membership and Failure Detection:

Local persistence component allows for

different storage engines to be plugged in:

You might also like