Reliable Scalable Data Systems Guide

This chapter introduces the three topics the book covers: foundations of data systems, distributed data, and derived data. It notes that most applications are data-intensive rather than compute-intensive, and that the book discusses design principles for reliable, scalable, and maintainable data systems. The main concerns for data systems are reliability in the face of hardware faults, software faults, and human error; scalability as load and performance needs grow; and maintainability through operability, simplicity, and evolvability.

Uploaded by

anjaneyaprasad nidubrolu

Designing Data-Intensive Applications

Chapter 1: Reliable, Scalable, and Maintainable Applications

Book Organization
● This book is organized into three parts
○ Part I is Foundations of Data Systems, and introduces the topic and building
blocks
○ Part II is Distributed Data, and discusses the challenges when scaling to multiple
machines
○ Part III is Derived Data, and discusses integrating data, including batch and
stream processing

Introduction
● Most applications are data-intensive, not compute-intensive
● Data systems are such a successful abstraction that we use them all of the time without
thinking about it
● This book covers the principles and practicalities of data systems: what they
have in common, what distinguishes them, and how they achieve their characteristics
● The three main concerns for data systems are reliability, scalability, and maintainability

Reliability
● Simple definition of reliability is continuing to work correctly even when things go wrong
● Hardware Faults
○ Single machines are made more resilient through redundant hardware
○ Larger data and computing demands have driven a move to multi-machine
redundancy, often using software fault-tolerance techniques
● Software Errors
○ Hard to detect; bugs can lie dormant until an unusual set of circumstances
triggers them
○ A systematic error or cascading failure can take down many parts of the system
at once, unlike hardware faults, which tend to be independent
○ No quick solution; many small things each help: careful planning, thorough testing,
process isolation, allowing crash & restart, monitoring system behavior
● Human Errors
○ Can combine several approaches to deal with human error, including:
■ Minimize opportunities for human error
■ Decouple places where people make the most mistakes from places
where mistakes can cause failures
■ Allow quick & easy recovery from problems
■ Use detailed and clear monitoring
■ Provide good management & training
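
As an illustration of the "allowing crash & restart" idea above, here is a minimal sketch of a supervisor loop that restarts a task when it crashes. The names `run_with_restart` and `make_flaky_task` are hypothetical and not from the book; a real system would more likely rely on a process supervisor or an orchestrator than on an in-process loop like this one.

```python
import time
from typing import Callable


def run_with_restart(task: Callable[[], str],
                     max_restarts: int = 3,
                     backoff_seconds: float = 0.0) -> str:
    """Run `task`; if it raises, restart it up to `max_restarts` times.

    Hypothetical helper for illustration only.
    """
    restarts = 0
    while True:
        try:
            return task()
        except Exception as exc:
            if restarts >= max_restarts:
                # Give up and surface the fault instead of looping forever.
                raise
            restarts += 1
            print(f"task crashed ({exc!r}); restart {restarts}/{max_restarts}")
            # Wait a little longer after each crash (linear backoff).
            time.sleep(backoff_seconds * restarts)


def make_flaky_task(failures_before_success: int) -> Callable[[], str]:
    """Build a task that fails a fixed number of times, then succeeds.

    Stands in for a transient software fault (an assumption for the demo).
    """
    state = {"calls": 0}

    def task() -> str:
        state["calls"] += 1
        if state["calls"] <= failures_before_success:
            raise RuntimeError("simulated transient fault")
        return f"ok after {state['calls']} calls"

    return task


if __name__ == "__main__":
    print(run_with_restart(make_flaky_task(2)))
```

The restart limit matters: without it, a systematic bug that fails every time would keep the loop spinning instead of being reported through monitoring.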

Scalability
● A system that works reliably today won’t necessarily stay reliable with 10x the users
● Planning for scalability means asking: if the system grows in a particular way, what are
the options for coping with that growth?
● Describing Load
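○ Describe load with a few numbers called load parameters, e.g. Twitter’s post tweet
operation averages 4.6k requests/sec, with a peak of over 12k requests/sec
○ Example: Twitter’s two designs for home timelines, either looking up the tweets of
everyone a user follows at read time, or fanning each new tweet out to followers’
timeline caches at write time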
● Describing Performance
○ Common concerns are response time and throughput
○ Metrics like response time are reported in percentiles, e.g. median, 95th, 99th
● Approaches for Coping with Load
○ Can scale up (move to a more powerful machine) or scale out (distribute the load
across multiple machines)
○ Stateless services are easy to scale out, but scaling out stateful systems can
introduce a lot of complexity
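
To make the percentile bullet above concrete, here is a minimal sketch that computes the median, 95th, and 99th percentile of a list of response times using the nearest-rank method. The function names and the sample data are assumptions for illustration; production systems usually compute percentiles over a rolling window with approximate data structures rather than by sorting every measurement.

```python
from typing import Dict, List


def percentile(sorted_times: List[float], p: int) -> float:
    """Nearest-rank percentile of an already sorted list (p in 1..100)."""
    if not sorted_times:
        raise ValueError("need at least one measurement")
    n = len(sorted_times)
    # Integer ceiling of p * n / 100, which avoids floating-point rounding.
    rank = max(1, (p * n + 99) // 100)
    return sorted_times[rank - 1]


def summarize(response_times_ms: List[float]) -> Dict[str, float]:
    """Report the mean alongside the percentiles to show how they differ."""
    ordered = sorted(response_times_ms)
    return {
        "mean": sum(ordered) / len(ordered),
        "p50": percentile(ordered, 50),
        "p95": percentile(ordered, 95),
        "p99": percentile(ordered, 99),
    }


if __name__ == "__main__":
    # Made-up sample: 90 fast requests, 8 slower ones, 2 very slow outliers.
    sample = [20.0] * 90 + [100.0] * 8 + [2000.0] * 2
    print(summarize(sample))
```

With this made-up sample the mean is 66 ms, a value no request actually experienced, while the median is 20 ms and the 99th percentile is 2000 ms. This is why the percentiles describe typical and worst-case user experience better than the average does.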

Maintainability
● Operability: Making Life Easy for Operations
○ Good operability includes:
■ Providing visibility into the runtime behavior and internals
■ Support for automation and integration with standard tools
■ Self-healing
● Simplicity: Managing Complexity
○ Good to remove accidental complexity (complexity that arises only from the
implementation, not from the problem itself)
○ A good abstraction hides implementation details behind a clean, easy-to-understand
interface
● Evolvability: Making Change Easy
○ Agile working patterns provide a framework for adapting to change
○ Evolvability here means agility at the level of a larger data system
○ Simplicity and good abstractions can go a long way toward evolvability
