Distributed DB

A distributed database system stores data across multiple nodes that are connected via a network. There are two main types: homogeneous, where all nodes use the same database system, and heterogeneous, where nodes can use different systems. Data can be stored using replication, where copies are kept at different nodes, or fragmentation, where relations are split into smaller parts across nodes. Popular architectures include client-server, peer-to-peer, and shared-nothing. Distributed databases improve reliability, allow data sharing, and enable faster processing through parallelism.

Uploaded by

Archana Saravanan

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views

Distributed DB

Uploaded by

Archana Saravanan

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 16

Distributed Database Systems

- MAGESH R (23MCA0091)
- MANISH KUMAR V (23MCA0114)
Definition

 A distributed database is a database that is spread across multiple locations or nodes,

typically in a network or across the internet. In a distributed database system, data is
stored on different computers or servers, and these computers are connected to each
other, allowing them to communicate and coordinate in managing the data.
 A centralized distributed database management system (DDBMS) manages the
distributed data as if it were stored in one physical location. DDBMS synchronizes all
data operations among databases and ensures that the updates in one database
automatically reflect on databases in other sites.
Types

 There are two types of distributed databases:

 Homogenous
 Heterogenous
Homogenous

 A homogeneous distributed database is a

network of databases spread across multiple
locations, where each database shares the same
database management system (DDBMS), data
model, and operating system.
 This uniformity simplifies management,
ensuring that all nodes within the distributed
system operate with identical structures and
software. It allows for seamless communication
between nodes, making tasks like data access
and updates consistent and straightforward.
Heterogenous

 A heterogeneous distributed database is a network of

databases that are spread across different locations and
operate with diverse database management systems
(DBMS), data models, or operating systems. Unlike
homogeneous distributed databases, which maintain
uniformity in software and structure across all nodes,
heterogeneous systems embrace diversity.
 Each node within the network may use distinct
technologies, making it more adaptable to specific
requirements. In the case of a heterogeneous
distributed database, a particular site can be
completely unaware of other sites causing limited
cooperation in processing user requests.
Distributed Database Storage

 Distributed database storage is managed in two ways:

 Replication
 Fragmentation
Replication

 In database replication, the systems store copies of data on

different sites. If an entire database is available on multiple
sites, it is a fully redundant database.The advantage of
database replication is that it increases data availability on
different sites and allows for parallel query requests to be
processed.
 However, database replication means that data requires
constant updates and synchronization with other sites to
maintain an exact database copy. Any changes made on one
site must be recorded on other sites, or else inconsistencies
occur.
 Constant updates cause a lot of server overhead and
complicate concurrency control, as a lot of concurrent
queries must be checked in all available sites.
Fragmentation

 When it comes to fragmentation of distributed database storage, the relations are

fragmented, which means they are split into smaller parts. Each of the fragments is stored
on a different site, where it is required.The prerequisite for fragmentation is to make sure
that the fragments can later be reconstructed into the original relation without losing data.
 The advantage of fragmentation is that there are no data copies, which prevents data
inconsistency.
 There are two types of fragmentation:
 Horizontal fragmentation
 Vertical fragmentation
 Horizontal fragmentation - The relation  Vertical fragmentation - The relation schema
schema is fragmented into groups of rows, and is fragmented into smaller schemas, and each
each group (tuple) is assigned to one fragment contains a common candidate key to
fragment. guarantee a lossless join.
Client Server Architecture

 A common method for spreading database functionality is

the client−server architecture. Clients communicate with
a central server, which controls the distributed database
system, in this design.
 The server is in charge of maintaining data storage,
controlling access, and organizing transactions. This
architecture has several clients and servers connected. A
client sends a query and the server which is available at
the earliest would help solve it. This Architecture is
simple to execute because of the centralised server
system.
Peer-Peer Architecture

 Each node in the distributed database system may

function as both a client and a server in a peer−to−peer
architecture. Each node is linked to the others and works
together to process and store data.
 Each node is in charge of managing its data management
and organizing node−to−node interactions. Because the
loss of a single node does not cause the system to
collapse, peer−to−peer systems provide decentralized
control and high fault tolerance.
 This design is ideal for distributed systems with nodes
that can function independently and with equal
capabilities.
Federated Architecture

 Multiple independent databases with various types are

combined into a single meta−database using a
federated database design. It offers a uniform interface
for navigating and exploring distributed data.
 In the federated design, each site maintains a
separate, independent database, while the virtual
database manager internally distributes requests.
When working with several data sources or legacy
systems that can't be simply updated, federated
architectures are helpful.
Shared Nothing Architecture

 Data is divided up and spread among several nodes in a

shared−nothing architecture, with each node in charge
of a particular portion of the data. Resources are not
shared across nodes, and each node runs independently.
 Due to the system's capacity to add additional nodes as
needed without affecting the current nodes, this design
offers great scalability and fault tolerance. Large−scale
distributed systems, such as data warehouses or big
data analytics platforms, frequently employ
shared−nothing designs.
Advantages
1. Reliability:
Data may be replicated in several sites so that the failure of a single site does not make
the data inaccessible.
2. Information Sharing:
Users in one site can access the data present in other sites.
3. Faster data processing:
A distributed database allows for the processing of data at several sites simultaneously.
4. Faster data access:
In a distributed system, the data is usually stored at the site where the demand for it is the
greatest. This can lead to faster access of the data and better performance.
5. Autonomy::
Each site retains some level of control over its data, unlike a central database.
6. Modularity:
New sites can be added and removed when required thus improving flexibility.
Dis Advantages

1. Complexity:
The design and management of Distributed DBMS are very complex especially the
heterogeneous DDBMS since it can use different software.
2. Increased Storage:
Data may be replicated at several sites which leads to increase storage requirements.
3. Difficulty in maintaining integrity:
Integrity refers to the consistency of data. When the data is replicated at multiple sites, all
of them need to be updated if a change is made to one.
4. Communication costs:
The need for the sites to communicate with each other adds more complexity and cost.
5. Security:
Since data is stored at multiple sites, the security risk increases.
A real life example

 Consider a company like Walmart which has branches all over the USA. Each branch stores
information about the customers, products and purchases in that branch. The schema can look
something like this
 Customers(ID, Name, Email, Address, Phone No)
Products(ID, Name, Category, Price)
Purchases(CustomerID, ProductID, Timestamp)
 Suppose the CEO wants to know the number of purchases in the whole of USA. In the manual
approach, we would have to log in to each branch and run a query to get the count of purchases and
then combine the results. This can be very time-consuming.
 But if the system is a distributed database, we can get the count of all purchases by using a single
query.

EE Lab Manuls Fast Nu
No ratings yet
EE Lab Manuls Fast Nu
69 pages
MT6797 Android Scatter
No ratings yet
MT6797 Android Scatter
11 pages
Distributed Database System
No ratings yet
Distributed Database System
4 pages
Unit 2 DDMS
No ratings yet
Unit 2 DDMS
26 pages
CH.4
No ratings yet
CH.4
16 pages
Unit - 2 (1) DBMS
No ratings yet
Unit - 2 (1) DBMS
25 pages
Unit-2_Distributed Database System
No ratings yet
Unit-2_Distributed Database System
7 pages
What Is A Distributed Database
No ratings yet
What Is A Distributed Database
8 pages
Distributed Databases Introduction
100% (1)
Distributed Databases Introduction
16 pages
Module 1
No ratings yet
Module 1
24 pages
Intro To DDBMS
No ratings yet
Intro To DDBMS
12 pages
Distributed Data Model
No ratings yet
Distributed Data Model
11 pages
ADS Chapter 7 Distributed Database
No ratings yet
ADS Chapter 7 Distributed Database
16 pages
Advanced Database Chapter 6 and 7
No ratings yet
Advanced Database Chapter 6 and 7
30 pages
MC4202 - Adavanced Database Technology
No ratings yet
MC4202 - Adavanced Database Technology
159 pages
Distributed Database Vs Conventional Database
50% (2)
Distributed Database Vs Conventional Database
4 pages
DDB-distribution Database Important.
No ratings yet
DDB-distribution Database Important.
15 pages
Distributed Databases
No ratings yet
Distributed Databases
46 pages
Distributed Databases: Indu Saini (Research Scholar) IIT Roorkee Enrollment No.: 10926003
No ratings yet
Distributed Databases: Indu Saini (Research Scholar) IIT Roorkee Enrollment No.: 10926003
14 pages
Distributed Databases
No ratings yet
Distributed Databases
39 pages
Unit-Iii Distributed Database: System
No ratings yet
Unit-Iii Distributed Database: System
55 pages
ADT Notes
No ratings yet
ADT Notes
36 pages
System Admin and Server Integration
No ratings yet
System Admin and Server Integration
3 pages
Distributed DBMS
No ratings yet
Distributed DBMS
62 pages
Distributed DB
No ratings yet
Distributed DB
43 pages
Chapter 6 Distributed System Management
No ratings yet
Chapter 6 Distributed System Management
12 pages
Unit 1
No ratings yet
Unit 1
12 pages
Assignment 01
No ratings yet
Assignment 01
6 pages
Unit 4 DBMS
No ratings yet
Unit 4 DBMS
15 pages
Distributed Database System
No ratings yet
Distributed Database System
9 pages
ADBMS Presentation
No ratings yet
ADBMS Presentation
5 pages
UNIT- 1 DDB
No ratings yet
UNIT- 1 DDB
34 pages
Distributed Database Management System
No ratings yet
Distributed Database Management System
5 pages
Distributed Database
No ratings yet
Distributed Database
12 pages
Distributed DB
No ratings yet
Distributed DB
4 pages
Distributed Database
No ratings yet
Distributed Database
9 pages
Dd Mid Answers
No ratings yet
Dd Mid Answers
29 pages
DDB.NOTES
No ratings yet
DDB.NOTES
19 pages
Advanced Data Base Management Systems
No ratings yet
Advanced Data Base Management Systems
35 pages
Distributed Databases
100% (1)
Distributed Databases
26 pages
Distributed Database: Database Database Management System Storage Devices CPU Computers Network
No ratings yet
Distributed Database: Database Database Management System Storage Devices CPU Computers Network
15 pages
Unit 5
No ratings yet
Unit 5
28 pages
Advance Concept in Data Bases Unit-3 by Arun Pratap Singh
100% (2)
Advance Concept in Data Bases Unit-3 by Arun Pratap Singh
81 pages
Distributed Database
100% (1)
Distributed Database
24 pages
Question No 1 DDBMS Advantages and Disadvantage:: Example
No ratings yet
Question No 1 DDBMS Advantages and Disadvantage:: Example
3 pages
Practical No. 1: Aim: Study About Distributed Database System. Theory
No ratings yet
Practical No. 1: Aim: Study About Distributed Database System. Theory
22 pages
DDBS Lec1
No ratings yet
DDBS Lec1
20 pages
Adt Unitnotes 1to3
No ratings yet
Adt Unitnotes 1to3
107 pages
Adt Unit I
No ratings yet
Adt Unit I
18 pages
ADBMS Exam Question Answers
No ratings yet
ADBMS Exam Question Answers
54 pages
Ddbms Notes
No ratings yet
Ddbms Notes
21 pages
Sakshi dbms2
No ratings yet
Sakshi dbms2
55 pages
Distributed Database: Database Storage Devices CPU Database Management System Computers Network
No ratings yet
Distributed Database: Database Storage Devices CPU Database Management System Computers Network
9 pages
Advance DB Notes
No ratings yet
Advance DB Notes
5 pages
Team:DBMS: by Navdeep Kaur Assistant Professor Computer Science Department
No ratings yet
Team:DBMS: by Navdeep Kaur Assistant Professor Computer Science Department
19 pages
Parallel and Distributed Databases
No ratings yet
Parallel and Distributed Databases
7 pages
advanced database individual assignment
No ratings yet
advanced database individual assignment
4 pages
JK DBMS Ii Year (48P X 62C) Unit V
No ratings yet
JK DBMS Ii Year (48P X 62C) Unit V
48 pages
Distributed DBMS - Database Environments
No ratings yet
Distributed DBMS - Database Environments
7 pages
Distributed Database: Source
No ratings yet
Distributed Database: Source
19 pages
Database And Computer Management: SERIES 1, #3
From Everand
Database And Computer Management: SERIES 1, #3
Elias Mutegi
No ratings yet
Database Management System
From Everand
Database Management System
Knowledge Flow
No ratings yet
Created by - Patel Nehal - Sonpal Ripal - Patel Komal
50% (4)
Created by - Patel Nehal - Sonpal Ripal - Patel Komal
28 pages
DX Diag
No ratings yet
DX Diag
30 pages
Kibana, Grafana and Zeppelin On Monitoring Data
No ratings yet
Kibana, Grafana and Zeppelin On Monitoring Data
21 pages
P2V Checklist: Task Checked
No ratings yet
P2V Checklist: Task Checked
1 page
VLSI Architectures For Iterative Decoders in Magnetic Recording Channels
No ratings yet
VLSI Architectures For Iterative Decoders in Magnetic Recording Channels
8 pages
AZ 700 Checklist
No ratings yet
AZ 700 Checklist
6 pages
IPsec PDF
No ratings yet
IPsec PDF
14 pages
8086 Microprocessor
No ratings yet
8086 Microprocessor
89 pages
XML Step by Step - Pradeep
No ratings yet
XML Step by Step - Pradeep
17 pages
Data Access
No ratings yet
Data Access
11 pages
Robocopy Guide CMD
No ratings yet
Robocopy Guide CMD
8 pages
Computer Architecture - Memory System
100% (1)
Computer Architecture - Memory System
22 pages
Criminal Story:: Recover OPM Deleted Files Using Prodiscover 2018-19
No ratings yet
Criminal Story:: Recover OPM Deleted Files Using Prodiscover 2018-19
10 pages
Object Oriented Programming
No ratings yet
Object Oriented Programming
18 pages
Summary CPP
No ratings yet
Summary CPP
2 pages
Azure Synapse Course Presentation
100% (1)
Azure Synapse Course Presentation
261 pages
MT6580 Android Scatter
No ratings yet
MT6580 Android Scatter
7 pages
Tibero v5.0 SP1 Administrator's Guide v2.1.1 en
No ratings yet
Tibero v5.0 SP1 Administrator's Guide v2.1.1 en
310 pages
E - B R E: Exercises - Basic Regular Expressions
No ratings yet
E - B R E: Exercises - Basic Regular Expressions
6 pages
Knoppix
No ratings yet
Knoppix
14 pages
How To Cancel/ Restart The Cost Manager:: More Create Blog Sign in
No ratings yet
How To Cancel/ Restart The Cost Manager:: More Create Blog Sign in
4 pages
AC800M Modbus Interface White Paper
No ratings yet
AC800M Modbus Interface White Paper
25 pages
Data Structure - ch2 PDF
No ratings yet
Data Structure - ch2 PDF
54 pages
Cloning A Database Using RMAN
No ratings yet
Cloning A Database Using RMAN
7 pages
Pentest-Report Fdroid PDF
No ratings yet
Pentest-Report Fdroid PDF
17 pages
Unit 1: Microprocessors and Microcontroller.: The 8051 Architecture: Introduction, Architecture of 8051
No ratings yet
Unit 1: Microprocessors and Microcontroller.: The 8051 Architecture: Introduction, Architecture of 8051
44 pages
L1 FloatingPointNumbers Intro
No ratings yet
L1 FloatingPointNumbers Intro
17 pages
A Micro Project Report ON: Under The Guidance of
No ratings yet
A Micro Project Report ON: Under The Guidance of
10 pages