Distributed Database
Source:
1. Principles of Distributed Database Systems
By M. Tamer Özsu and Patrick Valduriez
2. Slides available
Survey of advanced topics in Database Systems
It was predicted that by 1998 centralized database management
systems (DBMSs) would be an “antique curiosity” and most
organizations would have moved toward distributed database
management. At the time, distribution was only slowly taking hold,
and “client/server” computing had just started.
These systems were generally multiple client/single server
systems in which the distribution was mostly in terms of
functionality, not data. If multiple servers were used, clients
were responsible for managing the connections to these servers.
Transparency of access was not widely supported, and each
client had to “know” the location of the required data. The
distribution of data among multiple servers was very primitive;
systems did not support fragmentation or replication of data.
Systems of the time were “homogeneous” in that each system
could manage only data that were stored in its own database,
with no linkage to other repositories.
Today’s client/server systems provide significant
transparency in accessing data from multiple servers, support
distributed transactions to facilitate transparency, and
execute queries over (horizontally) fragmented data.
Further, new systems implement both synchronous and
asynchronous replication protocols, and many vendors have
introduced gateways to access other databases.
Significant achievements have taken place in the development
and deployment of parallel database servers.
Object database managers have entered the marketplace and
have found a niche market in some classes of applications
which are inherently distributed.
Distributed Database System
Distributed database system (DDBS) technology is one of the
major recent developments in the database systems area.
DDBS technology is the union of what appear to be two
diametrically opposed approaches to data processing:
database system and computer network technologies.
One of the major motivations behind the use of database
systems is the desire to integrate the operational data of an
enterprise and to provide centralized, thus controlled access
to that data.
Fig. 1.2: Database Processing
The technology of computer networks promotes a mode of
work that goes against all centralization efforts.
The most important objective of the database technology is
integration, not centralization. It is possible to achieve
integration without centralization, and that is exactly what
the DDB technology attempts to achieve.
Distributed Data Processing
Distributed computing system: It is a number of autonomous
processing elements (not necessarily homogeneous) that are
interconnected by a computer network and that cooperate in
performing their assigned tasks. The “processing element” is
a computing device that can execute a program on its own.
What is being distributed?
- Processing logic: Processing logic or processing elements
are distributed.
- Function: Various functions of a computer system could be
delegated to various pieces of hardware or software.
- Data: Data used by a number of applications may be
distributed to a number of processing sites.
- Control: The control of the execution of various tasks might
be distributed instead of being performed by one computer
system.
Why do we distribute at all?
- Distributed processing better corresponds to the
organizational structure of today’s widely distributed
enterprises, and such a system is more reliable and more
responsive.
- Many of the current applications of computer technology
are inherently distributed. Electronic commerce over the
internet, multimedia applications such as news-on-demand,
manufacturing control systems are all examples of such
applications.
- The fundamental reason behind distributed processing is to
be better able to solve the big and complicated problems
simply by dividing them into smaller pieces (using a variation
of the divide-and-conquer rule) and assigning them to
different software groups.
Advantages
Distributed computing provides an economical method of
harnessing more computing power by employing multiple
processing elements optimally.
By attacking these problems in smaller groups working more or
less autonomously, it might be possible to discipline the cost of
software development.
Distributed Database System
Distributed database: It is a collection of multiple, logically
interrelated databases distributed over a computer network.
Distributed database management system (Distributed DBMS):
A distributed DBMS is defined as the software system that
permits the management of the DDBS and makes the
distribution transparent to the users.
Fig. 1.6: Central database on a network (Not a DDBS)
Fig. 1.7: DDBS environment
What is a Distributed Database System?
• A distributed database system is a collection of databases which
are distributed over different computers of a computer network.
• Each site has autonomous processing capability and can perform
local applications.
• Each site also participates in the execution of at least one global
application which requires accessing data at several sites.
[Figure: Server 1, Server 2 and Server 3, each with a local database
(Database 1, Database 2, Database 3), connected by a communication
network]
Promises of DDBSs
1) Transparent management of distributed and replicated data
Transparency refers to separation of the higher level semantics of
a system from lower level implementation issues. A transparent
system hides the implementation details from users.
Consider an engineering firm that has offices in Boston,
Edmonton, Paris, and San Francisco. They run projects at each of
these sites and would like to maintain a database of their
employees, the projects and other related data.
- Assuming that the database is relational, we need the following
relations:
EMP(ENO, ENAME, TITLE)
PROJ(PNO, PNAME, BUDGET)
PAY(TITLE, SAL)
ASG(ENO, PNO, DUR, RESP)
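The text gives only the attribute names. As a concrete sketch, the
schema could be created with Python’s built-in sqlite3 module; the
column types below are assumptions, not part of the original example.

```python
import sqlite3

# Build the example schema in an in-memory SQLite database.
# Column types are assumptions; the text lists only attribute names.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE EMP  (ENO TEXT PRIMARY KEY, ENAME TEXT, TITLE TEXT);
    CREATE TABLE PROJ (PNO TEXT PRIMARY KEY, PNAME TEXT, BUDGET REAL);
    CREATE TABLE PAY  (TITLE TEXT PRIMARY KEY, SAL REAL);
    CREATE TABLE ASG  (ENO TEXT, PNO TEXT, DUR INTEGER, RESP TEXT,
                       PRIMARY KEY (ENO, PNO));
""")
```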
Fig. 1.8: A distributed application
- In a DDBS, we would like to localize the data such that data
about the employees in the Edmonton office are stored in
Edmonton, those in the Boston office are stored in Boston, and
so forth. The same applies to the project and salary information.
We partition each of the relations and store each partition at a
different site. This is known as fragmentation. It may be
preferable to duplicate some of this data at other sites for
performance and reliability reasons. The result is a distributed
database which is fragmented and replicated.
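A minimal sketch of this idea in Python, assuming each EMP tuple
carries a hypothetical CITY tag naming the office that owns it (the
schema above records no location): rows are partitioned into one
horizontal fragment per site, and each fragment is additionally
copied to a backup site.

```python
from collections import defaultdict

def fragment(rows, site_of):
    """Partition rows into disjoint horizontal fragments, one per site."""
    fragments = defaultdict(list)
    for row in rows:
        fragments[site_of(row)].append(row)
    return fragments

def replicate(fragments, backup_site):
    """Keep a second copy of each fragment at one extra site for reliability."""
    return {site: {site, backup_site} for site in fragments}

# Hypothetical EMP tuples, each tagged with the office ("CITY") that owns it.
emp_rows = [
    {"ENO": "E1", "ENAME": "J. Doe",   "TITLE": "Elect. Eng.", "CITY": "Boston"},
    {"ENO": "E2", "ENAME": "M. Smith", "TITLE": "Analyst",     "CITY": "Paris"},
]
frags = fragment(emp_rows, site_of=lambda r: r["CITY"])
print(replicate(frags, backup_site="Edmonton"))
```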
2) Reliability through distributed transactions
Distributed DBMSs are intended to improve reliability since they
have replicated components and thereby eliminate single points of
failure.
The failure of a single site, or the failure of a communication
link that makes one or more sites unreachable, is not sufficient
to bring down the entire system.
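The section does not prescribe a particular protocol; the classical
way a distributed transaction reaches a single commit-or-abort
outcome across sites is two-phase commit. A minimal coordinator
sketch, with hypothetical participant objects and without the
logging and failure handling a real protocol needs:

```python
# Minimal two-phase commit coordinator; `participants` are hypothetical
# objects representing the sites involved in one distributed transaction.
def two_phase_commit(participants):
    # Phase 1 (voting): ask every participant whether it can commit.
    if all(p.vote() for p in participants):
        # Phase 2 (decision): unanimous "yes" -> commit at every site.
        for p in participants:
            p.commit()
        return "committed"
    # Any "no" vote (or failed site) -> abort at every site.
    for p in participants:
        p.abort()
    return "aborted"
```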
3) Improved Performance
1. A DDBMS fragments the conceptual database, enabling data
to be stored in close proximity to its points of use (data
localization).
• Since each site handles only a portion of the database,
contention for CPU and I/O services is not as severe as
for centralized databases
• Localization reduces remote access
2. The inherent parallelism of distributed systems may be
exploited for inter-query and intra-query parallelism.
• Inter-query parallelism results from the ability to
execute multiple queries at the same time.
• On the other hand, intra-query parallelism is achieved
by breaking up a single query into a number of
subqueries each of which is executed at a different site,
accessing a different part of the distributed database.
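As an illustration of intra-query parallelism, the sketch below
sends the same selection to every site holding a fragment, runs the
subqueries concurrently, and unions the partial results; the site
names and fragment contents are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical horizontal fragments of EMP, one per site.
FRAGMENTS = {
    "Boston":   [{"ENO": "E1", "TITLE": "Analyst"}],
    "Edmonton": [{"ENO": "E3", "TITLE": "Analyst"},
                 {"ENO": "E4", "TITLE": "Programmer"}],
}

def subquery(site, predicate):
    """Run the selection locally at one site (here: an in-memory scan)."""
    return [row for row in FRAGMENTS[site] if predicate(row)]

def parallel_select(predicate):
    """Send the same subquery to every site and union the partial results."""
    with ThreadPoolExecutor() as pool:
        parts = pool.map(lambda site: subquery(site, predicate), FRAGMENTS)
        return [row for part in parts for row in part]

print(parallel_select(lambda r: r["TITLE"] == "Analyst"))
```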
Potential Problems
Complexity: Distributed database systems are more complex than
centralized ones.
Cost: Distributed systems require additional hardware
(communication mechanisms etc.) and thus have increased hardware
costs.
Distribution of control: Distribution creates problems of
synchronization and coordination.
Security: In a distributed database system, a network is involved
which is a medium that has its own security requirements. Thus,
the security problems in distributed database systems are by
nature more complicated than in centralized ones.
Problem Areas
Distributed database design
Distributed query processing
Distributed directory management
Distributed concurrency control
Distributed deadlock management
Reliability of distributed DBMS
Operating system support
Heterogeneous databases
Relationship among problems
Distributed Database Design
To place the database and applications across different sites,
there are two alternatives: i) partitioned (or non-replicated) and
ii) replicated.
◦ Partitioned scheme: Database is divided into a number of
disjoint partitions each of which is placed at a different site.
◦ Replicated scheme: It can be fully replicated where the entire
database is stored at each site, or partially replicated where
each partition of the database is stored at more than one site
but not at all the sites.
Two fundamental design issues are fragmentation, the separation of
the database into partitions called fragments, and distribution,
the optimum placement of these fragments across the sites.
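The placement alternatives above can be contrasted with a small
sketch; the fragment and site names used here are hypothetical.

```python
# Hypothetical fragments and sites, used only to illustrate the alternatives.
SITES = ["S1", "S2", "S3"]
FRAGMENTS = ["EMP1", "EMP2", "PROJ1"]

def partitioned(fragments, sites):
    """Non-replicated: each fragment is placed at exactly one site."""
    return {f: [sites[i % len(sites)]] for i, f in enumerate(fragments)}

def fully_replicated(fragments, sites):
    """The entire database is stored at every site."""
    return {f: list(sites) for f in fragments}

def partially_replicated(fragments, sites, copies=2):
    """Each fragment is stored at more than one site, but not at all of them."""
    return {f: [sites[(i + k) % len(sites)] for k in range(copies)]
            for i, f in enumerate(fragments)}

print(partially_replicated(FRAGMENTS, SITES))
```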
Distributed Query Processing
Query processing deals with designing algorithms that analyze
queries and convert them into a series of data manipulation
operations. The problem is how to decide on a strategy for
executing each query over the network in the most cost-effective
way.
The objective is to choose an execution strategy in which the
inherent parallelism is exploited to improve the performance of
executing the transaction.
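One simple ingredient of such a strategy is comparing the estimated
communication cost of the alternatives, for example deciding which
operand of a cross-site join to ship; the relation sizes and the
per-byte cost below are hypothetical.

```python
def shipping_cost(nbytes, cost_per_byte=1.0):
    """Estimated cost of moving `nbytes` of data over the network."""
    return nbytes * cost_per_byte

def choose_join_strategy(size_emp, size_asg):
    """Pick the strategy that ships the cheaper (smaller) operand."""
    if shipping_cost(size_emp) < shipping_cost(size_asg):
        return "ship EMP to the site of ASG"
    return "ship ASG to the site of EMP"

print(choose_join_strategy(size_emp=400_000, size_asg=2_000_000))
```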
Distributed Directory Management
A directory contains information (such as descriptions and
locations) about data items in the database.
A directory may be global to the entire DDBS or local to each site;
it can be centralized at one site or distributed over several sites.
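A global, centralized directory might be sketched as a mapping from
fragment names to a description and the sites holding a copy; the
entries below, including the owning-office predicates, are
hypothetical.

```python
# Hypothetical global directory: for each fragment, a description and the
# sites that store a copy of it.
DIRECTORY = {
    "EMP1": {"relation": "EMP", "predicate": "owning office = Boston",
             "sites": ["Boston"]},
    "EMP2": {"relation": "EMP", "predicate": "owning office = Edmonton",
             "sites": ["Edmonton", "Paris"]},
}

def lookup(fragment_name):
    """Return the sites storing a fragment, so a query can be routed there."""
    return DIRECTORY[fragment_name]["sites"]

print(lookup("EMP2"))
```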
Distributed Concurrency Control
Concurrency control involves the synchronization of accesses to
the distributed database, such that the integrity of the database is
maintained.
We have to worry about both the integrity of a single database,
and about the consistency of multiple copies of the database
(mutual consistency).
Two classes of solutions are pessimistic and optimistic.
Pessimistic approaches synchronize the execution of user requests
before the execution starts, whereas optimistic approaches execute
the requests and then check whether the execution has compromised
the consistency of the database.
Locking, which is based on the mutual exclusion of accesses to data
items, can be used in both classes. Timestamping ensures that
transactions execute according to a predefined order.
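A minimal sketch of locking-based (pessimistic) synchronization at a
single site, using one mutex per data item for mutual exclusion;
deadlock handling, shared locks, and the distributed aspects are
omitted.

```python
import threading
from collections import defaultdict

class LockManager:
    """One exclusive lock per data item (mutual exclusion); deadlock
    detection and shared/read locks are omitted for brevity."""

    def __init__(self):
        self._locks = defaultdict(threading.Lock)

    def lock(self, item):
        # Block until the caller holds the lock on `item`.
        self._locks[item].acquire()

    def unlock(self, item):
        self._locks[item].release()

lm = LockManager()
lm.lock("EMP:E1")    # a transaction gains exclusive access to the item
# ... read/write the item ...
lm.unlock("EMP:E1")  # release so other transactions can proceed
```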