Distributed DB
Distributed DB
- MAGESH R (23MCA0091)
- MANISH KUMAR V (23MCA0114)
Definition
1. Complexity:
The design and management of Distributed DBMS are very complex especially the
heterogeneous DDBMS since it can use different software.
2. Increased Storage:
Data may be replicated at several sites which leads to increase storage requirements.
3. Difficulty in maintaining integrity:
Integrity refers to the consistency of data. When the data is replicated at multiple sites, all
of them need to be updated if a change is made to one.
4. Communication costs:
The need for the sites to communicate with each other adds more complexity and cost.
5. Security:
Since data is stored at multiple sites, the security risk increases.
A real life example
Consider a company like Walmart which has branches all over the USA. Each branch stores
information about the customers, products and purchases in that branch. The schema can look
something like this
Customers(ID, Name, Email, Address, Phone No)
Products(ID, Name, Category, Price)
Purchases(CustomerID, ProductID, Timestamp)
Suppose the CEO wants to know the number of purchases in the whole of USA. In the manual
approach, we would have to log in to each branch and run a query to get the count of purchases and
then combine the results. This can be very time-consuming.
But if the system is a distributed database, we can get the count of all purchases by using a single
query.