Distributed Databases: Indu Saini (Research Scholar) IIT Roorkee Enrollment No.: 10926003
Distributed Databases: Indu Saini (Research Scholar) IIT Roorkee Enrollment No.: 10926003
Submitted by
Indu Saini
(Research Scholar)
IIT Roorkee
Contents
1. Abstract…………………………………………………………..…………..2
2. Introduction to Distributed Databases……………………………………….2
3. Types of Distributed Databases……………………………………………...4
4. Advantages of Distributed Databases………………………………………..5
5. Disadvantages or Challenges of Distributed Databases……………………..7
6. Distributed Database Design Techniques……………………………………9
7. Conclusion…………………………………………………………………...6
8. References……………………………………………………………………7
Distributed Databases |2
1. ABSTRACT
The idea is that instead of having one, centralized database, we are going to
spread the data out among the cities on the distributed network, each of which
has its own computer and data storage facilities. All of this distributed data is
still considered to be a single logical database. When a person or process
anywhere on the distributed network queries the database, it is not necessary to
know where on the network the data being sought is located. The user just
issues the query, and the result is returned. This feature is known as location
Distributed Databases |3
transparency. This can become rather complex very quickly, and it must be
managed by sophisticated software known as a distributed database
management system or distributed DBMS.
In a heterogeneous system, sites may run different DBMS products, which need
not be based on the same underlying data model, and so the system may be
composed of relational, network, hierarchical, and object oriented DBMSs.
Homogeneous systems are much easier to design and manage. This approach
provides incremental growth, making the addition of a new site to the DDBMS
easy, and allows increased performance by exploiting the parallel processing
capability of multiple sites.
new sites can be added to the network without affecting the operations of
other sites.
Local Autonomy: A department can control the data about them. Means
that:
Local data is locally owned and managed;
Local operations remain purely local;
All operations at a given site are controlled by that site.
5. DISADVANTAGES OR CHALLENGES OF DISTRIBUTED
DATABASES
6.1. FRAGMENTATION
The main reasons of fragmentation of the relations are to: increase locality of
reference of the queries submitted to database, improve reliability and
availability of data and performance of the system, balance storage capacities
and minimize communication costs among sites.
Replication and duplication are two processes to that the distributive databases
are up to date and current.
Replication involves using specialized software that looks for changes in the
distributive database. Once the changes have been identified, the replication
process makes all the databases look the same. The replication process can be
very complex and time consuming depending on the size and number of the
distributive databases. This process can also require a lot of time and computer
resources.
The term replication refers to the operation of copying and maintaining database
objects in multiple databases belonging to a distributed system. While
replication relies on distributed database technology, database replication offers
D i s t r i b u t e d D a t a b a s e s | 12
applications benefits that are not possible within a pure distributed database
environment. Replication uses distributed database technology to share data
between multiple sites, but a replicated database and a distributed database are
not the same. In a distributed database, data is available at many locations, but a
particular table resides at only one location. Replication means that the same
data is available at multiple locations.
Replication provides fast, local access to shared data because it balances activity
over multiple sites. Some users can access one server while other users access
other servers, thereby reducing the load at all servers. Also, users can access
data from the replication site that has the lowest access cost, which is typically
the site that is geographically closest to them. Replication can be used to
distribute data over multiple regional locations. Then, applications can access
various regional servers instead of accessing one central server. This
configuration can reduce network load dramatically.
7. CONCLUSION
8. REFERENCES