Distributed Data Model
Distributed Data Model
Location independent
Distributed query processing
Distributed transaction management
Hardware independent
Operating system independent
Network independent
Transaction transparency
DBMS independent
Distributed Database Architecture
In a homogenous distributed database system, all the physical locations have the same
underlying hardware and run the same operating systems and database applications.
For a distributed database system to be homogenous, the data structures at each location must
be either identical or compatible. The database application used at each location must also be
either identical or compatible.
Distributed databases are capable of modular development, meaning that systems can be
expanded by adding new computers and local data to the new site and connecting them to
the distributed system without interruption.
When failures occur in centralized databases, the system comes to a complete stop. When a
component fails in distributed database systems, however, the system will continue to
function at reduced performance until the error is fixed.
Admins can achieve lower communication costs for distributed database systems if the
data is located close to where it is used the most. This is not possible in centralized
systems.
Types of distributed databases
Replication
Replicated data is used to create instances of data in different parts of the database. By using
replicated data, distributed databases can access identical data locally, thus avoiding traffic.
Replicated data can be divided into two categories: read-only and writable data.
Read-only versions of replicated data allow revisions only to the first instance; subsequent
enterprise data replications are then adjusted. Writable data can be altered, but the first
instance is immediately changed.
2. Fragmentation –
In this approach, the relations are fragmented (i.e., they’re divided into smaller parts) and each of the
fragments is stored in different sites where they’re required. It must be made sure that the fragments
are such that they can be used to reconstruct the original relation (i.e, there isn’t any loss of data).
Fragmentation is advantageous as it doesn’t create copies of data, consistency is not a problem.
Though there are many distributed databases to choose from, some examples of distributed
databases include Apache Ignite, Apache Cassandra, Apache HBase, Couchbase Server,
Amazon SimpleDB, Clusterpoint, and FoundationDB.
Apache Ignite specializes in storing and computing large volumes of data across clusters of
nodes. In 2014, Ignite was open sourced by GridGain Systems and later accepted into the
Apache Incubator program. Apache Ignite's database uses RAM as the default storage and
processing tier.
Apache Cassandra offers support for clusters that span multiple locations, and it features its
own query language, Cassandra Query Language (CQL). Additionally, Cassandra's
replication strategies are configurable.
Distributed Database Model