Chapter 7
Chapter 7
Distributed Database-
Concepts and Design
1
Distributed Databases
• A distributed database is a collection of multiple
interconnected databases, which are spread
physically across various locations that
communicate via a computer network.
• It is single logical database physically divided
among networked computers
• Distributed DBMS: is a centralized software
system that supports and manipulates distributed
databases.
• It ensures that the data modified at any site is
universally updated. 2
Concept…
• Users access the distributed database via
applications, which are classified as those that
do not require data from other sites (local
applications) and
• Those that do require data from other sites
(global applications). We require a DDBMS to
have at least one global application.
Distributed Processing and Distributed Database
Bahrdar
Mekelle
Adama
8 of 24
Distributed Database System (DDBS) Concept
Database
Database
Bardar
Mekelle
Database
Adama
• There must be different databases as opposed to a central database
9 of 24
Features of Distributed database
• Databases in the collection are logically interrelated with
each other.
• Data is stored at a number of sites
• Sites are interconnected by a network
• DDB is logically a single database
• DDBMS has full functionality of DBMS.
• A distributed database incorporates transaction
processing, but it is not synonymous with a transaction
processing system.
Advantages of DDBMSs
• Organizational Structure
– Many organizations are naturally distributed
over several locations.
• Shareability and Local Autonomy
– The users at one site can access data stored at
other sites. Data can be placed at the site close
to the users who normally use that data. In this
way, users have local control of the data and
they can consequently establish and enforce
local policies regarding the use of this data.
– A global database administrator (DBA) is
responsible for the entire system.
Advantages of DDBMSs
• Improved Availability
– In a centralized DBMS, a computer failure
terminates the operations of the DBMS. However,
a failure at one site of a DDBMS, or a failure of a
communication link making some sites
inaccessible, does not make the entire system
inoperable. Distributed DBMSs are designed to
continue to function despite such failures. If a
single node fails, the system may be able to
reroute the failed node’s requests to another site.
Advantages of DDBMSs
• Improved Reliability
– As data may be replicated so that it exists at more
than one site, the failure of a node or a
communication link does not necessarily make the
data inaccessible.
• Improved Performance
– Since each site handles only a part of the entire
database, there may not be the same contention
for CPU and I/O services as characterized by a
centralized DBMS.
Advantages of DDBMSs
• Economics
– cost saving occurs where databases are
geographically remote and the applications require
access to distributed data. In such cases, owing to
the relative expense of data being transmitted
across the network as opposed to the cost of local
access, it may be much more economical to
partition the application and perform the
processing locally at each site.
– It is also much more cost-effective to add
workstations to a network than to update a
mainframe system.
Advantages of DDBMSs
• Modular Growth
– In a distributed environment, it is much easier to
handle expansion. New sites can be added to the
network without affecting the operations of other
sites. This flexibility allows an organization to
expand relatively easily. Increasing database size
can usually be handled by adding processing and
storage power to the network. In a centralized
DBMS, growth may entail changes to both
hardware (the procurement of a more powerful
system) and software (the procurement of a more
powerful or more configurable DBMS).
Disadvantages of DDBMSs
• Complexity
• Cost
• Security
• Integrity Control More Difficult
• Lack of Standards
• Lack of Experience
• Database Design More Complex
Types of DDBMS
• Homogeneous DDBMS
• Heterogeneous DDBMS
Homogeneous DDBMS
• All sites use same DBMS product.
• Much easier to design and manage.
• Share a common global schema
• Each site provides part of its autonomy in
terms of right to change schema or sw.
• Approach provides incremental growth and
allows increased performance.:-making the
addition of a new site to the DDBMS easy and
allows increased performance by exploiting the
parallel processing capability of multiple sites.
Heterogeneous DDBMS
• Sites may run different DBMS products, with possibly different
underlying data models.
• Occurs when sites have implemented their own databases
and integration is considered later.
• Translations required to allow for:
– Different hardware and different DBMS products.
Heterogeneous DDBMS
– Fragmentation.
– Allocation
– Replication
Distributed Database Design
• Fragmentation
– Relation may be divided into a number of sub-
relations, which are then distributed.
• Allocation
– Each fragment is stored at site with "optimal"
distribution.
• Replication
– Copy of fragment may be maintained at several sites.