0% found this document useful (0 votes)
21 views

Chapter 7

This document discusses distributed databases. Key points include: 1) A distributed database is a collection of interconnected databases spread across locations that communicate over a network and appear as a single logical database. 2) Distributed databases allow for data sharing across sites for global applications, improved availability and reliability through data replication, and modular growth through adding new sites. 3) Challenges include increased complexity, costs, security issues, and database design complexity due to fragmentation, allocation, and replication concerns.

Uploaded by

mehari kiros
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
21 views

Chapter 7

This document discusses distributed databases. Key points include: 1) A distributed database is a collection of interconnected databases spread across locations that communicate over a network and appear as a single logical database. 2) Distributed databases allow for data sharing across sites for global applications, improved availability and reliability through data replication, and modular growth through adding new sites. 3) Challenges include increased complexity, costs, security issues, and database design complexity due to fragmentation, allocation, and replication concerns.

Uploaded by

mehari kiros
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 22

Chapter 7

Distributed Database-
Concepts and Design

1
Distributed Databases
• A distributed database is a collection of multiple
interconnected databases, which are spread
physically across various locations that
communicate via a computer network.
• It is single logical database physically divided
among networked computers
• Distributed DBMS: is a centralized software
system that supports and manipulates distributed
databases.
• It ensures that the data modified at any site is
universally updated. 2
Concept…
• Users access the distributed database via
applications, which are classified as those that
do not require data from other sites (local
applications) and
• Those that do require data from other sites
(global applications). We require a DDBMS to
have at least one global application.
Distributed Processing and Distributed Database

• Distributed processing shares the database’s


logical processing among two or more
physically independent sites that are
connected through a network.
• Distributed database stores a logically related
database over two or more physically
independent sites connected via a computer
network.
Distributed Processing Environment
Distributed Database Environment
Distributed Processing and Distributed Database

• Distributed processing does not require a distributed


database, but a distributed database requires
distributed processing.
• Distributed processing may be based on a single
database located on a single computer. In order to
manage distributed data, copies or parts of the
database processing functions must be distributed to
all data storage sites.
• Both distributed processing and distributed
databases require a network to connect all
components.
Distributed Databases (DDB) Concept
SELECT Sales
• Multiple independent systems FROM Bahridar.Sales
– Each has DBMS engine, queries, locking, transactions, UNION
SELECT Sales
etc. FROM Mekelle.Sales
UNION
– Usually on different machines & locations
SELECT Sales
– Can be different hardware, OS, software. FROM Adama.Sales
A.A

Bahrdar

Mekelle

Adama

8 of 24
Distributed Database System (DDBS) Concept

Database
Database

Bardar
Mekelle

Database

Adama
• There must be different databases as opposed to a central database

• Network is a part of DDB system

9 of 24
Features of Distributed database
• Databases in the collection are logically interrelated with
each other.
• Data is stored at a number of sites
• Sites are interconnected by a network
• DDB is logically a single database
• DDBMS has full functionality of DBMS.
• A distributed database incorporates transaction
processing, but it is not synonymous with a transaction
processing system.
Advantages of DDBMSs
• Organizational Structure
– Many organizations are naturally distributed
over several locations.
• Shareability and Local Autonomy
– The users at one site can access data stored at
other sites. Data can be placed at the site close
to the users who normally use that data. In this
way, users have local control of the data and
they can consequently establish and enforce
local policies regarding the use of this data.
– A global database administrator (DBA) is
responsible for the entire system.
Advantages of DDBMSs
• Improved Availability
– In a centralized DBMS, a computer failure
terminates the operations of the DBMS. However,
a failure at one site of a DDBMS, or a failure of a
communication link making some sites
inaccessible, does not make the entire system
inoperable. Distributed DBMSs are designed to
continue to function despite such failures. If a
single node fails, the system may be able to
reroute the failed node’s requests to another site.
Advantages of DDBMSs
• Improved Reliability
– As data may be replicated so that it exists at more
than one site, the failure of a node or a
communication link does not necessarily make the
data inaccessible.
• Improved Performance
– Since each site handles only a part of the entire
database, there may not be the same contention
for CPU and I/O services as characterized by a
centralized DBMS.
Advantages of DDBMSs
• Economics
– cost saving occurs where databases are
geographically remote and the applications require
access to distributed data. In such cases, owing to
the relative expense of data being transmitted
across the network as opposed to the cost of local
access, it may be much more economical to
partition the application and perform the
processing locally at each site.
– It is also much more cost-effective to add
workstations to a network than to update a
mainframe system.
Advantages of DDBMSs
• Modular Growth
– In a distributed environment, it is much easier to
handle expansion. New sites can be added to the
network without affecting the operations of other
sites. This flexibility allows an organization to
expand relatively easily. Increasing database size
can usually be handled by adding processing and
storage power to the network. In a centralized
DBMS, growth may entail changes to both
hardware (the procurement of a more powerful
system) and software (the procurement of a more
powerful or more configurable DBMS).
Disadvantages of DDBMSs
• Complexity
• Cost
• Security
• Integrity Control More Difficult
• Lack of Standards
• Lack of Experience
• Database Design More Complex
Types of DDBMS
• Homogeneous DDBMS
• Heterogeneous DDBMS
Homogeneous DDBMS
• All sites use same DBMS product.
• Much easier to design and manage.
• Share a common global schema
• Each site provides part of its autonomy in
terms of right to change schema or sw.
• Approach provides incremental growth and
allows increased performance.:-making the
addition of a new site to the DDBMS easy and
allows increased performance by exploiting the
parallel processing capability of multiple sites.
Heterogeneous DDBMS
• Sites may run different DBMS products, with possibly different
underlying data models.
• Occurs when sites have implemented their own databases
and integration is considered later.
• Translations required to allow for:
– Different hardware and different DBMS products.
Heterogeneous DDBMS

• For example, relations in the relational data model


are mapped to records and sets in the network
model.
• It is also necessary to translate the query language
used (for example, SQL SELECT statements are
mapped to the network FIND and GET statements).
• Typical solution is to use gateways, which convert the
language and model of each different DBMS into the
language and model of the relational system.
Distributed Database Design
• Three key issues:

– Fragmentation.
– Allocation
– Replication
Distributed Database Design
• Fragmentation
– Relation may be divided into a number of sub-
relations, which are then distributed.

• Allocation
– Each fragment is stored at site with "optimal"
distribution.

• Replication
– Copy of fragment may be maintained at several sites.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy