0% found this document useful (0 votes)

6 views

19516_Week 2 Parallel and Distributed Database

The document discusses Parallel and Distributed Databases, highlighting the benefits of parallel databases in improving processing speeds through multiple CPUs and disks, and detailing their architectures: Shared Memory, Shared Disk, and Shared Nothing systems. It also explains Distributed Databases, which consist of interrelated databases across a network, managed by a Distributed Database Management System (DDBMS), and outlines the advantages and disadvantages of both systems. Key differences between Parallel and Distributed DBMS are also noted, emphasizing their operational and design purposes.

Uploaded by

Fountain Josiah

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views

19516_Week 2 Parallel and Distributed Database

Uploaded by

Fountain Josiah

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

TOPIC: Parallel and Distributed Database

Parallel database:
Parallel Database improve processing and input/output speeds by using multiple CPU and disks in
parallel. A Parallel Database system seeks to improve performance through parallelization of
various operations, such as loading data, building indexes and evaluating queries. In Parallel
processing, many operations are performed simultaneously, as opposed to serial processing, in
which the computational steps are performed sequentially.
Organizations of every size benefit from databases because they improve the management of
information. The database has a server, a specialized program that oversees all user requests.
Organization use parallel database approach for a large user base and millions of records to
process. They are fast, flexible and reliable.
Architecture for parallel database:
There are three main architectures for building parallel DBMS
Shared Memory
Shared Disk System
Shared Nothing System
1. Shared Memory System: This is where multiple processors are attached to an interconnected
network and access a common region of memory.
Advantages
1. It is closer to a conventional machine and easy to program. 2. Overhead is low.
3. OS Services are leveraged to utilized the additional CPU
Disadvantages
1. It leads to a bottleneck problem.
2. Expensive to build.
3. It is less sensitive to partitioning
2. Shared disk system: where each processor has its own main memory, and direct access to all
disks through an interconnected network.
Advantages
1. The same with shared memory
Disadvantages
1. More interference
2. Increases N/ W bandwidth.
3. The shared disk is less sensitive to partitioning.
3. Shared nothing: This is where each processor has local main memory and disk space, but no
two processors can access the same storage area and all communication between processor is
through a network connection. It has its own mass storage as well as main memory.
Advantages
1. It provides a linear scale-up and linear speedup. 2. Shared nothing benefit from “good”
partitioning. 3. Cheap to build.
Disadvantages
1. It is hard to program.
2. Addition of new nodes requires reorganization.
Parallel query evaluation
A relational query execution plan is a graph/ tree of relational algebra operators (based on this
operators can execute in parallel) and the operators in a graph can be executed in parallel. If an
operator consumes the output of a second operator, we have pipelined parallelism.
Data partitioning: In this case, a large database is partitioned horizontally across several disks, this
enables us to exploit the I/O bandwidth of the disk by reading and writing them in parallel. This
can be done in the following ways:
1. Round-robin partitioning: If there are n processors, the 1st tuple is assigned to processor mod
n round-robin partitioning. Round-robin partitioning is suitable for efficiently evaluating queries
that access the entire relation. If only a subset of the tuples is required, hash partitioning and range
partitioning are better than round-robin partitioning.
2. Hash partitioning: A hash function is applied to (selected fields of) a tuple to determine its
processor. Hash partitioning has the additional virtue that it keeps data evenly distributed even if
the data grows and shrinks over time.
3. Range partitioning: Tuples are sorted and ranges are chosen for the sort key values so that each
range contains roughly the same number of tuples, tuples in range, I reassigned to processor i.
Range Partitioning can lead to data skew.
Advantages of parallel databases
A parallel database runs on many computers at the same time.
1. High Performances 2. Speed
3. Reliability
4. Capacity
Disadvantages of Parallel database
1. Implementation is highly expensive.
2. Handling Parallel database simultaneously is difficult and complex. 3. A lot of resources are
needed to support and maintain the database.
Distributed Database
A Distributed database (DDB) is a collection of multiple, logical interrelated database distributed
over a computer network.
A Distributed database management system (DDBMS) is the software that manages the DDB and
provides an access mechanism that makes this distribution transparent to the users. A distributed
database system is a system that permits physical data storage across several sites and each
site/node is managed by a DBMS that is capable of running independently of the other sites. It is
a database in which storage devices are not all attached to a common processing unit as the CPU,
controlled by a distributed database management system. It may be stored in multiple computers,
located in the same physical location; or may be dispersed over a network of interconnected
computers. System administrators can distribute collections of data (e. g in a database) across
multiple physical locations. A distributed database can reside on network servers on the internet,
on corporate intranets, or other company networks.
Two processing ensure that the distributed database remain up- to date and current:
Replication: involves using specialized software that looks for changes in the distributed database.
Once the changes have been identified, the replication process makes all the databases look the
same. The replication process can be complex and time –consuming depending on the size and
number of the distributed databases
Duplication: This process has less complexity, it basically identifies one database as a master and
then duplicates that database. The duplication process is normally done at a set time hour. This is
to ensure that each distributed location has the same data. In the duplication process, users may
change only the master database, which ensures that local data will not be overwritten.
A Distributed Database management system is designed for heterogeneous database platforms that
focus on heterogeneous database management systems. The following property is considered
desirable:
1. Distributed Data Independence: Users should be able to ask queries without specifying where
the referenced relations or copies or fragments of the relations are located.
2. Distributed Transaction Atomicity: User should be able to write transactions that access and
update at several sites just as they would write transactions over purely local data
Types of distributed database
There are two major types of distributed database systems: they are:
1. Homogenous distributed database
2. Heterogeneous distributed database.
Homogenous distributed database:
1. The following conditions must be satisfied for the homogeneous database:
2. The operating system use, at each location, must be the same.
3. the operating system, must, data structures and database application used at each
location must be same or compatible.
Heterogeneous distributed database:
The following conditions must be satisfied for the heterogeneous database:
1. Different sites may use different schema and software.
2. In heterogeneous systems, different nodes may have different hardware, software and
data structure at various nodes or locations.
The three major distributed DBMS architectures are:
Client-Server Collaborating Server Middleware
1. Client-Server Architecture: In this architecture, the Client (front end) does data presentation or
processing, while the Server (back- end) does storage, security and major data processing. The
client is held responsible for user-interface issues and servers manage data and execute
transactions. A client-server system has one or more client processes and one or more server
processes, and a client process can send a query to any one server process. Thus a client process
could run on a personal computer and send queries to a server running on a mainframe.
Clients characteristics
1. Always initiate requests to servers.
2. Waits for replies.
3. Receives replies.
4. Usually connects to a small number of servers at one time.
Servers characteristics
1. Always wait for a request from one of the clients
2. Servers client request then replies with requested data to the clients
3. A server may communicate with other servers to serve a client request.
4. A server is a source which sends a request to the client to get the needed data users.
Advantages of client-server architecture
1. Very easy to implement because of its clear separation of functionally and a centralized server.
2. Allow user to run a graphical user interface.
3. It enables the roles and responsibilities of a computing system to be distributed among
several independent computers known to each other only through the network. It also provides
greater ease of maintenance.
4. Servers provide better security control access and resources to guarantee that only those clients
with the appropriate permissions may access and change data.
5. Since data storage is centralized, updates to that data are much easier for administrators.
6. Many advanced client-server technologies are designed to ensure security, user-friendly
interfaces and ease of use.
7. It works with multiple different clients of different specifications.

Disadvantages of client-server
1. The client-Server architecture does not permit a single query to span multiple servers.
2. Some times to separate and distinguish between clients and server architecture become
harder.
3. The problem of overlapping, the client process and the server.
4. Networks traffic blocking is one of the problems related to the client-server model.
2. Collaborating server system: This is a collection of database servers, each capable of running
transactions against local data, which cooperatively execute transactions spanning multiple
servers. This overcomes the problem of client-server architecture.
3. Middleware architecture: All web transactions take place on the servers. The web server is
responsible for communicating with the browser while the database server is responsible for
storing the required information.
Advantages of distributed databases
1. Data is stored at many sites, also referred to as nodes.
2. The processors at nodes are interconnected by a computer network rather than a
multiprocessor configuration.
3. The distributed database is indeed a true database, not a collection of files that can be
stored individually at each node.
4. The overall system has the full functionality of a database management system.
5. Reliable transactions due to the replication of database
6. Hardware, operating system, network, fragmentation, DBMS, replication and location
independence.
7. Continuous operation, even if some nodes go offline.
8. Distributed query processes can improve performance.
9. Easier expansion.
10. Local autonomy of site autonomy: a department can control the data about them.
11. Protection of valuable data if there is a fire outbreak as a result of the distributed data in multiple
sites.
12. Modularity systems can be modified added and removed fro the distributed database without
affecting other systems or modules.
13. It is very economical.
Disadvantages of distributed databases
1. Data integrity is difficult to maintain.
2. Distributed data is very complex in nature. For example, extra work must be done to maintain
multiple disparate systems, instead of one big one.
3. It is not really economical because a more extensive infrastructure implies extra labour costs.
4. Absence of standards right.
5. Additional software is needed.
6. Complexity in database design.
7. The operating system should support a distributed environment.
Storing data in DDBS
Data storage in a distributed database involve two concepts
1. Fragmentation 2. Replication
1. Fragmentation: This is a process of splitting a relation into smaller relation or fragments, and
storing the fragment possibly at different sites. In horizontal fragmentation, each fragment consists
of a subset of rows of the original relation. While in vertical fragmentation, each fragment consists
of a subset of columns of the original relations.
2. Replication: This means that several copies of a relation or relation fragment can be stored. An
entire relation can be replicated at one or more sites. Similarly, one or more fragments of a relation
can be replicated at other sites. For example, if a relation R is fragmented into R1, R2 and R3,
there might be just one copy of R1, whereas R2 is replicated at two other sites and R3 is replicated
at all sites.
Parallel DBMS against distributed DBMS
Parallel Distributed System: seeks to improve performance through parallelization of various
operations, such as data loading, index building and query evaluating. Distributed Database
System: Data is physically stored across several sites, and each site is typically managed by a
DBMS capable of running independently of the other sites. The distribution of data is governed by
factors such as local ownership and increased availability.
1. System component: Distributed DBMS consists of many Geo-distributed, low –bandwidth link
connected, autonomic site. While parallel DBMS consists of tightly coupled, high- bandwidth link
connected, non- autonomic node.
2. Component role: Sites in distributed DBMS can work independently to handle local transaction
or work together to handle global transactions. While nodes in parallel DBMS can only work
together to handle global transactions.
3. Design purposes: Distributed DBMS is for sharing data, local autonomy, high availability, while
parallel DBMSA is for high-performance high availability.

MetroPCS MSL SPC Code Calculator
80% (5)
MetroPCS MSL SPC Code Calculator
2 pages
Distributed DBMS Architecture
No ratings yet
Distributed DBMS Architecture
49 pages
The Essential Teaching Skills Ebook PDF
No ratings yet
The Essential Teaching Skills Ebook PDF
11 pages
Parallel Databases
No ratings yet
Parallel Databases
23 pages
Tybca Recent Trends in It Chpter 1
No ratings yet
Tybca Recent Trends in It Chpter 1
16 pages
CH.4
No ratings yet
CH.4
16 pages
Advanced Data Base Management Systems
No ratings yet
Advanced Data Base Management Systems
35 pages
Team:DBMS: by Navdeep Kaur Assistant Professor Computer Science Department
No ratings yet
Team:DBMS: by Navdeep Kaur Assistant Professor Computer Science Department
19 pages
DBMS Basics
No ratings yet
DBMS Basics
6 pages
ADT unit 1 to 5 (1)
No ratings yet
ADT unit 1 to 5 (1)
160 pages
Distributed DB
No ratings yet
Distributed DB
4 pages
Notes_1071_MCA-20-23 Unit- 4.1
No ratings yet
Notes_1071_MCA-20-23 Unit- 4.1
48 pages
MC4202 - Adavanced Database Technology
No ratings yet
MC4202 - Adavanced Database Technology
159 pages
ADT Notes
No ratings yet
ADT Notes
36 pages
ADBMS_Chapter_No._3
No ratings yet
ADBMS_Chapter_No._3
37 pages
Chapter - 7 Distributed Database System
0% (1)
Chapter - 7 Distributed Database System
54 pages
1 Distributed DB
No ratings yet
1 Distributed DB
67 pages
Database Notes
No ratings yet
Database Notes
62 pages
DBMS - Chapter 1
No ratings yet
DBMS - Chapter 1
45 pages
Distributed Database Vs Conventional Database
50% (2)
Distributed Database Vs Conventional Database
4 pages
Chapter 6 Distributed System Management
No ratings yet
Chapter 6 Distributed System Management
12 pages
Book Summary
No ratings yet
Book Summary
31 pages
Introduction To Parallel and Distributed Databases
No ratings yet
Introduction To Parallel and Distributed Databases
12 pages
SQL Unit 3 Distributed DB
No ratings yet
SQL Unit 3 Distributed DB
10 pages
Lecture3-Distributed Introduction
No ratings yet
Lecture3-Distributed Introduction
38 pages
Chapter - 6 Distributed Database System
No ratings yet
Chapter - 6 Distributed Database System
50 pages
Distributed Database: Database Database Management System Storage Devices CPU Computers Network
No ratings yet
Distributed Database: Database Database Management System Storage Devices CPU Computers Network
15 pages
Unit V NoSQL Databases
No ratings yet
Unit V NoSQL Databases
124 pages
A2Z Dbms
No ratings yet
A2Z Dbms
23 pages
UNIT 1 Notes
No ratings yet
UNIT 1 Notes
74 pages
Distributed Database Management
No ratings yet
Distributed Database Management
7 pages
Unit-Iii Distributed Database: System
No ratings yet
Unit-Iii Distributed Database: System
55 pages
Module 3 ADS
No ratings yet
Module 3 ADS
17 pages
Advance Concept in Data Bases Unit-3 by Arun Pratap Singh
100% (2)
Advance Concept in Data Bases Unit-3 by Arun Pratap Singh
81 pages
DDBS Lec1
No ratings yet
DDBS Lec1
20 pages
Distributed Database-Chapter 3
No ratings yet
Distributed Database-Chapter 3
26 pages
Topic 7 - Distributed Database Systems
No ratings yet
Topic 7 - Distributed Database Systems
44 pages
Parallal Databases
No ratings yet
Parallal Databases
4 pages
Dbms Notes
No ratings yet
Dbms Notes
48 pages
Database Management Systems and Distributed Systems Lesson 3
No ratings yet
Database Management Systems and Distributed Systems Lesson 3
34 pages
Adv DBMS-Unit 2
No ratings yet
Adv DBMS-Unit 2
15 pages
Distributed Databases: Daniel Marcous
No ratings yet
Distributed Databases: Daniel Marcous
41 pages
DBMS Ques and Ans
No ratings yet
DBMS Ques and Ans
6 pages
Distributeddbms Er. Inderjeet Bal
No ratings yet
Distributeddbms Er. Inderjeet Bal
60 pages
Lecture Note 1 (DBMS Basics and Languages)
No ratings yet
Lecture Note 1 (DBMS Basics and Languages)
22 pages
Unit 5 Parallel and Distributed Databases
No ratings yet
Unit 5 Parallel and Distributed Databases
22 pages
Fundamental Research of Distributed Database PDF
No ratings yet
Fundamental Research of Distributed Database PDF
9 pages
Database Systems
No ratings yet
Database Systems
86 pages
Distributed Database Management Systems
No ratings yet
Distributed Database Management Systems
12 pages
Database Management System
No ratings yet
Database Management System
88 pages
Distributed Databases: Indu Saini (Research Scholar) IIT Roorkee Enrollment No.: 10926003
No ratings yet
Distributed Databases: Indu Saini (Research Scholar) IIT Roorkee Enrollment No.: 10926003
14 pages
Unit - 2 (1) DBMS
No ratings yet
Unit - 2 (1) DBMS
25 pages
Lesson Two DMS
No ratings yet
Lesson Two DMS
11 pages
MySQL Tutorial
No ratings yet
MySQL Tutorial
176 pages
CC-6-UNIT 1 & 2_240227_115659
No ratings yet
CC-6-UNIT 1 & 2_240227_115659
20 pages
Unit_I DBMS
No ratings yet
Unit_I DBMS
74 pages
Distributed Databases
No ratings yet
Distributed Databases
39 pages
Distributed Databases Introduction
100% (1)
Distributed Databases Introduction
16 pages
DBMS & SQL
No ratings yet
DBMS & SQL
34 pages
UNIT-1 DBMS AKTU Class Nots PDF
No ratings yet
UNIT-1 DBMS AKTU Class Nots PDF
24 pages
O o o o o o O: What Is Database
No ratings yet
O o o o o o O: What Is Database
20 pages
Database And Computer Management: SERIES 1, #3
From Everand
Database And Computer Management: SERIES 1, #3
Elias Mutegi
No ratings yet
Technical Seminar SAHANA
No ratings yet
Technical Seminar SAHANA
27 pages
Comparison of Fatigue Provisions in Various Codes
No ratings yet
Comparison of Fatigue Provisions in Various Codes
11 pages
Stylus For Sony
No ratings yet
Stylus For Sony
1 page
Kernel Sentence LP
No ratings yet
Kernel Sentence LP
2 pages
Service Manual Chasis TK 2080
No ratings yet
Service Manual Chasis TK 2080
21 pages
1.assignment 1 Lingusitics
No ratings yet
1.assignment 1 Lingusitics
7 pages
Table 2.4 BS 1490:1988 Alloys and Approximate Equivalents
No ratings yet
Table 2.4 BS 1490:1988 Alloys and Approximate Equivalents
1 page
IB - SWAYAM July 2022 Semester 21.12.2022
No ratings yet
IB - SWAYAM July 2022 Semester 21.12.2022
54 pages
Drishya Yuva Site Internship Report
No ratings yet
Drishya Yuva Site Internship Report
60 pages
Cyber Crime Trends: Darrent NG APAC - Enterprise Sales
No ratings yet
Cyber Crime Trends: Darrent NG APAC - Enterprise Sales
32 pages
ATM
100% (2)
ATM
81 pages
Concept Map - Mathematical Structure
No ratings yet
Concept Map - Mathematical Structure
10 pages
Trading Journal
No ratings yet
Trading Journal
34 pages
Describing Forces: Jenny Rose N. Pangilinan Science Teacher
No ratings yet
Describing Forces: Jenny Rose N. Pangilinan Science Teacher
41 pages
Affairscloud April Week 3 PDF
No ratings yet
Affairscloud April Week 3 PDF
24 pages
Installation & Retreivable BPV
No ratings yet
Installation & Retreivable BPV
5 pages
Book Review of Rural Elite 160412
0% (1)
Book Review of Rural Elite 160412
20 pages
SAFETY DATA SHEET Shell Rimula R2 10W
No ratings yet
SAFETY DATA SHEET Shell Rimula R2 10W
15 pages
Course Outline HRM
No ratings yet
Course Outline HRM
3 pages
Example 06
No ratings yet
Example 06
5 pages
TLBrochure PDF
No ratings yet
TLBrochure PDF
11 pages
Mikhail Chigorin Top Chess Players - Chess.com
No ratings yet
Mikhail Chigorin Top Chess Players - Chess.com
1 page
The Western 1st Edition David Lusted - The 2025 ebook edition is available with updated content
100% (1)
The Western 1st Edition David Lusted - The 2025 ebook edition is available with updated content
53 pages
CONGA - Coding Test - Automation
No ratings yet
CONGA - Coding Test - Automation
2 pages
Measuring Job Satisfaction
100% (2)
Measuring Job Satisfaction
5 pages
2021Jan-GROUP PROJECT Topic-Sent ST
No ratings yet
2021Jan-GROUP PROJECT Topic-Sent ST
14 pages
Ceiling Mount Occupancy Sensor: Features
No ratings yet
Ceiling Mount Occupancy Sensor: Features
2 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

19516_Week 2 Parallel and Distributed Database

Uploaded by

19516_Week 2 Parallel and Distributed Database

Uploaded by

TOPIC: Parallel and Distributed Database

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.