Iii. Current Trends: Distributed Databases and DBMSS: Concepts and Design
Iii. Current Trends: Distributed Databases and DBMSS: Concepts and Design
Current Trends
Slide 1/32
12.0 Content
Content
12.1 Objectives
12.2 Overview of Networking
12.3 Introduction to DDBMSs
- Concepts 12.6 Transparency in a DDBMS
- Advantages and Disadvantages - Distribution Transparency
- Homogeneous and Heterogeneous - Transaction Transparency
12.4 Functions and Architecture - Performance Transparency
- Functions of a DDBMS 12.7 Date’s 12 Rules for DDBMs
- Reference Architecture for a 12.8 Summary
DDBMS/ Federated MDBS
12.5 Distributed Relational Database Design
- Data Allocation
- Fragmentation
Slide 2/32
12.1 Objectives
Objectives
In this Lecture you will learn:
• Concepts.
• Advantages and disadvantages of distributed
databases.
• Functions and architecture for a DDBMS.
• Distributed database design.
• Levels of transparency.
• Comparison criteria for DDBMSs.
Slide 3/32
12.2 Overview of Networking
Overview of Networking
Network: interconnected collection of autonomous computers,
capable of exchanging information.
• Local Area Network (LAN) intended for connecting computers at
same site.
• Wide Area Network (WAN) used when computers or LANs need
to be connected over long distances.
Slide 4/32
12.2 Overview of Networking
Overview of Networking
Network: interconnected collection of autonomous computers,
capable of exchanging information.
• Local Area Network (LAN) intended for connecting computers at
same site.
• Wide Area Network (WAN) used when computers or LANs need
to be connected over long distances.
Slide 5/32
12.3 Introduction
Concepts
Databases and networks:
Slide 6/32
12.3 Introduction
Concepts
DDBMS to Avoid `islands of information’ problem…
Slide 7/32
12.3 Introduction
Concepts
Slide 8/32
12.3 Introduction
Slide 9/32
12.3 Introduction
Distributed Processing
Slide 10/32
12.3 Introduction
Slide 11/32
12.3 Introduction
Slide 12/32
12.3 Introduction
Homogeneous &
Heterogeneous DDBMSs
Homogeneous: All sites use same DBMS product.
• Much easier to design and manage.
• Approach provides incremental growth
• Allows increased performance.
Heterogeneous: Sites may run different DBMS products,
underlying data models.
• Sites implemented their own databases - integration considered later.
•Translations required to allow for • Different hardware.
• Different DBMS products.
• Different hardware and DBMS products.
•Typical solution is to use gateways.
Slide 13/32
12.3 Introduction
Slide 14/32
12.3 Introduction
2 types:
• Federated MDBS: looks like a DDBMS for global users and a
centralized DBMS for local users.
• Unfederated MDBS: has no “local” users
Slide 15/32
12.4 Functions and Architecture of a DDBMS
Slide 16/32
12.4 Functions and Architecture of a DDBMS
Functions of a DDBMS
• Expect DDBMS to have at least the functionality of a DBMS.
Slide 17/32
12.4 Functions and Architecture of a DDBMS
Slide 18/32
12.4 Functions and Architecture of a DDBMS
Slide 19/32
12.5 Distributed Relational Database Design
Data Allocation !
Four alternative strategies regarding placement of data:
• Centralized: single database and DBMS stored at one site with
users distributed across the network.
• Partitioned: Database partitioned into disjoint fragments, each
fragment assigned to one site.
• Complete Replication: Consists of maintaining complete copy of
database at each site.
• Selective Replication: Combination of partitioning, replication,
and centralization.
Comparison of strategies
Slide 20/32
12.5 Distributed Relational Database Design
Data Allocation
Four alternative strategies regarding placement of data:
• Centralized: single database and DBMS stored at one site with
users distributed across the network.
• Partitioned: Database partitioned into disjoint fragments, each
fragment assigned to one site.
• Complete Replication: Consists of maintaining complete copy of
database at each site.
• Selective Replication: Combination of partitioning, replication,
and centralization.
Comparison of strategies
Slide 21/32
12.5 Distributed Relational Database Design
Fragmentation
Why fragment?
Disadvantages: Performance & Integrity.
Usage:
- Apps work with views rather than entire relations.
Efficiency:
- Data stored close to where most frequently used.
- Data not needed by local applications is not stored.
Security:
- and so not available to unauthorized users.
Parallelism:
- With fragments as unit of distribution, T can be divided
into several subqueries that operate on fragments.
Slide 22/32
12.5 Distributed Relational Database Design
Fragmentation !
Three Correctness of fragmentation rules:
1. Completeness: If relation R decomposed into fragments R1, R2, ...
Rn, each data item that can be found in R must appear in at least one
fragment.
Fragmentation !
Four types of fragmentation:
1. Horizontal: Consists of a subset of the tuples of a relation.
Slide 24/32
12.5 Distributed Relational Database Design
Fragmentation !
Other possibility is no
Four types of fragmentation: fragmentation:
Slide 25/32
12.6 Distributed Relational Database Design
Transparency in a DDBMS
Transparency hides implementation details from users.
1. Distribution Transparency
Distribution transparency: allows user to perceive database as
single, logical entity.
2. Transaction Transparency
Transaction transparency: Ensures all distributed Ts
maintain distributed database’s integrity and consistency.
Slide 28/32
12.6 Distributed Relational Database Design
2. Transaction Transparency
Concurrency transparency: All Ts must execute independently and be
logically consistent with results obtained if Ts executed in some arbitrary
serial order.
• Replication makes concurrency more complex
Failure transparency: must ensure atomicity and durability of global T.
• Means ensuring that subTs of global T either all commit or all abort.
• Classification transparency: In IBM’s Distributed Relational
Database Architecture (DRDA), four types of Ts:
– Remote request
– Remote unit of work
– Distributed unit of work
– Distributed request.
Slide 29/32
12.6 Distributed Relational Database Design
3. Performance Transparency
DDBMS: - no performance degradation due to distributed architecture.
- determine most cost-effective strategy to execute a request.
Slide 30/32
12.7 Dates 12 Rules for DDBMS
Slide 31/32
12.8 Summary
Summary
12.1 Objectives 12.6 Transparency in a DDBMS
12.2 Overview of Networking - Distribution Transparency
12.3 Introduction to DDBMSs - Transaction Transparency
Concepts - Performance Transparency
Advantages and Disadvantages 12.7 Date’s 12 Rules for DDBMs
Homogeneous and Heterogeneous
NEXT LECTURE:
12.4 Functions and Architecture III Current Trends
Functions of a DDBMS Part 2: Distributed DBMSs-
Reference Architecture for a Advanced concepts
- advanced concepts
DDBMS/ Federated MDBS
- protocols for distributed
12.5 Distributed Relational Database Design deadlock control
Data Allocation - X/Open Distributed Transaction
Fragmentation Processing Model
- Oracle.
Slide 32/32