PDB Partitioning

Uploaded by

leena sakri


Data Partitioning Strategies in Parallel Database Systems
• Data partitioning distributes data over a number of processing elements.
• Each processing element then works on its own partition at the same time as the others, which creates parallelism.
• By distributing the data evenly, so that every processor gets an equal share of the workload, we can achieve better performance (better parallelism) for the whole system.
Partitioning strategies
• Various partitioning strategies have been proposed to distribute data evenly across multiple processors.
• Let us assume that our parallel database system has
◦ n processors P0, P1, P2, ..., Pn-1 and
◦ n disks D0, D1, D2, ..., Dn-1 across which we partition our data.
• The value of n is chosen according to the degree of parallelism required.
• The partitioning strategies are
◦ Round robin partitioning
◦ Hash partitioning
◦ Range partitioning
Round Robin Partitioning
• The Emp_table has 14 records, and every record stores the name of the employee, his or her work grade, and the department name.
• Assume that we have 3 processors, P0, P1, and P2, and three disks associated with those processors, D0, D1, and D2.
• In the round robin strategy, we partition records in a round-robin manner using the function i mod n, where i is the position of the record in the table (counting from 1) and n is the number of partitions or disks; in our case n is 3.
• Applying this technique, the first record goes to D1 (1 mod 3 = 1), the second to D2, the third to D0 (3 mod 3 = 0), the fourth to D1, and so on.
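The i mod n rule above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `round_robin_partition` and the placeholder labels r1..r14 (standing in for the 14 Emp_table rows, which are not listed here) are hypothetical.

```python
def round_robin_partition(records, n):
    """Send the record at 1-based position i to disk (i mod n)."""
    disks = [[] for _ in range(n)]
    for i, record in enumerate(records, start=1):
        disks[i % n].append(record)
    return disks


# Placeholder labels r1..r14 stand in for the 14 Emp_table rows (assumption).
records = [f"r{i}" for i in range(1, 15)]
disks = round_robin_partition(records, 3)

for d, contents in enumerate(disks):
    print(f"D{d}: {contents}")
```

With 14 records and 3 disks the split cannot be perfectly even: D1 and D2 receive five records each and D0 receives four, which is as balanced as the record count allows.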
Hash Partitioning
• Let us take the GRADE attribute of the EMP_table to explain hash partitioning.
• Let us choose the hash function as follows:
◦ h(GRADE) = GRADE mod n
• where
◦ GRADE is the value of the GRADE attribute of a record
◦ n is the number of partitions, which is 3 in our case
• Applying hash partitioning on GRADE yields the following partitions of EMP_table.
• For example,
◦ the GRADE of Smith is 1, so the hash function directs the record to partition 1 (1 mod 3 = 1).
◦ the GRADE of Blake is 4, which directs to partition 1 (4 mod 3 = 1).
◦ the GRADE of King is 5, which directs to partition 2 (5 mod 3 = 2).
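As a rough sketch, the hash function h(GRADE) = GRADE mod n can be applied to the three employees named above. Only Smith, Blake, and King and their grades come from the slides; the function name `hash_partition` and the dictionary layout are assumptions made for illustration.

```python
def hash_partition(grade, n):
    """h(GRADE) = GRADE mod n, as defined on the slide."""
    return grade % n


# Only the three employees named in the text; the other rows are not listed.
employees = [("Smith", 1), ("Blake", 4), ("King", 5)]
n = 3

partitions = {p: [] for p in range(n)}
for name, grade in employees:
    partitions[hash_partition(grade, n)].append(name)

for p, names in partitions.items():
    print(f"partition {p}: {names}")
```

Unlike round robin, the target partition depends only on the attribute value, so all records with the same GRADE land on the same disk. If a few grades are very common, the partitions can end up uneven.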
Range Partitioning
• In range partitioning we identify one or more attributes as partitioning attributes, then we choose a range partitioning vector to partition the table across n disks. The vector lists boundary values of the partitioning attribute; a vector with n-1 values defines n ranges.
• Let us consider the GRADE attribute of Emp_table for range partitioning.
• To apply range partitioning, we first need to identify the partitioning vector.
• Let us choose [2, 4] as the range partitioning vector for our case.
◦ According to the vector, records with a grade value of 2 or less go into partition 0
◦ records with a grade greater than 2 and less than or equal to 4 go into partition 1
◦ all other records, that is, those with a grade greater than 4, go into partition 2
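The boundary rule for the vector [2, 4] can be sketched with Python's standard `bisect` module. The helper name `range_partition` is hypothetical. `bisect_left` is chosen because it returns the index of the first boundary that is greater than or equal to the value, which matches the "less than or equal to" convention used above.

```python
from bisect import bisect_left


def range_partition(value, vector):
    """Return the partition for value, given a sorted boundary vector.

    A value equal to a boundary stays in the lower partition, so with
    [2, 4]: value <= 2 -> 0, 2 < value <= 4 -> 1, value > 4 -> 2.
    """
    return bisect_left(vector, value)


vector = [2, 4]
for grade in range(1, 6):
    print(f"grade {grade} -> partition {range_partition(grade, vector)}")
```

Other presentations of range partitioning place a value equal to a boundary in the upper partition instead; under that convention `bisect_right` would be the matching call.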
SOLVE
• Consider a parallel DBMS in which each relation is stored by horizontally partitioning its tuples across all disks.

• The mgrid field of Departments is the eid of the manager. Each relation contains 20-byte tuples, and the sal and budget fields both contain uniformly distributed values in the range 0 to 1,000,000. The Employees relation contains 100,000 pages, the Departments relation contains 5,000 pages, and each processor has 100 buffer pages of 4,000 bytes each. The cost of one page I/O is td, and the cost of shipping one page is ts; tuples are shipped in units of one page by waiting for a page to be filled before sending a message from processor i to processor j. There are no indexes, and all joins that are local to a processor are carried out using a sort-merge join. Assume that the relations are initially partitioned using a round-robin algorithm and that there are 10 processors.
• For each of the following queries, describe the evaluation plan briefly and give its cost in terms of td and ts. You should compute the total cost across all sites as well as the ‘elapsed time’ cost (i.e., if several operations are carried out concurrently, the time taken is the maximum over these operations).
• 1. Find the highest paid employee.
• 2. Find the highest paid employee in the department with did 55.
• 3. Find the highest paid employee over all departments with budget less than 100,000.
• 4. Find the highest paid employee over all departments with budget less than 300,000.
• 5. Find the average salary over all departments with budget less than 300,000.
• 6. Find the salaries of all managers.
• 7. Find the salaries of all managers who manage a department with a budget less than 300,000 and earn more than 100,000.
• 8. Print the eids of all employees, ordered by increasing salaries. Each processor is connected to a separate printer, and the answer can appear as several sorted lists, each printed by a different processor, as long as we can obtain a fully sorted list by concatenating the printed lists (in some order).
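Before costing the individual queries, it may help to derive a few quantities that follow directly from the numbers in the problem statement. The sketch below is only a starting point, not a solution: the variable names are made up, and the last two values assume a plan in which every processor scans its local Employees partition once, in parallel with the others.

```python
# Figures taken from the problem statement.
PAGE_BYTES = 4_000
TUPLE_BYTES = 20
PROCESSORS = 10
EMP_PAGES = 100_000
DEPT_PAGES = 5_000

# Derived quantities.
tuples_per_page = PAGE_BYTES // TUPLE_BYTES       # tuples that fit on a page
emp_tuples = EMP_PAGES * tuples_per_page          # total Employees tuples
dept_tuples = DEPT_PAGES * tuples_per_page        # total Departments tuples

# Round-robin partitioning spreads pages evenly over the processors.
emp_pages_per_site = EMP_PAGES // PROCESSORS
dept_pages_per_site = DEPT_PAGES // PROCESSORS

# Assumed plan: one full local scan of Employees at every site, in parallel.
scan_total_td = EMP_PAGES              # total I/O cost, in units of td
scan_elapsed_td = emp_pages_per_site   # elapsed I/O cost, in units of td

print(tuples_per_page, emp_tuples, dept_tuples)
print(emp_pages_per_site, dept_pages_per_site)
print(scan_total_td, scan_elapsed_td)
```

Because sal and budget are uniformly distributed over 0 to 1,000,000, a predicate such as budget less than 300,000 can be expected to select roughly 30% of the Departments tuples, which is useful when estimating how many pages must be shipped.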
