0% found this document useful (0 votes)

24 views

Lec7 - B-Trees

B-Trees are m-way search trees used for disk-based data structures. They minimize disk accesses for operations like insertion and search. Nodes can have between m/2 and m children, allowing for efficient splitting and merging as keys are added or removed. Common operations take O(h) time where h is the tree height, which is typically low for large m.

Uploaded by

Nour Hesham

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

24 views

Lec7 - B-Trees

Uploaded by

Nour Hesham

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 27

B-Trees

1
B-Trees
Considerations for disk-based storage
systems.
Indexed Sequential Access Method
(ISAM)
m-way search trees
B-trees
2
Data Layout on Disk
• Track: one ring
• Sector: one pie-shaped piece.
• Block: intersection of a track and a sector.

3
Considerations for Disk Based
Dictionary Structures
Use a disk-based method when the dictionary is too big to
fit in RAM at once.

Minimize the expected or worst-case number of disk

accesses for the essential operations (put, get, remove).

Keep space requirements reasonable -- O(n).

Methods based on binary trees, such as red-black search

trees, are not optimal for disk-based representations. The
number of disk accesses can be greatly reduced by using
m-way search trees.
4
Indexed Sequential Access
Method (ISAM)

Store m records in each disk block.

Use an index that consists of an array with

one element for each disk block, holding a
copy of the largest key that occurs in that
block.

5
ISAM (Continued)

1.7 5.1 21.2 26.8 ...

6
ISAM (Continued)
To perform a get(k) operation:

Look in the index using, say, either a

sequential search or a binary search, to
determine which disk block should hold the
desired record.

Then perform one disk access to read that

block, and extract the desired record, if it
exists.

7
ISAM Limitations
Problems with ISAM:

What if the index itself is too large to fit entirely in

RAM at the same time?

Insertion and deletion could be very expensive if

all records after the inserted or deleted one have
to shift up or down, crossing block boundaries.

8
A Solution: B-Trees
Idea 1: Use m-way search trees.
(ISAM uses a root and one level under the root.)
m-way search trees can be as high as we need.

Idea 2: Don’t require that each node always be full.

Empty space will permit insertion without rebalancing.
Allowing empty space after a deletion can also avoid
rebalancing.

Idea 3: Rebalancing will sometimes be necessary: figure

out how to do it in time proportional to the height of the
tree.

9
B-Tree Example with m = 5

2 3 8 13 27

The root has been 2 and m children.

Each non-root internal node has between m/2 and m children.
All external nodes are at the same level. (External nodes are
actually represented by null pointers in implementations.)
10
Insert 10

2 3 8 10 13 27

We find the location for 10 by following a path from the root using the
stored key values to guide the search.
The search falls out the tree at the 4th child of the 1st child of the root.
The 1st child of the root has room for the new element, so we store it
there.
11
Insert 11

2 3 8 10 11 13 27

We fall out of the tree at the child to the right of key 10.
But there is no more room in the left child of the root to hold 11.
Therefore, we must split this node...

12
Insert 11 (Continued)

8 12

2 3 10 11 13 27

The m + 1 children are divided evenly between the old and new nodes.
The parent gets one new child. (If the parent become overfull, then it,
too, will have to be split).

13
Remove 8

8 12

2 3 10 11 13 27

Removing 8 might force us to move another key up from one of the

children. It could either be the 3 from the 1st child or the 10 from the
second child.
However, neither child has more than the minimum number of children
(3), so the two nodes will have to be merged. Nothing moves up.
14
Remove 8 (Continued)

2 3 10 11 13 27

The root contains one fewer key, and has one fewer child.

15
Remove 13

2 3 10 11 13 27

Removing 13 would cause the node containing it to become underfull.

To fix this, we try to reassign one key from a sibling that has spares.

16
Remove 13 (Cont)

2 3 10 12 27

The 13 is replaced by the parent’s key 12.

The parent’s key 12 is replaced by the spare key 11 from the left sibling.
The sibling has one fewer element.

17
Remove 11

2 3 10 12 27

11 is in a non-leaf, so replace it by the value immediately preceding: 10.

10 is at leaf, and this node has spares, so just delete it there.

18
Remove 11 (Cont)

2 3 12 27

19
Remove 2

2 3 12 27

Although 2 is at leaf level, removing it leads to an underfull node.

The node has no left sibling. It does have a right sibling, but that node
is at its minimum occupancy already.
Therefore, the node must be merged with its right sibling.

20
Remove 2 (Cont)

3 10 12 27

The result is illegal, because the root does not have at least 2 children.
Therefore, we must remove the root, making its child the new root.

21
Remove 2 (Cont)

3 10 12 27

The new B-tree has only one node, the root.

22
Insert 49

3 10 12 27

Let’s put an element into this B-tree.

23
Insert 49 (Cont)

3 10 12 27 49

Adding this key make the node overfull, so it must be split into two.
But this node was the root.
So we must construct a new root, and make these its children.

24
Insert 49 (Cont)

3 10 27 49

The middle key (12) is moved up into the root.

The result is a B-tree with one more level.

25
B-Tree performance

Let h = height of the B-tree.

get(k): at most h disk accesses. O(h)
put(k): at most 3h + 1 disk accesses. O(h)
remove(k): at most 3h disk accesses. O(h)

h < log d (n + 1)/2 + 1 where d = m/2 (Sahni, p.641).

An important point is that the constant factors are relatively low.
m should be chosen so as to match the maximum node size to
the block size on the disk.
Example: m = 128, d = 64, n  643 = 262144 , h = 4.

26
2-3 Trees
A B-tree of order m is a kind of m-way search
tree.
A B-Tree of order 3 is called a 2-3 Tree.
In a 2-3 tree, each internal node has either 2
or 3 children.
In practical applications, however, B-Trees of
large order (e.g., m = 128) are more common
than low-order B-Trees such as 2-3 trees.
27

File Organization in DBMS
No ratings yet
File Organization in DBMS
23 pages
Dbms PPT For Chapter 7
No ratings yet
Dbms PPT For Chapter 7
45 pages
Dbms Unit 5.2 (Ar16)
No ratings yet
Dbms Unit 5.2 (Ar16)
23 pages
B-Trees DS
No ratings yet
B-Trees DS
28 pages
B-Trees: CSE 373 Data Structures
No ratings yet
B-Trees: CSE 373 Data Structures
29 pages
B Trees
No ratings yet
B Trees
51 pages
Find All Students With Gpa 3.0'': Can Do Binary Search On (Smaller) Index File!
No ratings yet
Find All Students With Gpa 3.0'': Can Do Binary Search On (Smaller) Index File!
42 pages
Tree-Structured Indexes: R & G Chapter 9
No ratings yet
Tree-Structured Indexes: R & G Chapter 9
34 pages
Btrees Animated
No ratings yet
Btrees Animated
77 pages
B+-Trees: Adapted From Mike Franklin
No ratings yet
B+-Trees: Adapted From Mike Franklin
21 pages
20MCA14C-U5
No ratings yet
20MCA14C-U5
26 pages
Module-4 Trees-Search Tree
No ratings yet
Module-4 Trees-Search Tree
110 pages
B Trees
No ratings yet
B Trees
31 pages
Ads 2 Part 3
No ratings yet
Ads 2 Part 3
60 pages
Unit 5
No ratings yet
Unit 5
99 pages
Advanced Data Structures: B-Trees
No ratings yet
Advanced Data Structures: B-Trees
29 pages
Unit V
No ratings yet
Unit V
55 pages
Data Structure Lecture 7 Tree
No ratings yet
Data Structure Lecture 7 Tree
49 pages
B - Trees
No ratings yet
B - Trees
19 pages
5b Tree Indexes
No ratings yet
5b Tree Indexes
41 pages
Data Structures Using C, 2e Jhalak Dutta
No ratings yet
Data Structures Using C, 2e Jhalak Dutta
16 pages
Multiway Search Tree
No ratings yet
Multiway Search Tree
16 pages
B-Tree Resume
No ratings yet
B-Tree Resume
4 pages
B Trees and B Trees
No ratings yet
B Trees and B Trees
24 pages
B-trees
No ratings yet
B-trees
42 pages
B Trees
No ratings yet
B Trees
62 pages
B tree
No ratings yet
B tree
5 pages
9.CCS224_PART 2_Lecture 4 (August 3, 2021)
No ratings yet
9.CCS224_PART 2_Lecture 4 (August 3, 2021)
30 pages
Tree Structured Indexing: Dr. Hari Om Gupta Professor, Department of Electrical Engineering IIT Roorkee
No ratings yet
Tree Structured Indexing: Dr. Hari Om Gupta Professor, Department of Electrical Engineering IIT Roorkee
27 pages
B Trees
No ratings yet
B Trees
27 pages
Software Design Using C++: An Online Book
No ratings yet
Software Design Using C++: An Online Book
11 pages
External Sorting and Searching: B-Trees, Etc
No ratings yet
External Sorting and Searching: B-Trees, Etc
68 pages
B Tree: Muhammad Haris Department of Computer Science M.haris@nu - Edu.pk
No ratings yet
B Tree: Muhammad Haris Department of Computer Science M.haris@nu - Edu.pk
27 pages
B Tree
No ratings yet
B Tree
6 pages
Class 15
No ratings yet
Class 15
18 pages
Lect0208 PDF
No ratings yet
Lect0208 PDF
7 pages
11. Hafta. (2)
No ratings yet
11. Hafta. (2)
60 pages
Multilevel Indexing and B+ Trees
No ratings yet
Multilevel Indexing and B+ Trees
33 pages
B Tree Application
100% (2)
B Tree Application
6 pages
B Trees and Its Variants
No ratings yet
B Trees and Its Variants
55 pages
Lecture02_BTree
No ratings yet
Lecture02_BTree
5 pages
12 - M-Way Tree - Btree - Heap
No ratings yet
12 - M-Way Tree - Btree - Heap
79 pages
DSA-II UNIT-II B Tree
No ratings yet
DSA-II UNIT-II B Tree
46 pages
B Tree
No ratings yet
B Tree
58 pages
Software Design Using C++: An Online Book
No ratings yet
Software Design Using C++: An Online Book
15 pages
Btree Data Structure
No ratings yet
Btree Data Structure
25 pages
Ch18 - B-Trees
No ratings yet
Ch18 - B-Trees
75 pages
ESO 207A / 211 Data Structures and Algorithms
No ratings yet
ESO 207A / 211 Data Structures and Algorithms
13 pages
Algorithms: Modern Systems
No ratings yet
Algorithms: Modern Systems
21 pages
B Trees
No ratings yet
B Trees
25 pages
Chapter_3_File_Organization_Trees_Structures
No ratings yet
Chapter_3_File_Organization_Trees_Structures
21 pages
B Trees 130126021111 Phpapp02
No ratings yet
B Trees 130126021111 Phpapp02
31 pages
B Trees
No ratings yet
B Trees
24 pages
B-Tree Documentation
No ratings yet
B-Tree Documentation
12 pages
Imp Reddy B Trees
No ratings yet
Imp Reddy B Trees
25 pages
B-Trees: Balanced Tree Data Structures
No ratings yet
B-Trees: Balanced Tree Data Structures
0 pages
Search Trees
No ratings yet
Search Trees
55 pages
B-Trees Slides
No ratings yet
B-Trees Slides
24 pages
L04-X-B-Trees của cô nguyễn bích vân- đại học công nghệ thông tin
No ratings yet
L04-X-B-Trees của cô nguyễn bích vân- đại học công nghệ thông tin
24 pages
Tree
No ratings yet
Tree
117 pages
B Tree.pptx
No ratings yet
B Tree.pptx
17 pages
Unit-4 Tree Notes
No ratings yet
Unit-4 Tree Notes
81 pages
Search Tree: Fundamentals and Applications
From Everand
Search Tree: Fundamentals and Applications
Fouad Sabry
No ratings yet
Unit 2 Data Structures, File Organisation and Physical Database Design
No ratings yet
Unit 2 Data Structures, File Organisation and Physical Database Design
13 pages
Study Module 2
No ratings yet
Study Module 2
17 pages
ss2 DPR Second Term
No ratings yet
ss2 DPR Second Term
5 pages
Easy Tri Eve Plus
No ratings yet
Easy Tri Eve Plus
131 pages
File Organization Unit 4 Notes
No ratings yet
File Organization Unit 4 Notes
29 pages
Module 03 Data and Business Intelligence
No ratings yet
Module 03 Data and Business Intelligence
61 pages
140+ SQL Interview Questions and Answers (2022) - Great Learning
No ratings yet
140+ SQL Interview Questions and Answers (2022) - Great Learning
60 pages
OS
No ratings yet
OS
19 pages
Error Messages Version 5x PDF
No ratings yet
Error Messages Version 5x PDF
742 pages
ERP (Enterprise Resource Planning) : AX Training Documentation
No ratings yet
ERP (Enterprise Resource Planning) : AX Training Documentation
24 pages
FINAL MANAGEMENT DOCUMENTS
No ratings yet
FINAL MANAGEMENT DOCUMENTS
15 pages
Database Management Systems Ramakrishnan 3rd Edition Raghu Ramakrishnan All Chapters Instant Download
100% (1)
Database Management Systems Ramakrishnan 3rd Edition Raghu Ramakrishnan All Chapters Instant Download
47 pages
Database Management Systems Ramakrishnan 3rd Edition Raghu Ramakrishnan download
100% (3)
Database Management Systems Ramakrishnan 3rd Edition Raghu Ramakrishnan download
59 pages
Database Management Systems 3rd Edition Raghu Ramakrishnan - The 2025 ebook edition is available with updated content
100% (3)
Database Management Systems 3rd Edition Raghu Ramakrishnan - The 2025 ebook edition is available with updated content
66 pages
Cs8493 - Operating Systems
No ratings yet
Cs8493 - Operating Systems
9 pages
Dbms Notes
No ratings yet
Dbms Notes
16 pages
Introduction To ISAM
No ratings yet
Introduction To ISAM
8 pages
IMS Part 1 - Concepts
No ratings yet
IMS Part 1 - Concepts
17 pages
Download ebooks file Database Management Systems 3rd Edition Raghu Ramakrishnan all chapters
100% (3)
Download ebooks file Database Management Systems 3rd Edition Raghu Ramakrishnan all chapters
65 pages
Database Systems - BIT - University of Colombo - Year 3 (Lecture Note 3)
No ratings yet
Database Systems - BIT - University of Colombo - Year 3 (Lecture Note 3)
64 pages
DBMS Course File
100% (1)
DBMS Course File
39 pages
Database Concepts
No ratings yet
Database Concepts
74 pages
Database Management NOTES
No ratings yet
Database Management NOTES
15 pages
Department of Computer Science CMP 222: File Organization and Management
No ratings yet
Department of Computer Science CMP 222: File Organization and Management
19 pages
Dbms Question Bank Unit I
100% (1)
Dbms Question Bank Unit I
2 pages
DBMS_UNIT_5_NOTES
No ratings yet
DBMS_UNIT_5_NOTES
28 pages
UNIT 2 Part2
No ratings yet
UNIT 2 Part2
35 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Lec7 - B-Trees

Uploaded by

Lec7 - B-Trees

Uploaded by

B-Trees

Minimize the expected or worst-case number of disk

Keep space requirements reasonable -- O(n).

Methods based on binary trees, such as red-black search

Store m records in each disk block.

Use an index that consists of an array with

1.7 5.1 21.2 26.8 ...

Look in the index using, say, either a

Then perform one disk access to read that

What if the index itself is too large to fit entirely in

Insertion and deletion could be very expensive if

Idea 2: Don’t require that each node always be full.

Idea 3: Rebalancing will sometimes be necessary: figure

The root has been 2 and m children.

Removing 8 might force us to move another key up from one of the

Removing 13 would cause the node containing it to become underfull.

The 13 is replaced by the parent’s key 12.

11 is in a non-leaf, so replace it by the value immediately preceding: 10.

Although 2 is at leaf level, removing it leads to an underfull node.

The new B-tree has only one node, the root.

Let’s put an element into this B-tree.

The middle key (12) is moved up into the root.

Let h = height of the B-tree.

h < log d (n + 1)/2 + 1 where d = m/2 (Sahni, p.641).

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.