0% found this document useful (0 votes)

8 views27 pages

Indexing

Uploaded by

Thanuja Reddy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views27 pages

Indexing

Uploaded by

Thanuja Reddy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 27

PHYSICAL ORGANIZATION

Indexing
Outline

1. Indexes

2. Types of single-level indexes

3. Summary of single level indexes

4. Multiple level indexes

1. Indexes (1)

•A single-level index is an auxiliary file that makes it more efficient to

search for a record in the data file.
•The index is usually specified on one field of the file called the indexing
field (although it could be specified on several fields).
•One form of an index is a file of index entries
<field value, pointer to block>, which is ordered
by field value.
•The index is called an access path on the field.
•The index file usually occupies considerably less disk blocks than the
data file because its entries are much smaller and, depending on the type
of index, fewer than the records of the data file.
•A binary search on the index yields a pointer to the data file record or to
the block of the data file that holds the record.
2. Types of single-level indexes (1)

•There are several types of single-level ordered indexes.

•A primary index is specified on the ordering key field of an ordered
file of data records.
•A clustering index is specified on the ordering non-key field of an
ordered file of data records.
•A secondary index can be specified on any
non-ordering field of a file of data records.
•A data file can have at most one primary index or one clustering index
but not both.
•A data file can have multiple secondary indexes in addition to its primary
access path.
2. Types of single-level indexes (2)

•Indexes can be characterized as dense or sparse.

• A dense index has an index entry for every record in the data file.
• A non-dense (or sparse) index has index entries for only some
of the records in the data file.
2.1. Primary indexes (1)

•Defined on an ordered data file

•The data file is ordered on a key field
•Includes one index entry for each block in the data file – hence it is a
sparse index.
•The index entry has the key field value for the first record in the data
block, which is called the block anchor
•A similar scheme can use the last record in a block
2.1. Primary indexes (2)
2.1. Primary indexes (3)

Example: Consider the following data file: EMPLOYEE(NAME,

SSN, ADDRESS, JOB, ... )
Assume that SSN is the ordering key field.
Suppose that:
number of records in the data file r = 30,000 records
record size R =100 bytes
block size B = 1024 bytes
Then, we get:
blocking factor bfr = ⎣B / R ⎦= ⎣1024 / 100 ⎦ =10 records/block
number of data file blocks b = ⎡r / bfr ⎤=(30000 / 10) = 3000
blocks

The binary search cost of the data file would be:

⎡log2b⎤ = ⎡log23000⎤ = 12 block accesses
2.1. Primary indexes (4)

For an index on the SSN field, assume the field size V = 9 bytes,
assume the block pointer size P = 6 bytes.

Then:
index entry size Ri = ( V + P ) = (9+6) = 15 bytes
index blocking factor bfri = ⎣B / Ri ⎦=
⎣1024 / 15 ⎦= 68 entries/block
number of index entries ri = 3000 (since this is a primary index we have one
index entry per data block)
number of index blocks bi = ⎡ri/bfri ⎤=(3000 / 68) = 45 blocks
binary search of the index file needs ⎡log2bi⎤ =
⎡log245⎤ = 6 block accesses
(+1 additional access to the data file to get the block)
Compared 7 to 12 block accesses required for a binary search of the data
file.
2.1. Primary indexes (5)

•A major problem with a primary index is insertion and deletion of

records.

The techniques used for ordered files can be used here too.

•For insertion of records we can use an unordered overflow file or a

linked list of overflow records for each block in the data file.

•For deletion of records we can use deletion markers.

Both the data file and the index file are periodically
reorganized.
2.2. Clustering indexes (1)

•Defined on an ordered data file

The data file is ordered on a non-key field

Includes one index entry for each distinct value of the field; the index
entry points to the first data block that contains records with that field
value.

It is a sparse index since there is one index entry per distinct index field
value rather than per record in the data file.
2.2. Clustering indexes (2)
2.2. Clustering indexes (3)

•To facilitate insertion and deletion of records it is common to reserve in

the data file a whole block (or a cluster of contiguous blocks) for each
value of the index field.

All data records with the same index field value are placed in the same
block (or cluster of blocks).

→
2.2. Clustering indexes (4)
Clustering Index with a Separate Block Cluster for Each Group
of Records That Share the Same Value for the Clustering Field
2.3. Secondary indexes (1)

•Defined on a data file not ordered based on the indexing field.

Can be defined on a key field or a non-key field

2.3. Secondary indexes on a key field

•A secondary index on a key field includes one entry for each record in the
data file; hence, it is a dense index

A pointer in an index entry points to a record or to a block in which the

record is stored.

•A secondary index usually needs more storage space and longer search
time than a primary index (why?)

•The improvement in search time for an arbitrary record is much greater

for a secondary index than for a primary index (why?)
2.3. Secondary indexes on a key field

A Dense Secondary Index (with Block Pointers)

on a Nonordering Key Field of a File
2.3. Secondary indexes on a key field

Example: Consider the previous example: EMPLOYEE(NAME, SSN,

ADDRESS, JOB, ... )
We construct a secondary index on a non-ordering key field of size V = 9
bytes.
As previously:
number of records in the data file r = 30,000 records
record size R =100 bytes
block size B = 1024 bytes
We have computed:
blocking factor bfr = ⎣B / R ⎦= ⎣1024 / 100 ⎦ =10 records/block
number of data file blocks b = ⎡r / bfr ⎤=(30000 / 10) = 3000 blocks
Then:
The average linear search cost of the data file is: (b/2) = 3000/2 = 1500
block accesses (assuming that the record exists in the file)
2.3. Secondary indexes on a key field

For the secondary index on a key field, assume the block pointer size
P = 6 bytes.

Then:
index entry size Ri = ( V + P ) = (9+6) = 15 bytes
index blocking factor bfri = ⎣B / Ri ⎦=
⎣1024 / 15 ⎦= 68 entries/block

number of index blocks bi = ⎡r / bfri ⎤=

(30000 / 68) = 442 blocks (we have one index entry per record in
the data file)
binary search of the index file needs ⎡log2bi⎤ =
⎡log2442⎤ = 9 block accesses
(+1 additional access to the data file to get the data block)
Compare 10 to the 1500 block accesses required on the average for a linear
search of the data file.
2.3. Secondary indexes on a non-key field

• If the indexing field is not a key field of the data file, multiple data records can
have the same value for the indexing field.

• There are different implementation options:

Option 1: Include in the index one entry for each data record – dense index.

Option 2: Use a variable length record for the index entries with a repeating
field for the pointer.

Option 3: (most common) have a single entry for each index field value and
have the pointer point to a block of pointers (or to a cluster or linked list of
blocks if necessary).

• A secondary index provides a logical ordering of the data records by the

indexing field.
→
2.3. Secondary indexes on a non-key field
3. Summary of single-level indexes (1)

Types of indexes based on the properties of the

indexing field.

Ordering Non
field ordering
field
Key field Primary index Secondary
index (key)
Non key field Clustering Secondary
index index (non-key)
3. Summary of single-level indexes (2)

Properties of index types.

TYPE OF NUMBER OF FIRST LEVEL DENSE/ BLOCK

INDEX INDEX ENTRIES SPARSE ANCHORING COMMENT

Primary Number of Blocks in data file Sparse Yes

Yes No distinct values of the ordering

Number of distinct index field field in the same data block
Clustering values Sparse
No

Secondary
(key) Number of records in data file Dense No No

Number of records in data file Dense No

Secondary One index entry for each data
(non key) record
Number of distinct index field Sparse No One index entry for each distinct
value
values of the index field, and separate
blocks of data record pointers
4. Multiple level indexes (1)

•Because a single-level index is an ordered file, we can create a primary

index to the index itself ; in this case, the original index file is called the
first- level index and the index to the index is called the second-level
index.

•The blocking factor of the second (and higher level) indexes is called
fan-out of the multilevel index.

•We can repeat the process, creating a third, fourth, ..., top level until all
entries of the top level fit in one disk block.

•A multi-level index can be created for any type of first-level index

(primary, secondary, clustering) as long as the first-level index consists of
more than one disk block.
4. Multiple level indexes (2)
4. Multiple level indexes (3)
Example: Consider the previous example: EMPLOYEE(NAME, SSN,
ADDRESS, JOB, ...
We convert the dense secondary index into a
multilevel index.
We have computed:
number of first level index blocks b1 = ⎡r/bfr1⎤= (30000 / 68) = 442 blocks.
Then:
The fan-out fo of the multilevel index equals bfr1.
number of second level index blocks b2 = ⎡b1 / fo⎤ =(442 / 68) = 7 blocks
number of third level index blocks b3 = ⎡b2 / fo⎤ =(7 / 68) = 1 block (top level of
the index)

To access a data record through the multilevel index we need 3 + 1 = 4 block

accesses.

Compare 4 to 10 block accesses needed when a single-level index and binary

search is used.
4. Multiple level indexes (4)

•A multi-level index is a form of search tree ; however, insertion and

deletion of new index entries is a severe problem because every level of
the index is an ordered file.

•Because of the insertion and deletion problem, most multi-level indexes

use dynamic multilevel indexes, which leave space in each tree node
(disk block) to allow for new index entries.

•Dynamic multilevel indexes are often implemented using data structures

called B-trees and B+-trees.

FD Controller Instruction Manual Command Reference: 4th Edition
No ratings yet
FD Controller Instruction Manual Command Reference: 4th Edition
124 pages
SingleLevelIndexing Examples
No ratings yet
SingleLevelIndexing Examples
24 pages
Week 15 Physical Database Design Index - CH 17 Updated
No ratings yet
Week 15 Physical Database Design Index - CH 17 Updated
35 pages
Chapter - 3 - Indexing Structures For Files
No ratings yet
Chapter - 3 - Indexing Structures For Files
83 pages
Indexing Structures For Files: Database Design Database Design
No ratings yet
Indexing Structures For Files: Database Design Database Design
9 pages
Ch17Notes Indexing Structures For Files
No ratings yet
Ch17Notes Indexing Structures For Files
39 pages
Index 2
No ratings yet
Index 2
24 pages
FALLSEM2024-25 BCSE302L TH VL2024250101553 2024-09-02 Reference-Material-I
No ratings yet
FALLSEM2024-25 BCSE302L TH VL2024250101553 2024-09-02 Reference-Material-I
48 pages
Indexing
No ratings yet
Indexing
89 pages
Indexing Dbms
No ratings yet
Indexing Dbms
22 pages
Indexing Lecture Nov 2023 Detailed
No ratings yet
Indexing Lecture Nov 2023 Detailed
37 pages
Single-Level Ordered Indexes
No ratings yet
Single-Level Ordered Indexes
12 pages
Chapter 3
No ratings yet
Chapter 3
50 pages
File Organization and Indexing
No ratings yet
File Organization and Indexing
38 pages
Indexing in Database
No ratings yet
Indexing in Database
33 pages
Indexing Structures For Files
No ratings yet
Indexing Structures For Files
30 pages
FALLSEM2019-20 ITE1003 ETH VL2019201002592 Reference Material I 06-Nov-2019 Indexing
No ratings yet
FALLSEM2019-20 ITE1003 ETH VL2019201002592 Reference Material I 06-Nov-2019 Indexing
32 pages
Indexing Structures For Files
No ratings yet
Indexing Structures For Files
23 pages
CO3 Notes Indexing
No ratings yet
CO3 Notes Indexing
11 pages
Lec 09
No ratings yet
Lec 09
52 pages
File Organizations and Indexes
No ratings yet
File Organizations and Indexes
51 pages
Indexing
No ratings yet
Indexing
53 pages
Chapter 3 File Organization Indexed Methods
No ratings yet
Chapter 3 File Organization Indexed Methods
31 pages
Co3 Session 21
No ratings yet
Co3 Session 21
53 pages
Single Level Indexing
No ratings yet
Single Level Indexing
9 pages
Lec06-Indexing in Dbms
No ratings yet
Lec06-Indexing in Dbms
21 pages
08 File Handling
No ratings yet
08 File Handling
18 pages
CO3-Session-09 & 10
No ratings yet
CO3-Session-09 & 10
41 pages
9 Files, Indices and Database Tuning
No ratings yet
9 Files, Indices and Database Tuning
17 pages
Indexing Structures For Files
No ratings yet
Indexing Structures For Files
25 pages
Index and Hashing 2017 Combined
No ratings yet
Index and Hashing 2017 Combined
60 pages
Indexing
No ratings yet
Indexing
41 pages
Indexing
No ratings yet
Indexing
8 pages
Indexing Structures: Professor Navneet Goyal Department of Computer Science & Information Systems BITS, Pilani
No ratings yet
Indexing Structures: Professor Navneet Goyal Department of Computer Science & Information Systems BITS, Pilani
87 pages
File Org & Indexing - DPP 02
No ratings yet
File Org & Indexing - DPP 02
5 pages
Indexing - II
No ratings yet
Indexing - II
57 pages
DBMS1 Week 4
No ratings yet
DBMS1 Week 4
14 pages
Index 1
No ratings yet
Index 1
25 pages
Week 7 - Indexing Structures
No ratings yet
Week 7 - Indexing Structures
25 pages
20-M4-File Organization - Single Level Indexing-09-09-2024
No ratings yet
20-M4-File Organization - Single Level Indexing-09-09-2024
28 pages
Module 4 Indexing
No ratings yet
Module 4 Indexing
20 pages
CH 14
No ratings yet
CH 14
6 pages
Chapter - 2 - Revision
No ratings yet
Chapter - 2 - Revision
26 pages
CNG351 Lecture 12 A
No ratings yet
CNG351 Lecture 12 A
21 pages
7-Indexing and Block
No ratings yet
7-Indexing and Block
20 pages
CNG351 Lecture 12 A
No ratings yet
CNG351 Lecture 12 A
21 pages
Indexing Lecture Nov 2023 Summary
No ratings yet
Indexing Lecture Nov 2023 Summary
41 pages
Weekly Exercises 01
No ratings yet
Weekly Exercises 01
16 pages
Lecture-13 Indexing and Its Types: Subject: DBMS Subject Code: BCA-S301T Faculty: Saurabh Jha
No ratings yet
Lecture-13 Indexing and Its Types: Subject: DBMS Subject Code: BCA-S301T Faculty: Saurabh Jha
16 pages
Lec 20-24
No ratings yet
Lec 20-24
91 pages
File Organization and Indexing
No ratings yet
File Organization and Indexing
13 pages
Indexing
No ratings yet
Indexing
62 pages
Module-5 Dbms Cs208 Notes
No ratings yet
Module-5 Dbms Cs208 Notes
11 pages
Screenshot 2025-03-12 at 9.41.04 AM
No ratings yet
Screenshot 2025-03-12 at 9.41.04 AM
41 pages
Index Method1
No ratings yet
Index Method1
24 pages
Indexing
No ratings yet
Indexing
6 pages
Elmasri - 6e - Ch18
No ratings yet
Elmasri - 6e - Ch18
53 pages
Assignment 3
No ratings yet
Assignment 3
4 pages
Primary Indexing
No ratings yet
Primary Indexing
7 pages
Search Tree: Fundamentals and Applications
From Everand
Search Tree: Fundamentals and Applications
Fouad Sabry
No ratings yet
The Tech Interview Playbook: From DSA to System Design
From Everand
The Tech Interview Playbook: From DSA to System Design
Chinmoy Mukherjee
No ratings yet
4th Sem 2nd Internal Exam Routine
No ratings yet
4th Sem 2nd Internal Exam Routine
1 page
Capgemini Interview Questions
No ratings yet
Capgemini Interview Questions
6 pages
Toolbox PLUS Users Manual 3.11.0
No ratings yet
Toolbox PLUS Users Manual 3.11.0
238 pages
Multiple Output Power Supply
No ratings yet
Multiple Output Power Supply
15 pages
Introduction To Embedded Systems: Printed Book
No ratings yet
Introduction To Embedded Systems: Printed Book
1 page
Q3 Module1 G11 CSS-NCII Sison-Central-Is
No ratings yet
Q3 Module1 G11 CSS-NCII Sison-Central-Is
10 pages
HTML Part2
No ratings yet
HTML Part2
9 pages
A6K Product Manual
No ratings yet
A6K Product Manual
16 pages
Installation Instructions Model DB2-HR: Detector Relay Base
No ratings yet
Installation Instructions Model DB2-HR: Detector Relay Base
8 pages
EQP S3 Software
No ratings yet
EQP S3 Software
57 pages
DDCS V3.1 MANUAL V3 Projeto Final 2
100% (2)
DDCS V3.1 MANUAL V3 Projeto Final 2
84 pages
Invoice 08 26 2021 1671159
No ratings yet
Invoice 08 26 2021 1671159
5 pages
Webinar - Online Conference
No ratings yet
Webinar - Online Conference
4 pages
Devops Unit 4
No ratings yet
Devops Unit 4
6 pages
CX Programer-Help-OMRON FB Library Reference Ver.090612: CPU Position Controller
No ratings yet
CX Programer-Help-OMRON FB Library Reference Ver.090612: CPU Position Controller
31 pages
Revision 2 Board Examination
No ratings yet
Revision 2 Board Examination
9 pages
Cbse - Department of Skill Education Artificial Intelligence
No ratings yet
Cbse - Department of Skill Education Artificial Intelligence
10 pages
Workshop 3-1: Antenna Post-Processing: ANSYS HFSS For Antenna Design
No ratings yet
Workshop 3-1: Antenna Post-Processing: ANSYS HFSS For Antenna Design
51 pages
Elasticsearch and Apache Lucene
No ratings yet
Elasticsearch and Apache Lucene
7 pages
2.PC Jotun Chart1011 PDF
No ratings yet
2.PC Jotun Chart1011 PDF
3 pages
Download: Solutions Intermediate Progress Tests Unit 1answer
No ratings yet
Download: Solutions Intermediate Progress Tests Unit 1answer
2 pages
DCW20 960W DIN Rail Combo DC-UPS / DC-DC Converter: Main Features Embedded User Interface
No ratings yet
DCW20 960W DIN Rail Combo DC-UPS / DC-DC Converter: Main Features Embedded User Interface
4 pages
Manual
No ratings yet
Manual
17 pages
Introduction To Digital Radiography and Pacs: Areej Aloufi
No ratings yet
Introduction To Digital Radiography and Pacs: Areej Aloufi
12 pages
ES Syllabus (E-Next - In)
No ratings yet
ES Syllabus (E-Next - In)
2 pages
Updated Dbms Lab Obe
No ratings yet
Updated Dbms Lab Obe
4 pages
Messaging Gateway
No ratings yet
Messaging Gateway
5 pages
Ak Unit 6 Codetantra Updated
No ratings yet
Ak Unit 6 Codetantra Updated
18 pages
Red PPT Template-71-75
No ratings yet
Red PPT Template-71-75
5 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Indexing

Uploaded by

Indexing

Uploaded by

PHYSICAL ORGANIZATION

2. Types of single-level indexes

3. Summary of single level indexes

4. Multiple level indexes

•A single-level index is an auxiliary file that makes it more efficient to

•There are several types of single-level ordered indexes.

•Indexes can be characterized as dense or sparse.

•Defined on an ordered data file

Example: Consider the following data file: EMPLOYEE(NAME,

The binary search cost of the data file would be:

•A major problem with a primary index is insertion and deletion of

•For insertion of records we can use an unordered overflow file or a

•For deletion of records we can use deletion markers.

•Defined on an ordered data file

The data file is ordered on a non-key field

•To facilitate insertion and deletion of records it is common to reserve in

•Defined on a data file not ordered based on the indexing field.

Can be defined on a key field or a non-key field

A pointer in an index entry points to a record or to a block in which the

•The improvement in search time for an arbitrary record is much greater

A Dense Secondary Index (with Block Pointers)

Example: Consider the previous example: EMPLOYEE(NAME, SSN,

number of index blocks bi = ⎡r / bfri ⎤=

• There are different implementation options:

• A secondary index provides a logical ordering of the data records by the

Types of indexes based on the properties of the

Properties of index types.

TYPE OF NUMBER OF FIRST LEVEL DENSE/ BLOCK

Primary Number of Blocks in data file Sparse Yes

Yes No distinct values of the ordering

Number of records in data file Dense No

•Because a single-level index is an ordered file, we can create a primary

•A multi-level index can be created for any type of first-level index

To access a data record through the multilevel index we need 3 + 1 = 4 block

Compare 4 to 10 block accesses needed when a single-level index and binary

•A multi-level index is a form of search tree ; however, insertion and

•Because of the insertion and deletion problem, most multi-level indexes

•Dynamic multilevel indexes are often implemented using data structures

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.