Hashing in DBMS

Hashing in Database Management Systems (DBMS) is a technique for efficient data retrieval and storage by transforming keys into fixed-size hash codes used for indexing in hash tables. It includes concepts such as hash functions, hash tables, and collision handling methods like chaining and open addressing. Types of hashing include static, dynamic, open addressing, and bucket hashing, each with its advantages and disadvantages regarding efficiency and complexity.

Uploaded by

nvcwrbqonznpijorhc

Hashing in DBMS (Database Management Systems) is a technique used for efficient data
retrieval and storage. It involves transforming a key (often a piece of data) into a fixed-size
value, called a hash code. This hash code is used to index data in a hash table, which allows for
quick searching, insertion, and deletion operations. Hashing is commonly used for indexing,
especially when dealing with large datasets.
• Hash Function: A function that takes an input (key) and produces a fixed-size value,
typically an integer, known as the hash value or hash code.
• Hash Table: A data structure that stores data in an array format, where the position of
each data item is determined by the hash code generated by the hash function.
• Collision: When two different keys produce the same hash value, this is called a
collision. Handling collisions is an important part of the hashing process.
How Hashing Works:
• Key: A piece of data, like a student ID, a name, or any other attribute you want to search
for in the database.
• Hash Function: The key is passed through a hash function, which computes a hash code
for that key.
• Hash Table: The hash code is used as an index to insert the data into a hash table (or a
similar structure like a hash map). This allows quick access to the data.
• Search Operation: To search for a key, the system computes the hash code for the key,
directly accessing the corresponding position in the hash table.
Example:
Let's say we are creating a hash table to store student records, and each student has a unique
student ID.
• Step 1: Define a Hash Function Suppose the hash function is a simple one:
$\text{hash}(\text{key}) = \text{key} \bmod \text{table size}$
where key is the student ID and table size is the size of the hash table.
• Step 2: Hash Table Creation Suppose we have a hash table of size 10, and the student
IDs are as follows:
• Student ID: 123, 456, 789, 234, 567
• Step 3: Compute Hash Values For each student ID, we apply the hash function to
compute the hash value:
• Hash(123) = 123 % 10 = 3
• Hash(456) = 456 % 10 = 6
• Hash(789) = 789 % 10 = 9
• Hash(234) = 234 % 10 = 4
• Hash(567) = 567 % 10 = 7
• Step 4: Insert into the Hash Table The data will be stored in the hash table at the
corresponding index:
• Index 0: (empty)
• Index 1: (empty)
• Index 2: (empty)
• Index 3: Student ID 123
• Index 4: Student ID 234
• Index 5: (empty)
• Index 6: Student ID 456
• Index 7: Student ID 567
• Index 8: (empty)
• Index 9: Student ID 789
• Step 5: Searching To find a student with a given ID, say 456, we calculate the hash
value:
• Hash(456) = 456 % 10 = 6
• We then look at index 6 in the hash table and find the student record.
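The five steps above can be sketched in a few lines of Python. This is only an illustration of the modulo method with the same student IDs; the names `hash_key`, `build_table`, and `search` are made up for this sketch, and it assumes integer keys that do not collide.

```python
TABLE_SIZE = 10  # same table size as in the example


def hash_key(key: int, table_size: int = TABLE_SIZE) -> int:
    """Map an integer key to a table index using key mod table size."""
    return key % table_size


def build_table(student_ids, table_size: int = TABLE_SIZE):
    """Store each ID at its hashed index (this sketch rejects collisions)."""
    table = [None] * table_size
    for sid in student_ids:
        index = hash_key(sid, table_size)
        if table[index] is not None:
            raise ValueError(f"collision at index {index}")
        table[index] = sid
    return table


def search(table, key: int) -> bool:
    """Recompute the hash and look only at that one position."""
    return table[hash_key(key, len(table))] == key


table = build_table([123, 456, 789, 234, 567])
print(table)               # [None, None, None, 123, 234, None, 456, 567, None, 789]
print(search(table, 456))  # True
```

The search never scans the table: it recomputes the index from the key and inspects a single slot, which is where the speed of hashing comes from.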
Handling Collisions:
If two student IDs were to hash to the same index (a collision), there are various methods to
handle this:
• Chaining: Store multiple values at the same index using a linked list.
• Open Addressing: Search for the next available slot in the hash table.
Example of Collision:
Suppose the following IDs hash to the same value:
• Hash(123) = 3
• Hash(113) = 3
With chaining, both records would be stored at index 3:
Index 3: (113 -> 123)
This allows both student IDs to be stored at the same location without overwriting each other.
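A minimal sketch of chaining, assuming integer keys and the same modulo hash function. The class name `ChainedHashTable` is hypothetical, and a Python list stands in for the linked list at each index; new keys are placed at the front of the chain so the result matches the example.

```python
class ChainedHashTable:
    """Hash table where each index holds a chain of colliding keys."""

    def __init__(self, size: int = 10):
        self.size = size
        self.slots = [[] for _ in range(size)]  # one (initially empty) chain per index

    def insert(self, key: int) -> None:
        chain = self.slots[key % self.size]
        if key not in chain:
            chain.insert(0, key)  # add at the head of the chain

    def contains(self, key: int) -> bool:
        # Only the chain at the hashed index has to be scanned.
        return key in self.slots[key % self.size]

    def delete(self, key: int) -> bool:
        chain = self.slots[key % self.size]
        if key in chain:
            chain.remove(key)
            return True
        return False


chained = ChainedHashTable(10)
chained.insert(123)
chained.insert(113)
print(chained.slots[3])  # [113, 123]
```

Lookups stay fast as long as chains are short; if many keys land on the same index, a search degrades toward scanning a list.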
Advantages of Hashing:
• Efficient Search: Hashing provides fast data retrieval (O(1) time complexity on
average).
• Efficient Insertion and Deletion: Adding or removing data can also be done quickly.
Disadvantages:
• Collision Handling: Managing collisions can become complex and may degrade
performance.
• Memory Usage: Hash tables may require a significant amount of memory, especially when
the table is allocated much larger than the number of records it actually holds.
1. Static Hashing
In static hashing, a fixed-size hash table is used to store the data, and the size of the table remains
unchanged. The hash function is used to map a key to a particular index within this fixed-size
table.
• Example: Consider a hash table of size 10, and the hash function is Hash(Key) = Key %
10. If we have keys (e.g., 21, 32, 43, 54), the data will be stored based on the modulus of
the key:
• Hash(21) = 21 % 10 = 1 → Stored at index 1
• Hash(32) = 32 % 10 = 2 → Stored at index 2
• Hash(43) = 43 % 10 = 3 → Stored at index 3
• Hash(54) = 54 % 10 = 4 → Stored at index 4
Since the table size is fixed, inserting more records than the table (or a given slot in it)
can hold leads to a problem called overflow.
2. Dynamic Hashing
Dynamic hashing addresses the issue of static hashing where the size of the hash table is fixed
and may lead to overflow. In dynamic hashing, the hash table grows or shrinks dynamically
based on the number of records. This helps in reducing collisions and provides flexibility in
dealing with the overflow situation.
Types of Dynamic Hashing:
• Extendible Hashing
• Linear Hashing
Extendible Hashing
Extendible hashing uses a directory of pointers to hash buckets and grows the directory size
dynamically as needed. It allows for splitting of buckets and doubling of the directory size to
accommodate additional records.
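• Example: Let's assume the directory has a global depth of 1, which means it has only 2
entries (for hash values 0 and 1), each pointing to its own bucket. When a bucket
overflows and the directory cannot tell its records apart with the bits it currently uses,
we double the directory size and split only the overflowing bucket into two buckets.
If a new record (say 5) is inserted into the table, it is hashed as Hash(5) = 5 % 2 = 1.
If bucket 1 is already full, it overflows: the directory doubles to 4 entries (global
depth 2), and the records of bucket 1 are redistributed between the old bucket and a
new one using one more bit of the hash value (Key % 4). Bucket 0 is left untouched.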
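The directory doubling and bucket split can be sketched as follows. This is a simplified illustration, not a production design: it assumes non-negative integer keys, uses the low-order bits of the key as the hash (so with global depth 1 it equals Key % 2), and the names `Bucket`, `ExtendibleHashTable`, and `local_depth` are chosen for this sketch.

```python
class Bucket:
    def __init__(self, local_depth: int, capacity: int):
        self.local_depth = local_depth  # number of hash bits this bucket distinguishes
        self.capacity = capacity
        self.keys = []


class ExtendibleHashTable:
    def __init__(self, bucket_capacity: int = 1):
        self.global_depth = 1
        self.directory = [Bucket(1, bucket_capacity), Bucket(1, bucket_capacity)]

    def _index(self, key: int) -> int:
        # Directory index = low-order global_depth bits of the key.
        return key & ((1 << self.global_depth) - 1)

    def contains(self, key: int) -> bool:
        return key in self.directory[self._index(key)].keys

    def insert(self, key: int) -> None:
        while True:
            bucket = self.directory[self._index(key)]
            if key in bucket.keys:
                return
            if len(bucket.keys) < bucket.capacity:
                bucket.keys.append(key)
                return
            self._split(bucket)  # bucket is full: split it, then retry

    def _split(self, bucket: Bucket) -> None:
        if bucket.local_depth == self.global_depth:
            # Double the directory; the new half points at the same buckets.
            self.directory = self.directory + self.directory
            self.global_depth += 1
        bucket.local_depth += 1
        sibling = Bucket(bucket.local_depth, bucket.capacity)
        new_bit = 1 << (bucket.local_depth - 1)
        # Entries that pointed at the old bucket and have the new bit set
        # are redirected to the sibling bucket.
        for i, entry in enumerate(self.directory):
            if entry is bucket and (i & new_bit):
                self.directory[i] = sibling
        # Only the keys of the split bucket are redistributed.
        old_keys, bucket.keys = bucket.keys, []
        for k in old_keys:
            (sibling if k & new_bit else bucket).keys.append(k)


ext = ExtendibleHashTable(bucket_capacity=1)
ext.insert(3)  # 3 % 2 = 1, goes to bucket 1
ext.insert(5)  # 5 % 2 = 1, bucket 1 is full, so the directory doubles
print(ext.global_depth, len(ext.directory))  # 2 4
```

After the split, key 3 sits under directory entry 3 and key 5 under entry 1, while entries 0 and 2 still share the untouched bucket 0.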
Linear Hashing
Linear hashing grows the hash table gradually, one bucket at a time. When the load on the
table reaches a certain threshold, one new bucket is added and the records of one existing
bucket are rehashed between that bucket and the new one. Buckets are split in a fixed, linear
order (not necessarily the bucket that overflowed), and all other records stay where they are.
• Example: If buckets hold 4 records each and the table passes its load threshold, the system
adds one new bucket and rehashes the records of the next bucket in line for splitting. A
bucket that fills up before its turn keeps the extra records in an overflow chain until it is split.
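A rough sketch of linear hashing, under stated assumptions: integer keys, a modulo hash function, a split triggered when the average number of records per bucket exceeds `max_load`, and unbounded Python lists standing in for a bucket plus its overflow chain. The names `LinearHashTable`, `level`, and `split_ptr` are chosen for this illustration.

```python
class LinearHashTable:
    def __init__(self, initial_buckets: int = 2, max_load: float = 2.0):
        self.n0 = initial_buckets  # number of buckets at the start of round 0
        self.level = 0             # current round; bucket count doubles per round
        self.split_ptr = 0         # next bucket to be split in this round
        self.buckets = [[] for _ in range(initial_buckets)]
        self.count = 0
        self.max_load = max_load   # average records per bucket that triggers a split

    def _address(self, key: int) -> int:
        index = key % (self.n0 * 2 ** self.level)
        if index < self.split_ptr:
            # This bucket was already split in the current round,
            # so the next-round hash function decides between the two halves.
            index = key % (self.n0 * 2 ** (self.level + 1))
        return index

    def contains(self, key: int) -> bool:
        return key in self.buckets[self._address(key)]

    def insert(self, key: int) -> None:
        bucket = self.buckets[self._address(key)]
        if key in bucket:
            return
        bucket.append(key)
        self.count += 1
        if self.count / len(self.buckets) > self.max_load:
            self._split()

    def _split(self) -> None:
        next_mod = self.n0 * 2 ** (self.level + 1)
        old_keys = self.buckets[self.split_ptr]
        # Rehash only the keys of the bucket at split_ptr.
        stay = [k for k in old_keys if k % next_mod == self.split_ptr]
        move = [k for k in old_keys if k % next_mod != self.split_ptr]
        self.buckets[self.split_ptr] = stay
        self.buckets.append(move)  # the single new bucket added by this split
        self.split_ptr += 1
        if self.split_ptr == self.n0 * 2 ** self.level:
            # Every bucket of this round has been split: start the next round.
            self.level += 1
            self.split_ptr = 0


lh = LinearHashTable(initial_buckets=2, max_load=2.0)
for key in range(20):
    lh.insert(key)
print(len(lh.buckets), all(lh.contains(k) for k in range(20)))
```

Each split touches one bucket only, so the cost of growing the table is spread over many insertions instead of a single full rehash.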
3. Open Addressing Hashing
In open addressing, all data is stored directly in the hash table itself. When a collision occurs
(i.e., two keys hash to the same index), the system tries to find another open slot within the table
based on a probe sequence. Open addressing works best when the table is not too full, because
probe sequences get longer as more slots are occupied.
Types of Open Addressing:
• Linear Probing
• Quadratic Probing
• Double Hashing
Linear Probing
In linear probing, when a collision occurs, the system checks the following indexes one at a
time (index + 1, index + 2, etc., wrapping around to the start of the table) until an empty slot
is found.
• Example: If we have a hash table of size 5 and a hash function Hash(Key) = Key % 5:
• Hash(12) = 12 % 5 = 2 → Insert at index 2.
• Hash(17) = 17 % 5 = 2, but index 2 is already occupied (by 12). So, the system
checks index 3.
• Index 3 is empty, so 17 is inserted at index 3.
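The example can be reproduced with a short sketch. It assumes integer keys, uses `None` to mark an empty slot, and wraps around the end of the table with a modulo; the function names are hypothetical. Deletion is left out, since removing a key from an open-addressing table needs extra care (for example a "deleted" marker) so that later searches do not stop too early.

```python
def linear_probe_insert(table, key):
    """Insert key, stepping one slot at a time past occupied slots."""
    size = len(table)
    home = key % size
    for step in range(size):
        index = (home + step) % size  # wrap around the end of the table
        if table[index] is None or table[index] == key:
            table[index] = key
            return index
    raise OverflowError("hash table is full")


def linear_probe_search(table, key):
    """Follow the same probe sequence; an empty slot means the key is absent."""
    size = len(table)
    home = key % size
    for step in range(size):
        index = (home + step) % size
        if table[index] is None:
            return None
        if table[index] == key:
            return index
    return None


probe_table = [None] * 5
print(linear_probe_insert(probe_table, 12))  # 2
print(linear_probe_insert(probe_table, 17))  # 3
print(probe_table)                           # [None, None, 12, 17, None]
```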
Quadratic Probing
Quadratic probing works similarly to linear probing, but instead of checking the next slot, it
checks slots at quadratically increasing offsets (e.g., index + 1^2, index + 2^2, index + 3^2, etc.),
each taken modulo the table size so that the probe stays inside the table.
• Example: Using the same hash table as before with Hash(Key) = Key % 5:
• Hash(12) = 12 % 5 = 2 → Insert at index 2.
• Hash(17) = 17 % 5 = 2, but index 2 is occupied, so the system checks index
(2 + 1^2) % 5 = 3. If that is also occupied, it checks (2 + 2^2) % 5 = 1, because
6 is past the end of a table of size 5 and wraps around.
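A small sketch of the quadratic probe sequence, assuming integer keys and offsets of i^2 taken modulo the table size (other offset formulas exist). The function names are made up for this illustration.

```python
def quadratic_probe_sequence(key, size):
    """Indexes visited for a key: home, home + 1^2, home + 2^2, ... (mod size)."""
    home = key % size
    return [(home + i * i) % size for i in range(size)]


def quadratic_probe_insert(table, key):
    """Insert key at the first free slot along its quadratic probe sequence."""
    for index in quadratic_probe_sequence(key, len(table)):
        if table[index] is None or table[index] == key:
            table[index] = key
            return index
    raise OverflowError("no free slot found along the probe sequence")


print(quadratic_probe_sequence(17, 5))  # [2, 3, 1, 1, 3]

quad_table = [None] * 5
print(quadratic_probe_insert(quad_table, 12))  # 2
print(quadratic_probe_insert(quad_table, 17))  # 3
```

The printed sequence shows a known limitation: for this table size the probes only ever reach indexes 2, 3, and 1, so an insert can fail even though indexes 0 and 4 are free.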

Double Hashing
Double hashing uses two hash functions. The first gives the home index; if a collision occurs,
the second gives the step size used to move from that index to the next candidate index.
• Example: Let’s assume two hash functions:
• Hash1(Key) = Key % 5
• Hash2(Key) = 1 + (Key % 4)
If Hash1(17) = 2 and index 2 is occupied, double hashing calculates a step size:
• Hash2(17) = 1 + (17 % 4) = 1 + 1 = 2
The system will then try index (2 + 2) % 5 = 4.
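The same two hash functions can be put into a short sketch. It assumes integer keys and a table of size 5; the function names are hypothetical. Because 5 is prime and the step is always between 1 and 4, the probe sequence eventually visits every slot.

```python
def hash1(key, size):
    """Primary hash: the home index."""
    return key % size


def hash2(key):
    """Secondary hash: the step size (never 0, so the probe always moves)."""
    return 1 + (key % 4)


def double_hash_insert(table, key):
    """Insert key, moving hash2(key) slots at a time after a collision."""
    size = len(table)
    home = hash1(key, size)
    step = hash2(key)
    for i in range(size):
        index = (home + i * step) % size
        if table[index] is None or table[index] == key:
            table[index] = key
            return index
    raise OverflowError("no free slot found along the probe sequence")


dh_table = [None] * 5
print(double_hash_insert(dh_table, 12))  # 2
print(double_hash_insert(dh_table, 17))  # 4
```

Keys that share a home index usually get different step sizes, which spreads them out more than linear or quadratic probing would.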

4. Bucket Hashing
In bucket hashing, each index of the hash table refers to a bucket that can hold multiple
records with the same hash value (i.e., records whose keys collide). This is similar to
chaining, except that a bucket typically has a fixed capacity (in a DBMS, often one disk block).
• Example: Suppose we have a hash table with the hash function Hash(Key) = Key % 5.
If keys 12 and 17 both hash to index 2:
• At index 2, we store both keys in a bucket.
The bucket allows us to store multiple items at the same index, so a collision does not
force a record to be placed elsewhere until the bucket itself is full.
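A minimal sketch of bucket hashing, assuming integer keys, a fixed bucket capacity of 2, and a single shared overflow area for records that do not fit. The shared overflow area is one possible design choice for this illustration, and the class name `BucketHashTable` is hypothetical.

```python
class BucketHashTable:
    def __init__(self, num_buckets: int = 5, bucket_capacity: int = 2):
        self.num_buckets = num_buckets
        self.bucket_capacity = bucket_capacity
        self.buckets = [[] for _ in range(num_buckets)]
        self.overflow = []  # assumed shared area for records that do not fit

    def insert(self, key: int) -> None:
        bucket = self.buckets[key % self.num_buckets]
        if key in bucket or key in self.overflow:
            return
        if len(bucket) < self.bucket_capacity:
            bucket.append(key)
        else:
            self.overflow.append(key)  # bucket is full: record overflows

    def contains(self, key: int) -> bool:
        return key in self.buckets[key % self.num_buckets] or key in self.overflow


bh = BucketHashTable(num_buckets=5, bucket_capacity=2)
for key in (12, 17, 22):
    bh.insert(key)
print(bh.buckets[2])  # [12, 17]
print(bh.overflow)    # [22]
```

Keys 12 and 17 share bucket 2 without any probing; only the third colliding key, 22, has to go to the overflow area.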
Summary of Hashing Types:
• Static Hashing: A fixed-size hash table; prone to overflow issues.
• Dynamic Hashing: The hash table grows/shrinks dynamically; extendible hashing and
linear hashing are common types.
• Open Addressing: The hash table stores elements directly in the table; uses linear
probing, quadratic probing, or double hashing to handle collisions.
• Bucket Hashing: Stores multiple records in a bucket to handle collisions, reducing the
impact of a high number of collisions.
Advantages and Disadvantages:
• Advantages:
• Efficient data retrieval and insertion.
• Reduces the search space for finding data.
• Disadvantages:
• Collisions: Can still be problematic depending on the method used.
• Complexity: Some methods (like dynamic hashing or double hashing) can be
complex to implement.
