0% found this document useful (0 votes)

16 views

Lec 7 Query Processing, Optimization & Indexing

The document covers the fundamentals of query processing in database management systems, detailing the stages of parsing, optimization, and execution. It discusses various optimization techniques, including heuristic and cost-based methods, as well as different join algorithms such as nested loop, hash, and merge joins. Additionally, it provides insights into execution plans and how to optimize SQL queries for efficiency.

Uploaded by

mhariskhan513

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views

Lec 7 Query Processing, Optimization & Indexing

Uploaded by

mhariskhan513

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 29

Query Processing, Optimization &

Indexing
Lecture Agenda
⚫ Query Processing in DBMS
⚫ Query Parsing & Optimization
⚫ Execution Plans & Join Algorithms
⚫ Indexing & Index Structures
⚫ B-Trees, Hash Indexes
⚫ Unique, Composite, and Covering Indexes
What is Query Processing?
⚫ Query processing is the series of steps a database
management system (DBMS) follows to interpret and
execute a user's request (query) to retrieve or
manipulate data stored in a database.
⚫ Translation of SQL queries into low-level instructions
Query Processing Phases
Query Processing Involves multiple stages: Parsing,
Optimization, Execution
⚫ Parsing: Analyzing SQL syntax and semantics
⚫ Optimization: Finding the most efficient execution
plan
⚫ Execution: Running the query using the selected plan
Query Parsing
⚫ Query parsing is the process of analyzing a query
written in SQL (or another query language) to
understand its syntax and structure.
⚫ The parser checks whether the query follows the
correct grammar of the query language. If the syntax is
correct, it translates the query into a parse tree or
abstract syntax tree (AST), which is a tree
representation of the query's logical structure.
Steps in Query Parsing
⚫ Lexical Analysis: The query string is divided into
tokens (e.g., keywords, operators, identifiers).
⚫ Syntax Analysis: The tokens are checked against the
grammar of the query language to ensure correct
structure.
⚫ Semantic Analysis: The parser checks for semantic
errors (e.g., referencing non-existent tables or
columns).
⚫ Query Rewrite: Some queries are rewritten for
optimization purposes (e.g., transforming a subquery
into a join).
Query Optimization
⚫ Query Optimization is the process where the database
system evaluates multiple ways to execute a query and
chooses the most efficient one, usually based on cost
estimates (e.g., time, memory, or I/O operations).
⚫ It happens after parsing and translating the query into a
logical plan, but before actual execution.
⚫ Why Optimize? Reduce I/O, CPU, memory use
Types of Query Optimization
⚫ Query optimization can be categorized in several ways
based on how and when the optimization occurs
⚫ Key Types are
1. Heuristic query optimization
2. Cost-based Query optimization
Heuristic Query Optimization
⚫ Uses a set of rules of thumb (heuristics) to transform
and simplify the query into a more efficient form
without considering cost estimates.
Key Techniques:
⚫ Apply selection operations as early as possible (push
down selections).
⚫ Combine selections and projections to reduce the
number of columns/rows.
⚫ Use smaller tables first in joins.
⚫ Reorder joins and other operations based on known
patterns.
Heuristic optimization
Advantages:
⚫ Fast and simple
⚫ Good for basic improvements
Disadvantage:
Doesn’t always find the most efficient plan
Cost-Based Query Optimization
⚫ Evaluates multiple possible query execution plans using
statistics (e.g., table size, indexes, row selectivity) and chooses
the one with the lowest estimated cost.
Key Steps:
⚫ Generate All Possible Execution Plans: The optimizer
generates all possible ways to execute the query using
different combinations of joins, scans, and sorting.
⚫ Estimate the Cost of Each Plan: Each plan’s cost is
estimated based on factors such as:
⚫ Number of I/O operations: Reading and writing data.
⚫ CPU time: Time spent processing data.
⚫ Memory usage: Storage used during query processing.
⚫ Select the Best Plan: The plan with the lowest cost is
selected as the optimal execution plan.
Cost Based optimization
Factors Affecting Query Cost:
⚫ Table Size: Larger tables require more I/O
operations to read.
⚫ Index Availability: If indexes are available, they
can speed up data retrieval.
⚫ Join Methods: Different join algorithms (e.g.,
nested loop, merge join, hash join) have different
costs.
⚫ Sort Operations: Sorting data can be expensive if
it requires additional memory or disk I/O.
Cost Based optimization
Advantage:
⚫ More accurate and efficient plans
⚫ Adapts to real data distribution
Disadvantage:
⚫ More time-consuming than heuristic optimization
⚫ Depends heavily on up-to-date statistics
Query Execution
⚫ A Query Execution Plan (QEP) is a step-by-step
strategy chosen by the database management system
(DBMS) to execute a SQL query efficiently. After
optimization, the DBMS selects the best plan and uses
it to retrieve or modify the data.
⚫ Generated by the optimizer
⚫ Describes how tables are accessed and joined
Execution Plan
⚫ Access Paths
How data will be accessed: full table scan, index scan, etc.
⚫ Join Methods
How tables will be joined: nested loop, hash join, merge
join.
⚫ Join Order
In what sequence tables will be joined.
⚫ Selection/Projection
When and how WHERE and SELECT clauses are applied.
⚫ Intermediate Steps
Temporary tables, sorting, filtering, grouping.
⚫ Estimated Costs
Estimated time, CPU usage, I/O operations, number of
rows processed at each step.
Example: Optimized Query Execution
Plan
⚫ Student(student_id, name, age, course_id)
⚫ Course(course_id, course_name)
SELECT s.name, c.course_name
FROM Student s
JOIN Course c ON s.course_id = c.course_id
WHERE s.age > 20;
Execution Plan
Step Operation Table Method Notes
Use index on
1 Index Scan Student B-Tree Index age to filter
early
2 Filter Student - s.age > 20
Student + Join on
3 Hash Join Hash Table
Course course_id
Load all
Table Access Full Table
4 Course course
(Full) Scan
records
Output
s.name,
5 Projection Result -
c.course_na
me
How to View Execution Plans

⚫ MYSQL EXPLAIN Select

Join Algorithms Overview
The type of join used in the execution plan of a query
depends on various factors, such as:
⚫ The size of the tables involved
⚫ Availability of indexes
⚫ Join condition (e.g., equality vs inequality)
⚫ The database engine's optimizer
Selection depends on: Input size, Sort order, Join
condition
Types of Join Algorithms
Types of Join Algorithms:
⚫ Nested Loop Join,
⚫ Hash Join,
⚫ Merge Join
Nested Loop Join
⚫ Brute-force method: Compare every pair
⚫ Time Complexity: O(n × m)
⚫ Best For: Small datasets or indexed lookups
How it works: For each row in the outer table, the DBMS
searches for matching rows in the inner table.
Best when: One table is small, or there's an index on the join
column of the inner table.
⚫ Example in plan:
Nested Loop
-> Index Scan on Student
-> Index Lookup on Course
Hash Join
⚫ Build Phase: Hash smaller table
⚫ Probe Phase: Scan larger table and match using hash
⚫ Efficient for equi-joins
How it works:
⚫ Build a hash table on the smaller table using the join key.
⚫ Scan the larger table and use the hash table to find matches.
Best when: No indexes, large tables, and equality joins.
Example in plan:
Hash Join
-> Seq Scan on Student
-> Hash on Course
Merge Join
⚫ Works on sorted inputs
⚫ Time Complexity: O(n + m)
⚫ Efficient for large, pre-sorted data
How it works: Both tables are sorted on the join key,
and then merged together like in merge sort.
Best when: Tables are already sorted or sorting is
efficient.
Example in plan:
Merge Join
-> Sort on Student.course_id
-> Sort on Course.course_id
Example
Consider Schema
Student(student_id, name, age, course_id)
Course(course_id, course_name)

Query:
Select the names of students enrolled in course of
database
Query optimization & Joins Example
⚫ Plan 1:
Select s.name from student
Where course_id
IN (select course_id from course where course_name =
Database)
⚫ Plan 2:
Select s.name from student s JOIN course c
ON s.course_id = c.course_id
Where c.course_name = ‘Database’;
Query optimization & Joins Example
⚫ Plan 3:
WITH registration AS(select course_id from course
where course_name = ‘Database’)
Select s.name from student s JOIN registration r ON
s.course_id = r.course_id;
Join Algorithm Comparison
⚫ Nested Loop: Small tables, O(n × m)
⚫ Hash Join: Medium/Large, EQ joins, O(n + m)
⚫ Merge Join: Sorted data, O(n + m)

Join Type Best For Common in execution

plan when
Nested Loop Small + Large Index exists on inner
Join (with index) table
Hash Join Large + Large No index, but equality
(equality only) join
Merge Join Sorted data Data already sorted or
sorted easily
Class Assignment
⚫ Optimize this query:
⚫ SELECT E.name, D.name FROM Employees E JOIN
Departments D ON E.dept_id = D.id WHERE E.salary
> 70000;
⚫ Suggest 2 execution plans, Recommend suitable
indexes

Sop DBA Ajay
100% (1)
Sop DBA Ajay
76 pages
Amazon DEA-C01 AWS Certified Data Engineer - Associate Dumps
No ratings yet
Amazon DEA-C01 AWS Certified Data Engineer - Associate Dumps
20 pages
THE ETL PROCESS - Abboub - Mohamed - El - Mehdi
100% (1)
THE ETL PROCESS - Abboub - Mohamed - El - Mehdi
14 pages
Chapter - 1 - Query Optimization
No ratings yet
Chapter - 1 - Query Optimization
38 pages
Chapter 5
No ratings yet
Chapter 5
45 pages
Dbms Seminar
No ratings yet
Dbms Seminar
24 pages
Optimization of Queries
No ratings yet
Optimization of Queries
6 pages
query
No ratings yet
query
10 pages
Query Processing
No ratings yet
Query Processing
5 pages
14. DB - Lecture Query Optimization
No ratings yet
14. DB - Lecture Query Optimization
80 pages
Rdbms Assignment
No ratings yet
Rdbms Assignment
12 pages
SQL Query Optimization Help Book
No ratings yet
SQL Query Optimization Help Book
8 pages
SQL Server Execution Plan
No ratings yet
SQL Server Execution Plan
17 pages
Chapter 2 Query Optimization
No ratings yet
Chapter 2 Query Optimization
31 pages
Adir QB
No ratings yet
Adir QB
27 pages
Query Processing and Optimization
No ratings yet
Query Processing and Optimization
127 pages
Querry Optimization
No ratings yet
Querry Optimization
13 pages
Ivunit Query Processing
No ratings yet
Ivunit Query Processing
12 pages
Query Proc Notes
No ratings yet
Query Proc Notes
10 pages
Query Processing and Optimization
No ratings yet
Query Processing and Optimization
23 pages
Chapter 2 Querry Proccessing
No ratings yet
Chapter 2 Querry Proccessing
7 pages
Advancedchapter 2 2013
No ratings yet
Advancedchapter 2 2013
16 pages
Advanced Database System Chapter Three Query Processing and Optimization
No ratings yet
Advanced Database System Chapter Three Query Processing and Optimization
94 pages
NICE ONE - SQL Optimization
No ratings yet
NICE ONE - SQL Optimization
60 pages
Chapter One1
No ratings yet
Chapter One1
21 pages
Chapter 4 Query Optimization
100% (2)
Chapter 4 Query Optimization
35 pages
BCS Topic
No ratings yet
BCS Topic
66 pages
Chapter 2 Query Processing and Optimization
No ratings yet
Chapter 2 Query Processing and Optimization
58 pages
chapter 2
No ratings yet
chapter 2
47 pages
Week09 QPO
No ratings yet
Week09 QPO
56 pages
Presentation9 - Query Processing and Query Optimization in DBMS
No ratings yet
Presentation9 - Query Processing and Query Optimization in DBMS
36 pages
2 Algorithms For Query Processing Optimization
No ratings yet
2 Algorithms For Query Processing Optimization
46 pages
Data Warehousing: Need For Speed: Join Techniques
No ratings yet
Data Warehousing: Need For Speed: Join Techniques
22 pages
Module - 4
No ratings yet
Module - 4
60 pages
Lecture 7
No ratings yet
Lecture 7
25 pages
Lecture11 Query Processing
No ratings yet
Lecture11 Query Processing
37 pages
Advance Database Management System: Unit - 2 .Query Processing and Optimization
No ratings yet
Advance Database Management System: Unit - 2 .Query Processing and Optimization
38 pages
Advanced Database Chapter Two Query Processing and Optimization
100% (1)
Advanced Database Chapter Two Query Processing and Optimization
43 pages
SQL Tuning
No ratings yet
SQL Tuning
27 pages
Query Processing Concepts
No ratings yet
Query Processing Concepts
99 pages
Query
No ratings yet
Query
14 pages
IT212 LECTURE 7
No ratings yet
IT212 LECTURE 7
9 pages
Chapter 1 Query Processing
No ratings yet
Chapter 1 Query Processing
58 pages
Chapter 8
No ratings yet
Chapter 8
65 pages
mastering sql query performance_ an in-depth optimization g…
No ratings yet
mastering sql query performance_ an in-depth optimization g…
6 pages
Ad Database All Slide
No ratings yet
Ad Database All Slide
49 pages
Ch1 Query Processing (2)
No ratings yet
Ch1 Query Processing (2)
49 pages
Chapter - 2 Query Processing
No ratings yet
Chapter - 2 Query Processing
61 pages
ADBChapter 1
No ratings yet
ADBChapter 1
32 pages
Chapter 2 - Query Optimization
No ratings yet
Chapter 2 - Query Optimization
40 pages
Adbs CH2
No ratings yet
Adbs CH2
56 pages
Query Optimization
No ratings yet
Query Optimization
9 pages
Chapter 2 Query Processing
No ratings yet
Chapter 2 Query Processing
56 pages
Chapter - 2 Query Processing
No ratings yet
Chapter - 2 Query Processing
61 pages
Chapter - 2 Query Processing
No ratings yet
Chapter - 2 Query Processing
63 pages
Query Evaluation
No ratings yet
Query Evaluation
51 pages
ADB Chapter 2 DB Part1
No ratings yet
ADB Chapter 2 DB Part1
10 pages
Advanced Database Systems Chapter 2
100% (1)
Advanced Database Systems Chapter 2
16 pages
05 - Strategies For Query Processing (Ch18)
No ratings yet
05 - Strategies For Query Processing (Ch18)
50 pages
Chapter 1 Query Processing
100% (1)
Chapter 1 Query Processing
63 pages
Proven Process For SQL Tuning: Dean Richards Senior DBA, Confio Software
No ratings yet
Proven Process For SQL Tuning: Dean Richards Senior DBA, Confio Software
30 pages
Search Algorithm: Fundamentals and Applications
From Everand
Search Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet
Alternating Decision Tree: Fundamentals and Applications
From Everand
Alternating Decision Tree: Fundamentals and Applications
Fouad Sabry
No ratings yet
TAFJ-DB2 Install
No ratings yet
TAFJ-DB2 Install
17 pages
Bharti_DataAnalyst
No ratings yet
Bharti_DataAnalyst
2 pages
ToadforOracle 13.2 ReleaseNotes
No ratings yet
ToadforOracle 13.2 ReleaseNotes
35 pages
L3 Unix Handling Ordinary Files
No ratings yet
L3 Unix Handling Ordinary Files
10 pages
110 SQL Query Interview Questions and Practice Exercises for Experienced and Fre
No ratings yet
110 SQL Query Interview Questions and Practice Exercises for Experienced and Fre
40 pages
Ks Manual
No ratings yet
Ks Manual
299 pages
98 364 Questions
No ratings yet
98 364 Questions
6 pages
Unit III-Hashing
100% (1)
Unit III-Hashing
135 pages
Various MDX Cheat Sheet
No ratings yet
Various MDX Cheat Sheet
2 pages
Abb SLC 220 Controller Manual JCTDKNQH
No ratings yet
Abb SLC 220 Controller Manual JCTDKNQH
2 pages
Mining Frequent Patterns Without Candidate Generation
No ratings yet
Mining Frequent Patterns Without Candidate Generation
12 pages
How To Backup & Restore Your Linked Helper Data - Linked Helper
No ratings yet
How To Backup & Restore Your Linked Helper Data - Linked Helper
1 page
Tableau Whitepaper LOD
50% (2)
Tableau Whitepaper LOD
26 pages
Building The Data WareHouse - Chapter 03
No ratings yet
Building The Data WareHouse - Chapter 03
95 pages
ETL Testing Topics1
No ratings yet
ETL Testing Topics1
46 pages
Introduction To Stream Concepts - Stream Data Model and Architecture
No ratings yet
Introduction To Stream Concepts - Stream Data Model and Architecture
8 pages
ACID Properties
No ratings yet
ACID Properties
3 pages
Lexical Parameter Is Used To Replace A Specific
No ratings yet
Lexical Parameter Is Used To Replace A Specific
6 pages
Power Query Documentation
No ratings yet
Power Query Documentation
840 pages
WhitePaper LCM Customize Charges and Taxes
No ratings yet
WhitePaper LCM Customize Charges and Taxes
14 pages
Change Data Capture Error 14234
No ratings yet
Change Data Capture Error 14234
2 pages
DBMS-UNIT-6 R16 (1)
No ratings yet
DBMS-UNIT-6 R16 (1)
16 pages
CSQA-Steps For Generating Pareto Charts
No ratings yet
CSQA-Steps For Generating Pareto Charts
2 pages
CLOUD COMPUTING UNIT 3
No ratings yet
CLOUD COMPUTING UNIT 3
10 pages
NED COMPUTER PP II
100% (1)
NED COMPUTER PP II
7 pages
Rest of the Ip Project
No ratings yet
Rest of the Ip Project
26 pages
Dbms Practical File
0% (1)
Dbms Practical File
15 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Lec 7 Query Processing, Optimization & Indexing

Uploaded by

Lec 7 Query Processing, Optimization & Indexing

Uploaded by

Query Processing, Optimization &

⚫ MYSQL EXPLAIN Select

Join Type Best For Common in execution

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.