Query Processing

Query processing involves translating SQL queries into low-level instructions, optimizing them for efficiency, and executing them to retrieve data from databases. The goal is to minimize costs associated with disk accesses and read/write operations while ensuring accurate results. Key steps include parsing, translation, optimization, execution planning, and evaluation of query plans to determine the most efficient method for data retrieval.

Uploaded by

thakur.sahil18042020

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

31 views8 pages

Query Processing

Uploaded by

thakur.sahil18042020

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 8

\Query Processing

Query Processing would mean the entire process or activity which involves query
translation into low level instructions, query optimization to save resources, cost
estimation or evaluation of query, and extraction of data from the database.
Goal: To find an efficient Query Execution Plan for a given SQL query which would
minimize the cost considerably, especially time.
Cost Factors: Disk accesses [which typically consumes time], read/write
operations [which typically needs resources such as memory/RAM].
The major steps involved in query processing are depicted in the figure below;

Figure 1 - Steps in Database Query Processing

Let us discuss the whole process with an example. Let us consider the following
two relations as the example tables for our discussion;

Employee(Eno, Ename, Phone)

Proj_Assigned(Eno, Proj_No, Role, DOP)
where,
Eno is Employee number,
Ename is Employee name,
Proj_No is Project Number in which an employee is assigned,
Role is the role of an employee in a project,
DOP is duration of the project in months.
With this information, let us write a query to find the list of all employees who are
working in a project which is more than 10 months old.
SELECT Ename
FROM Employee, Proj_Assigned
WHERE Employee.Eno = Proj_Assigned.Eno AND DOP > 10;
Input:
A query written in SQL is given as input to the query processor. For our case, let
us consider the SQL query written above.
Step 1: Parsing
In this step, the parser of the query processor module checks the syntax of the
query, the user’s privileges to execute the query, the table names and attribute
names, etc. The correct table names, attribute names and the privilege of the
users can be taken from the system catalog (data dictionary).
Step 2: Translation
If we have written a valid query, then it is converted from high level language
SQL to low level instruction in Relational Algebra.
For example, our SQL query can be converted into a Relational Algebra
equivalent as follows;
πEname(σDOP>10 Λ Employee.Eno=Proj_Assigned.Eno(Employee X Prof_Assigned))
Step 3: Optimizer
Optimizer uses the statistical data stored as part of data dictionary. The
statistical data are information about the size of the table, the length of records,
the indexes created on the table, etc. Optimizer also checks for the conditions
and conditional attributes which are parts of the query.
Step 4: Execution Plan
A query can be expressed in many ways. The query processor module, at this
stage, using the information collected in step 3 to find different relational algebra
expressions that are equivalent and return the result of the one which we have
written already.
For our example, the query written in Relational algebra can also be written as
the one given below;

πEname(Employee ⋈ Eno (σDOP>10 (Prof_Assigned)))

So far, we have got two execution plans. Only condition is that both plans should
give the same result.
Step 5: Evaluation
Though we got many execution plans constructed through statistical data,
though they return same result (obvious), they differ in terms of Time
consumption to execute the query, or the Space required executing the query.
Hence, it is mandatory choose one plan which obviously consumes less cost.
At this stage, we choose one execution plan of the several we have developed.
This Execution plan accesses data from the database to give the final result.
In our example, the second plan may be good. In the first plan, we join two
relations (costly operation) then apply the condition (conditions are considered as
filters) on the joined relation. This consumes more time as well as space.
In the second plan, we filter one of the tables (Proj_Assigned) and the result is
joined with the Employee table. This join may need to compare less number of
records. Hence, the second plan is the best (with the information known, not
always).
Output:
The final result is shown to the user.

The overall information discussed above are depicted in Figure 2 below;

Figure 2 - Query Processing [Note: in Step 4, NJ means Natural Join]

○ Indexing is used to optimize the performance of a database by minimizing the number

of disk accesses required when a query is processed.

○ The index is a type of data structure. It is used to locate and access the data in a
database table quickly.

Index structure:
Indexes can be created using some database columns.

○ The first column of the database is the search key that contains a copy of the primary
key or candidate key of the table. The values of the primary key are stored in sorted
order so that the corresponding data can be accessed easily.

○ The second column of the database is the data reference. It contains a set of pointers
holding the address of the disk block where the value of the particular key can be
found.

8. Query Processing
Goals: Understand the basic concepts underlying the
steps in
query processing and optimization and estimating query
processing
cost; apply query optimization techniques;
Contents:
_ Overview
_ Catalog Information for Cost Estimation
_ Measures of Query Cost
_ Selection
_ Join Operations
_ Other Operations
_ Evaluation and Transformation of Expressions
Query Processing & Optimization
Task: Find an efficient physical query plan (aka execution
plan)
for an SQL query
Goal: Minimize the evaluation time for the query, i.e.,
compute
query result as fast as possible
Cost Factors: Disk accesses, read/write operations, [I/O,
page
transfer] (CPU time is typically ignored)
Catalog Information for Cost Estimation
Information about relations and attributes:
_ NR: number of tuples in the relation R.
_ BR: number of blocks that contain tuples of the relation
R.
_ SR: size of a tuple of R.
_ FR: blocking factor; number of tuples from R that _t into
one
block (FR = dNR=BRe)
_ V(A; Measures of Query Cost
_ There are many possible ways to estimate cost, e.g.,
based on
disk accesses, CPU time, or communication overhead.
_ Disk access is the predominant cost (in terms of time);
relatively easy to estimate; therefore, number of block
transfers
from/to disk is typically used as measure.
{ Simplifying assumption: each block transfer has the
same
cost.
_ Cost of algorithm (e.g., for join or selection) depends on
database buffer size; more memory for DB buffer reduces
disk
accesses. Thus DB buffer size is a parameter for
estimating
cost.
_ We refer to the cost estimate of algorithm S as cost(S).
We
do not consider cost of writing output to disk.
Selection Operation
_A=a(R) where a is a constant value, A an attribute of R
_ File Scan { search algorithms that locate and retrieve
records
that satisfy a selection condition
_ S1 { Linear search
cost(S1)= BRR): number of distinct values for attribute A
in R.
_ SC(A; R): selectivity of attribute A
_ average number of tuples of R that satisfy an
equality condition on A.
SC(A; R) = NR=V(A; R).
Information about indexes:
_ HTI: number of levels in index I (B+-tree).
_ LBI: number of blocks occupied by leaf nodes in index I
(first-level blocks).
_ Vali: number of distinct values for the search key.

Understanding Query Processing Steps
No ratings yet
Understanding Query Processing Steps
5 pages
Query Processing and Optimization
No ratings yet
Query Processing and Optimization
7 pages
Database Query Optimization Guide
No ratings yet
Database Query Optimization Guide
127 pages
Measures of Query Cost
No ratings yet
Measures of Query Cost
15 pages
Measures of Query Cost
No ratings yet
Measures of Query Cost
15 pages
Query Processing in Database Systems
0% (1)
Query Processing in Database Systems
27 pages
Query Processing for DBMS Students
No ratings yet
Query Processing for DBMS Students
13 pages
Ch-2 Query Processing and Optimization
No ratings yet
Ch-2 Query Processing and Optimization
21 pages
Ivunit Query Processing
No ratings yet
Ivunit Query Processing
12 pages
Query Processing Concepts
No ratings yet
Query Processing Concepts
99 pages
Final DBMS Unit 7
No ratings yet
Final DBMS Unit 7
48 pages
Query Processing and Optimization
No ratings yet
Query Processing and Optimization
33 pages
Chapter 5
No ratings yet
Chapter 5
45 pages
Query Processing in DBMS Explained
No ratings yet
Query Processing in DBMS Explained
20 pages
Query Processing and Optimization
No ratings yet
Query Processing and Optimization
31 pages
Database Query Processing & Security
No ratings yet
Database Query Processing & Security
39 pages
Query Optimization Techniques
100% (1)
Query Optimization Techniques
38 pages
Ad Database All Slide
No ratings yet
Ad Database All Slide
49 pages
Query Evaluation
No ratings yet
Query Evaluation
51 pages
29-Query Optimization-04-10-2024
No ratings yet
29-Query Optimization-04-10-2024
35 pages
CO3-Notes-Query Processing and Optimization
No ratings yet
CO3-Notes-Query Processing and Optimization
5 pages
3 Query Processing and Optimization-1
No ratings yet
3 Query Processing and Optimization-1
18 pages
Presentation9 - Query Processing and Query Optimization in DBMS
No ratings yet
Presentation9 - Query Processing and Query Optimization in DBMS
36 pages
Ch1 Query Processing
No ratings yet
Ch1 Query Processing
49 pages
Introduction To Database Management Systems CS470
No ratings yet
Introduction To Database Management Systems CS470
11 pages
DBMS
No ratings yet
DBMS
24 pages
Ch-2 Query Processing and Optimization
No ratings yet
Ch-2 Query Processing and Optimization
26 pages
Dbms Seminar
No ratings yet
Dbms Seminar
24 pages
Query Processing and Optimization Guide
No ratings yet
Query Processing and Optimization Guide
28 pages
Chapter 2 Adb
No ratings yet
Chapter 2 Adb
21 pages
Rdbms Assignment
No ratings yet
Rdbms Assignment
12 pages
Query Optimization
No ratings yet
Query Optimization
103 pages
CH 02
No ratings yet
CH 02
127 pages
Database Query Optimization Guide
No ratings yet
Database Query Optimization Guide
38 pages
Advancedchapter 2 2013
No ratings yet
Advancedchapter 2 2013
16 pages
ADBMS Chapter One
No ratings yet
ADBMS Chapter One
21 pages
CO3 Session 11
No ratings yet
CO3 Session 11
27 pages
Query Processing and Optimization Guide
No ratings yet
Query Processing and Optimization Guide
44 pages
Database Query Optimization Guide
100% (1)
Database Query Optimization Guide
43 pages
Co3 Session 23
No ratings yet
Co3 Session 23
27 pages
Chapter 2 - Query Processing and Optimization
100% (3)
Chapter 2 - Query Processing and Optimization
28 pages
ADBMS CH 2
No ratings yet
ADBMS CH 2
35 pages
Query Optimization Part1
No ratings yet
Query Optimization Part1
52 pages
Chapter 2 Query Processing and Optimization (Autosaved)
No ratings yet
Chapter 2 Query Processing and Optimization (Autosaved)
35 pages
Advanced Database Systems: Chapter 3:query Processing and Evaluation
100% (1)
Advanced Database Systems: Chapter 3:query Processing and Evaluation
36 pages
Chapter 4 Query Optimization
100% (2)
Chapter 4 Query Optimization
35 pages
Basic Concepts of Query Processing
No ratings yet
Basic Concepts of Query Processing
10 pages
Query Proc Notes
No ratings yet
Query Proc Notes
10 pages
Chapter 1 Query Processing
No ratings yet
Chapter 1 Query Processing
58 pages
Query Processing and Optimization Steps
No ratings yet
Query Processing and Optimization Steps
40 pages
ADBChapter 1
No ratings yet
ADBChapter 1
32 pages
Chapter 1 Query Processing and Optimization
No ratings yet
Chapter 1 Query Processing and Optimization
40 pages
Query Processing and Optimization Steps
No ratings yet
Query Processing and Optimization Steps
44 pages
DBMS Unit 4
No ratings yet
DBMS Unit 4
9 pages
Query Optimization Techniques by Warih Maharani
No ratings yet
Query Optimization Techniques by Warih Maharani
39 pages
Itm661 Lecture03 Part2 2015
No ratings yet
Itm661 Lecture03 Part2 2015
47 pages
Heuristic Query Optimization Steps
No ratings yet
Heuristic Query Optimization Steps
43 pages
Query Processing and Optimization
No ratings yet
Query Processing and Optimization
23 pages
IT2305 Database Systems 1 PDF
No ratings yet
IT2305 Database Systems 1 PDF
498 pages
National University of Modern Languages - NUML: (Department of Computer Science)
No ratings yet
National University of Modern Languages - NUML: (Department of Computer Science)
56 pages
SQL Solution Full
No ratings yet
SQL Solution Full
17 pages
BCS403 Database Management Exam Paper
No ratings yet
BCS403 Database Management Exam Paper
102 pages
DBMS Exam Solutions for CSEDU Students
0% (1)
DBMS Exam Solutions for CSEDU Students
63 pages
Module - 1
No ratings yet
Module - 1
54 pages
DBMS Question Bank: Concepts & Queries
No ratings yet
DBMS Question Bank: Concepts & Queries
6 pages
Relational Algebra
No ratings yet
Relational Algebra
39 pages
Distributed Query Processing Using Different Semijoin Operations
No ratings yet
Distributed Query Processing Using Different Semijoin Operations
26 pages
KMBNIT03 - Unit 2
No ratings yet
KMBNIT03 - Unit 2
12 pages
QB Unit - 2 Answers
No ratings yet
QB Unit - 2 Answers
20 pages
23mca551 DBMS
No ratings yet
23mca551 DBMS
3 pages
Database Management Systems: Relational Algebra
No ratings yet
Database Management Systems: Relational Algebra
28 pages
Question Bank Unit-1
No ratings yet
Question Bank Unit-1
9 pages
Understanding Relational Algebra Operations
No ratings yet
Understanding Relational Algebra Operations
150 pages
DBMS Multiple Choice Questions Guide
No ratings yet
DBMS Multiple Choice Questions Guide
13 pages
Database System Solutions
No ratings yet
Database System Solutions
67 pages
Relational Algebra Basics and Schema
No ratings yet
Relational Algebra Basics and Schema
10 pages
Relational Algebra and Relational Calculus Chapter#5
No ratings yet
Relational Algebra and Relational Calculus Chapter#5
42 pages
Vtu 5TH Sem Cse DBMS Notes
67% (3)
Vtu 5TH Sem Cse DBMS Notes
35 pages
Assignment of Relational Algebra With Solutions
No ratings yet
Assignment of Relational Algebra With Solutions
4 pages
Unit - Iii
No ratings yet
Unit - Iii
39 pages
Dbms Relational Algebra
No ratings yet
Dbms Relational Algebra
16 pages
Relational Model Solutions Guide
No ratings yet
Relational Model Solutions Guide
61 pages
Week 5-6: Relational Algebra (Part III) and Relational Calculus
No ratings yet
Week 5-6: Relational Algebra (Part III) and Relational Calculus
22 pages
DBMS
No ratings yet
DBMS
251 pages
Database Concepts
No ratings yet
Database Concepts
22 pages
DBMS UNIT II Tuple - Relational - Calculus Ant
No ratings yet
DBMS UNIT II Tuple - Relational - Calculus Ant
14 pages
DDM Unit 2
No ratings yet
DDM Unit 2
23 pages

Query Processing

Uploaded by

Query Processing

Uploaded by

\Query Processing

Figure 1 - Steps in Database Query Processing

Employee(Eno, Ename, Phone)

πEname(Employee ⋈ Eno (σDOP>10 (Prof_Assigned)))

The overall information discussed above are depicted in Figure 2 below;

○ Indexing is used to optimize the performance of a database by minimizing the number

You might also like