Database Indexing Techniques Guide

There are three main types of indexing in databases:
1. Primary indexing is done on the primary key field and orders records by this field.
2. Secondary indexing is done on other fields and may index unique candidate keys or duplicate non-key fields. It requires pointers to all records with the same search key value.
3. Clustering indexing orders records by a non-key field, rather than the primary key.

Indexing structures can be dense, with an index record for every search key value, or sparse, with index records only for some keys. Multilevel indexing breaks large indexes into smaller nested indexes for more efficient searching.

Uploaded by

Vijay Kumar


We know that data is stored in the form of records. Every record has a key field, which allows it
to be identified uniquely.

Indexing is a data structure technique for efficiently retrieving records from database files based
on the attributes on which the index has been built. Indexing in database systems is similar to
the index at the back of a book.

An index is defined by its indexing attributes. Indexing can be of the following types:

Primary Index: A primary index is defined on an ordered data file. The data file is
ordered on a key field, which is generally the primary key of the relation.

Secondary Index: A secondary index may be built on a field that is a candidate key
and has a unique value in every record, or on a non-key field with duplicate values.

Clustering Index: A clustering index is defined on an ordered data file. The data file is
ordered on a non-key field.

Ordered indexing is of two types: dense index and sparse index.

Dense Index
In a dense index, there is an index record for every search key value in the database. This makes
searching faster but requires more space to store the index records themselves. Each index record
contains a search key value and a pointer to the actual record on the disk.
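
To make the idea concrete, here is a minimal Python sketch of a dense index. It assumes a small in-memory list standing in for the ordered data file; the sample records and the names `build_dense_index` and `dense_lookup` are made up for illustration and are not part of the original notes.

```python
# Hypothetical data file: (search_key, balance) records, sorted by search key.
records = [
    ("Brighton", 750),
    ("Downtown", 500),
    ("Downtown", 600),
    ("Mianus", 700),
    ("Perryridge", 400),
    ("Perryridge", 900),
    ("Redwood", 700),
]


def build_dense_index(records):
    """Create one index entry per distinct search-key value.

    Each entry maps the key to the position of the first record with that key.
    """
    index = {}
    for position, (key, _) in enumerate(records):
        index.setdefault(key, position)
    return index


def dense_lookup(index, records, key):
    """Return every record with the given key, or [] if the key is absent."""
    start = index.get(key)
    if start is None:
        return []
    matches = []
    # The file is ordered, so equal keys are adjacent: scan until the key changes.
    for record in records[start:]:
        if record[0] != key:
            break
        matches.append(record)
    return matches


dense_index = build_dense_index(records)
print(dense_lookup(dense_index, records, "Perryridge"))
```

Because every key value has an entry, a missing key is detected from the index alone, without touching the data file.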

Sparse Index
In a sparse index, index records are not created for every search key. Each index record contains
a search key and a pointer to the data on the disk. To search for a record, we first follow the
index record with the largest search key not exceeding the one we want and reach that location in
the data file. If the data we are looking for is not at that location, the system searches
sequentially from there until the desired data is found.
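
The search procedure just described can be sketched as follows. This is only an illustration under simplifying assumptions: the data file is an in-memory sorted list with unique keys, an index entry is kept for every third record, and the names `build_sparse_index` and `sparse_lookup` are hypothetical.

```python
from bisect import bisect_right

# Hypothetical sorted data file with unique integer keys 10, 20, ..., 90.
records = [(key, f"row{key}") for key in range(10, 100, 10)]


def build_sparse_index(records, step):
    """Keep an index entry (key, position) for every `step`-th record only."""
    return [(records[i][0], i) for i in range(0, len(records), step)]


def sparse_lookup(index, records, key):
    """Find a record by key using the sparse index, or return None."""
    index_keys = [k for k, _ in index]
    # Last index entry whose key is <= the key we are looking for.
    slot = bisect_right(index_keys, key) - 1
    if slot < 0:
        return None  # key is smaller than every indexed key
    position = index[slot][1]
    # Sequential scan from the indexed position until we pass the key.
    while position < len(records) and records[position][0] <= key:
        if records[position][0] == key:
            return records[position]
        position += 1
    return None


sparse_index = build_sparse_index(records, step=3)
print(sparse_index)
print(sparse_lookup(sparse_index, records, 50))
```

Unlike the dense case, an unsuccessful search still has to scan part of the data file before it can report that the key is absent.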

Multilevel Index
Index records comprise search-key values and data pointers. The index is stored on the disk along
with the actual database files. As the size of the database grows, so does the size of the
indices. It is highly desirable to keep the index records in main memory to speed up search
operations. If a single-level index is used, a large index cannot be kept in memory, which leads
to multiple disk accesses.

A multilevel index breaks the index down into several smaller indices so that the outermost level
is small enough to fit in a single disk block, which can easily be held in main memory.
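
A two-level version of this idea might look like the sketch below. It assumes unique keys, models disk blocks as Python lists, and uses made-up names (`build_two_level`, `two_level_lookup`) and an arbitrary block size of 10 entries.

```python
from bisect import bisect_right


def chunk(items, size):
    """Split a list into consecutive blocks of at most `size` items."""
    return [items[i:i + size] for i in range(0, len(items), size)]


def build_two_level(sorted_keys, entries_per_block):
    """Build an inner index split into blocks, plus a small outer index."""
    # Inner index: one (key, record_position) entry per record.
    inner_entries = [(key, pos) for pos, key in enumerate(sorted_keys)]
    inner_blocks = chunk(inner_entries, entries_per_block)
    # Outer index: the first key of each inner block (sparse index on the index).
    outer = [block[0][0] for block in inner_blocks]
    return outer, inner_blocks


def two_level_lookup(outer, inner_blocks, key):
    """Return the record position for `key`, or None if it is not indexed."""
    block_no = bisect_right(outer, key) - 1  # binary search on the outer index
    if block_no < 0:
        return None
    for entry_key, pos in inner_blocks[block_no]:  # scan a single inner block
        if entry_key == key:
            return pos
    return None


keys = list(range(0, 200, 2))  # 100 hypothetical even keys
outer, inner_blocks = build_two_level(keys, entries_per_block=10)
print(len(outer), two_level_lookup(outer, inner_blocks, 42))
```

Only the outer list needs to stay in memory; each lookup then touches exactly one inner block.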
Dense and Sparse Indices
1. There are two types of ordered indices:
   o Dense index:
     - An index record appears for every search key value in the file.
     - This record contains the search key value and a pointer to the actual record.
   o Sparse index:
     - Index records are created only for some of the records.
     - To locate a record, we find the index record with the largest search key
       value less than or equal to the search key value we are looking for.
     - We start at the record pointed to by that index record, and proceed along the
       pointers in the file (that is, sequentially) until we find the desired record.

2. Figures 11.2 and 11.3 show dense and sparse indices for the deposit file.

   Figure 11.2: Dense index.

   Figure 11.3: Sparse index.

3. Notice how we would find records for the Perryridge branch using both methods. (Do it!)
4. Dense indices are faster in general, but sparse indices require less space and
   impose less maintenance for insertions and deletions. (Why?)
5. A good compromise is a sparse index with one entry per block. Why is this good?
   o The biggest cost is in bringing a block into main memory.
   o We are guaranteed to have the correct block with this method, unless the
     record is on an overflow block (actually there could be several blocks).
   o The index size is still small.
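
The one-entry-per-block compromise can be illustrated with a small sketch that also counts data-block reads. It assumes unique keys, no overflow blocks, and an arbitrary block size of 4 records; `build_blocks`, `build_block_index` and `lookup` are hypothetical names.

```python
from bisect import bisect_right

RECORDS_PER_BLOCK = 4  # illustrative block size, not taken from the notes


def build_blocks(sorted_keys, per_block=RECORDS_PER_BLOCK):
    """Pack the sorted keys into fixed-size data blocks."""
    return [sorted_keys[i:i + per_block] for i in range(0, len(sorted_keys), per_block)]


def build_block_index(blocks):
    """Sparse index with exactly one entry per block: the block's first key."""
    return [block[0] for block in blocks]


def lookup(block_index, blocks, key):
    """Return (found, data_blocks_read) for a search on `key`."""
    block_no = bisect_right(block_index, key) - 1
    if block_no < 0:
        return False, 0  # key precedes the first block, nothing to read
    # Without overflow blocks, the key can only live in this one block.
    return key in blocks[block_no], 1


keys = list(range(1, 41))  # 40 hypothetical records
blocks = build_blocks(keys)
block_index = build_block_index(blocks)
print(len(block_index), lookup(block_index, blocks, 23))
```

Here the index holds 10 entries instead of the 40 a dense index would need, yet every search still reads at most one data block.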

Multi-Level Indices
1. Even with a sparse index, the index may still grow too large. For 100,000
   records, 10 per block, at one index record per block, that's 10,000 index
   records! Even if we can fit 100 index records per block, this is 100 blocks.
2. If the index is too large to be kept in main memory, a search results in several
   disk reads.
   o If there are no overflow blocks in the index, we can use binary search.
     This will read as many as $1 + \lfloor \log_2 b \rfloor$ blocks for an index of
     $b$ blocks (as many as 7 for our 100 blocks).
   o If the index has overflow blocks, then sequential search is typically used,
     reading all $b$ index blocks.

3. Solution: construct a sparse index on the index (Figure 11.4).

   Figure 11.4: Two-level sparse index.

4. Use binary search on the outer index. Scan the index block found until the correct
   index record is found. Use that index record as before: scan the block it points to
   for the desired record.
5. For very large files, additional levels of indexing may be required.
6. Indices must be updated at all levels when insertions or deletions require it.
7. Frequently, each level of index corresponds to a unit of physical storage (e.g.
   indices at the level of track, cylinder and disk).
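
The arithmetic behind items 1 to 3 can be checked with a few lines of Python. The input figures come from the text above; the variable names are made up, and the last line assumes the single outer index block is already in main memory.

```python
import math

records = 100_000
records_per_block = 10
index_entries_per_block = 100

# One sparse index entry per data block.
data_blocks = math.ceil(records / records_per_block)
index_entries = data_blocks
index_blocks = math.ceil(index_entries / index_entries_per_block)

# Worst-case block reads when searching the single-level index on disk.
binary_search_reads = 1 + math.floor(math.log2(index_blocks))
sequential_reads = index_blocks  # needed if the index has overflow blocks

# Second level: a sparse index with one entry per inner index block.
outer_entries = index_blocks
outer_blocks = math.ceil(outer_entries / index_entries_per_block)

# With the outer block held in memory: one inner index block plus one data block.
two_level_reads = 1 + 1

print(data_blocks, index_blocks, binary_search_reads, outer_blocks, two_level_reads)
```

Under these assumptions the outer index fits in one block, so a lookup costs two disk reads instead of up to seven for the index alone.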
Index Update

Regardless of what form of index is used, every index must be updated whenever a record is
either inserted into or deleted from the file.
1. Deletion:
   o Find (look up) the record.
   o If it is the last record with a particular search key value, delete that search
     key value from the index.
   o For dense indices, this is like deleting a record in a file.
   o For sparse indices, delete a key value by replacing its entry in the index with
     the next search key value. If that value already has an index entry, simply
     delete the entry.
2. Insertion:
   o Find the place to insert.
   o Dense index: insert the search key value if not present.
   o Sparse index: no change unless a new block is created. (In this case, the
     first search key value appearing in the new block is inserted into the index.)
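
The dense-index rules above can be sketched as a small class. This is a simplification, not the layout from the notes: instead of an ordered file with one pointer per key, it keeps a list of record ids per key so that "last record with this key" is easy to detect. The class name `DenseIndexedFile` and its methods are hypothetical, and the sparse case is not covered.

```python
class DenseIndexedFile:
    """Toy file with a dense index that is maintained on insert and delete."""

    def __init__(self):
        self.records = {}   # record_id -> (search_key, payload)
        self.index = {}     # search_key -> list of record_ids with that key
        self._next_id = 0

    def insert(self, key, payload):
        record_id = self._next_id
        self._next_id += 1
        self.records[record_id] = (key, payload)
        # Dense index: a new entry is created only if the key is not present yet.
        self.index.setdefault(key, []).append(record_id)
        return record_id

    def delete(self, record_id):
        key, _ = self.records.pop(record_id)
        ids = self.index[key]
        ids.remove(record_id)
        # If this was the last record with the key, drop the key from the index.
        if not ids:
            del self.index[key]


f = DenseIndexedFile()
a = f.insert("Downtown", 500)
b = f.insert("Downtown", 600)
f.delete(a)
print("Downtown" in f.index)   # one Downtown record remains
f.delete(b)
print("Downtown" in f.index)   # key removed with its last record
```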

Secondary Indices
1. If the search key of a secondary index is not a candidate key, it is not enough to point to
   just the first record with each search-key value, because the remaining records with the
   same search-key value could be anywhere in the file. Therefore, a secondary index must
   contain pointers to all the records.

   Figure 11.5: Sparse secondary index on cname.

2. We can use an extra level of indirection to implement secondary indices on search keys
   that are not candidate keys. A pointer does not point directly to the file but to a bucket
   that contains pointers to the file.
   o See Figure 11.5 on secondary key cname.
   o To perform a lookup on Peterson, we must read all three records pointed to by
     entries in bucket 2.
   o Only one entry points to a Peterson record, but three records need to be read.
   o As the file is not ordered physically by cname, this may take 3 block accesses.
3. Secondary indices must be dense, with an index entry for every search-key value, and a
   pointer to every record in the file.
4. Secondary indices improve the performance of queries on non-primary keys.
5. They also impose serious overhead on database modification: whenever a file is updated,
   every index must be updated.
6. The designer must decide whether to use secondary indices or not.
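
The bucket idea from item 2, in the dense form required by item 3, might be sketched like this. The sample records are invented, the file is assumed to be ordered by branch name rather than by customer name, and `build_secondary_index` and `secondary_lookup` are hypothetical names. Note that this differs from Figure 11.5, where one bucket covers several search-key values.

```python
from collections import defaultdict

# Hypothetical file of (branch, cname, balance), ordered by branch, not by cname.
records = [
    ("Brighton", "Green", 750),
    ("Downtown", "Johnson", 500),
    ("Downtown", "Peterson", 600),
    ("Mianus", "Peterson", 700),
    ("Perryridge", "Hayes", 400),
    ("Perryridge", "Williams", 900),
    ("Redwood", "Lyle", 700),
]


def build_secondary_index(records, field):
    """Map each value of `field` to a bucket of positions of matching records."""
    buckets = defaultdict(list)
    for position, record in enumerate(records):
        buckets[record[field]].append(position)
    return dict(buckets)


def secondary_lookup(index, records, value):
    """Follow the bucket for `value` and fetch every record it points to."""
    return [records[position] for position in index.get(value, [])]


cname_index = build_secondary_index(records, field=1)
print(cname_index["Peterson"])
print(secondary_lookup(cname_index, records, "Peterson"))
```

Every record position appears in exactly one bucket, which is what makes the index dense; the matching records are scattered through the file, so each pointer may cost a separate block access.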
