LecturePPT Chapter 13 HashTable

Hash tables provide an efficient way to store and access data elements in a dictionary. They work by using a hash function to map each key to an index in a hash table. Collisions, where different keys hash to the same index, are resolved using open addressing or chaining. Open addressing handles collisions by probing to the next available empty slot, while chaining stores multiple entries in a linked list at each index. Common operations like search, insert and delete can be performed in constant average time on hash tables.

Uploaded by

一鸿

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

99 views

LecturePPT Chapter 13 HashTable

Uploaded by

一鸿

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 23

Hash Tables

Chapter 13
Outline
●
Introduction
– Dictionaries
●
Hash Table Structure
●
Hash Functions
– Building hash functions
●
Linear open addressing
– Operations on linear open addressed hash tables
– Performance analysis
– Other collision resolution techniques with open addressing
●
Chaining
– Operations on chained hash tables
– Performance analysis
●
Applications
Introduction
● Dictionary is a collection of data elements
uniquely identified by a field called key.
● A dictionary supports the operations of search,
insert and delete. The ADT of a dictionary is
defined as a set of elements with distinct keys
supporting the operations of search, insert, delete
and create (which creates an empty dictionary).
● A dictionary supports both sequential and
random access
● Hash tables are ideal data structures for
dictionaries
Hash Table Structure
● A hash function H(X) is a mathematical function
which given a key X of the dictionary D, maps it to a
position P in a storage table termed hash table
● The process of mapping the keys to their respective
positions in the hash table is called hashing
● If the hash table is implemented using a sequential
data structure, for example arrays, then the hash
function H(X) may be so chosen to yield a value that
corresponds to the index of the array.
● In such a case the hash function is a mere mapping of
the keys to the array indices.
Example 1
● A set of distinct keys { AB12, VP99, RK32,
CG45, KL78, OW31, ST65, EX44 } to be
represented as a hash table
● H(XYmn) = ord(X) where X, Y are the
alphabetical characters, m, n are the
numerical characters of the key and ord(X) is
the ordinal number of the alphabet X.
Example 1
Key XYmm H(XYmm) Position in hash table
AB12 ord(A) 1
VP99 ord(V) 22
RK32 ord(R) 18
CG45 ord(C) 3
KL78 ord(K) 11
OW31 ord(O) 15
ST65 ord(S) 19
EX44 ord(E) 5
1 AB12 ……….
In hash table 2 ….
3 CG45
4 ….
5 EX44 ………
…. …..
11 KL78
…
15 OW31 ………….
…
18 RK32
19 ST65 ……………
….
22 VP99 ….
… … …
Hash Functions
● The choice of the hash function plays a significant
role in the structure and performance of the hash table.
It is therefore essential that a hash function satisfies
the following characteristics:
– (i) easy and quick to compute
– (ii) even distribution of keys across the hash table.
In other words, a hash function must minimize
collisions.
Building hash functions
● Folding
– The key is partition into 2 or 3 or more parts. Each of parts are
combine using basic math.
– e.g 719532 → 71 | 95 | 32 → 71+95+32 = 198
– Yield – 98. Use 98 as index in the hash table
● Truncation
– Selective digits of key are extracted
– Lets say a hash table has 100 location, we choose to
select pos 3 and 6 to determine the index score.
– Therefore key 719532 – yield index 92
● Modular Arithmetic
● Let k be the numerical key or the numerical
equivalent if it is an alphabetical key. The hash
function is given by
H(k) = k mod L
● Let hash table size is 111. Example key = 145682
H(K) = 145682 mod 111
= 50 the location in the hash table
Linear Open Addressing
● Let HT[0:L-1] be the hash table
● The L locations of the hash table are termed as
buckets. Every bucket provides accommodation for
the data elements
● To accommodate keys which map to the same bucket,
partition buckets into what are called slots to
accommodate keys.
● Now what happens if a key is unable to find a slot in
the bucket? In other words, if the bucket is full, then
where do we find place for the key?
Hash table structure
Hash Table
HT <---------------------------s slot------------------------>
<--------L Bucket-------->

[0] [1] …... [s-1]

[0]
[1]
[2]
:
:
:

[L-1]
Overflow case
● In this case it is said to be overflow
● All collisions need not result in overflows. But in the case
of a hash table with single slot buckets, collisions mean
overflows.
● The bucket to which the key is mapped by the hash function is
known as the home bucket.
● To tackle overflows we move further down, beginning from the
home bucket and look for the closest slot that is empty and
place the key in it. Such a method of handling overflows is
known as Linear probing or Linear open addressing or
closed hashing.
Example
● L={45,98,12,55,46,89,65,88,36,21}
● H(X) = X mod 11
Key x 45 98 12 55 46 89 65 88 36 21
H(X) 1 10 1 0 2 1 10 0 3 10
[0] 55 88

[1] 45 12 89

[2] 46
[3] 36
[4]
[5]
[6]
[7]
[8]
[9]
[10]
98 65 21
Operations On Linear Open Addressed Hash
Tables
●
Searching
●
if the searched key is available in the home bucket then
the search is done. The time complexity in such a case
is O(1).
●
if there had been overflows while inserting the key, then
a sequential search has to be called for which searches
through each slot of the buckets following the home
bucket, until either.
●
(i) the key is found or (ii) an empty slot is encountered
in which case the search terminates or (iii) the search
path has curled back to the home bucket
●
In the case of (i) the search is said to be successful. In
the case of (ii) and (iii) it is said to be unsuccessful.
Operations On Linear Open Addressed Hash
Tables
● Insertion
– 1) Compute the H(X)
– 2) If collision then find the empty slot
● Deletion
– It is generally recommended that deletions in a hash
table are avoided as much as possible due to their
clumsy implementation.
Performance analysis
● On an average the performance of the hash table is much
more efficient than that of the linear lists.
● It has been shown that the average case performance of
a linear open addressed hash table for successful and
unsuccessful search where U n and S n are the number
of buckets examined on an average during an
unsuccessful and successful search respectively, is given
by where α = n/b; b=bucket
1  1  
Un ~  1 2 α is loading factor smaller
2 (1   )  α is better performance
1 1 
Sn ~  1  
2 (1   ) 
Other collision resolution techniques with
open addressing
● Rehashing (Problem 13.6)
– Major drawback of open addressing is clustering where
leads to long sequence wih gaps in between. To overcome
this a second H'(X) is needed. If the slot is not empty
another function is called.
– hi =(H(X) +i.H'(X) mod b, i=1,2,3....
● Quadratic probing (Problem 13.5)
– Another method to reduce clustering by probe bucket at
address h+1, h+2, h+9 …. etc (h+i2)
● Random probing
– Use random number to probe the next bucket
Chaining
● Keep all key that are mapped to the same bucket
chained to it.
● In other words, every bucket is maintained as a
singly linked list with synonyms represented as
nodes
● The buckets continue to be represented as a
sequential data structure as before, to favor the
hash function computation
● Such a method of handling overflows is called
chaining or open hashing or separate chaining
Chaining

[0]
[1]
[2]
[3]
[4]
[5]
[6]
[7]
[8]
[9]
Operations on chained hash tables

● Search
– 1) Computing the hash function value (HX)
– 2) search the sequential linked list slot
● Insert
– 1) Computing the hash function value H(X)
– 2) Insert key in an empty linked slot
– 3) In the case of same bucket, insert front or insert end into
the slot linkrd list
● Delete
Performance Analysis

● Best case is O(1)

● Worse case O(n)
● Example – problem 13.7
APPLICATIONS

● Representation of a keyword table in a compiler

● Evaluation of a join operation on relational
databases
● Direct file organization

BSBINS401 Assessment (Student Pack)
No ratings yet
BSBINS401 Assessment (Student Pack)
99 pages
Queratoglobo Expo 2012
100% (1)
Queratoglobo Expo 2012
42 pages
Example A Small Signal Analysis of A BJT Amp
100% (1)
Example A Small Signal Analysis of A BJT Amp
10 pages
Calculation and Spesification of Compressed Air System: Design Iv Machinery Department of Marine Engineering
0% (1)
Calculation and Spesification of Compressed Air System: Design Iv Machinery Department of Marine Engineering
7 pages
Hashing
No ratings yet
Hashing
22 pages
Chapter 8 - Hashing
No ratings yet
Chapter 8 - Hashing
26 pages
Modifed Hash
No ratings yet
Modifed Hash
42 pages
DSA LABTASK 12
No ratings yet
DSA LABTASK 12
5 pages
Lab 09 - Hashing
No ratings yet
Lab 09 - Hashing
47 pages
unit 1 Hashing
No ratings yet
unit 1 Hashing
61 pages
DS 8
No ratings yet
DS 8
30 pages
Hashing
No ratings yet
Hashing
38 pages
Chapter 8 - Hashing
No ratings yet
Chapter 8 - Hashing
78 pages
629314285 Hashing in Data Structure
No ratings yet
629314285 Hashing in Data Structure
23 pages
Hashing in Data Structure
No ratings yet
Hashing in Data Structure
23 pages
Hashing Techniques - U3
No ratings yet
Hashing Techniques - U3
9 pages
Hashing Slide
No ratings yet
Hashing Slide
16 pages
Collision Resolution Techniques
No ratings yet
Collision Resolution Techniques
10 pages
DSAL Ass1 Writeup
No ratings yet
DSAL Ass1 Writeup
4 pages
Chapter 8 - Hashing
No ratings yet
Chapter 8 - Hashing
78 pages
CHAPTER 8 Hashing: Instructors: C. Y. Tang and J. S. Roger Jang
No ratings yet
CHAPTER 8 Hashing: Instructors: C. Y. Tang and J. S. Roger Jang
78 pages
Hashing
No ratings yet
Hashing
24 pages
Module 5
No ratings yet
Module 5
25 pages
Matrix Hashing With Two Level of Collision Resolution: National Institute of Technology Raipur
No ratings yet
Matrix Hashing With Two Level of Collision Resolution: National Institute of Technology Raipur
7 pages
Hash Table: Didih Rizki Chandranegara
No ratings yet
Hash Table: Didih Rizki Chandranegara
33 pages
Study_Material_on_Hashing
No ratings yet
Study_Material_on_Hashing
4 pages
Hashing
No ratings yet
Hashing
4 pages
Hash Table
No ratings yet
Hash Table
26 pages
11 Hashing
No ratings yet
11 Hashing
60 pages
Algo Cha 8
No ratings yet
Algo Cha 8
20 pages
L-2005-08-Advance Data Structure Part 1-HS
No ratings yet
L-2005-08-Advance Data Structure Part 1-HS
46 pages
Hashing in Data Structure
No ratings yet
Hashing in Data Structure
25 pages
Hashing and Graphs
No ratings yet
Hashing and Graphs
28 pages
ADS M TECH MID 2
No ratings yet
ADS M TECH MID 2
26 pages
Hashing PPT
No ratings yet
Hashing PPT
39 pages
DSA Unit VI Hashing and File Organization
No ratings yet
DSA Unit VI Hashing and File Organization
56 pages
Lect Hashing
No ratings yet
Lect Hashing
36 pages
Hashing
No ratings yet
Hashing
34 pages
Hashing
No ratings yet
Hashing
20 pages
Theory PDF
No ratings yet
Theory PDF
18 pages
Seminar 5
No ratings yet
Seminar 5
5 pages
3 Hashing
No ratings yet
3 Hashing
20 pages
Hashing
No ratings yet
Hashing
41 pages
Topic 1: Hashing - Introduction: Hashing Is A Method of Storing and Retrieving Data From A Database Efficiently
No ratings yet
Topic 1: Hashing - Introduction: Hashing Is A Method of Storing and Retrieving Data From A Database Efficiently
31 pages
Full Unit 6 Cse 205 (1)
No ratings yet
Full Unit 6 Cse 205 (1)
20 pages
MODULE-5
No ratings yet
MODULE-5
33 pages
Hashing
No ratings yet
Hashing
35 pages
Hashing
No ratings yet
Hashing
35 pages
Hashing: Amar Jukuntla
No ratings yet
Hashing: Amar Jukuntla
22 pages
Cse373 10 Hashing
No ratings yet
Cse373 10 Hashing
36 pages
Hashing
No ratings yet
Hashing
37 pages
Hash Function
No ratings yet
Hash Function
9 pages
Hashing Part 1 Lecture
No ratings yet
Hashing Part 1 Lecture
33 pages
Lab 2
No ratings yet
Lab 2
10 pages
AST20105 Data Structure and Algorithms: Chapter 9 - Hash Table
No ratings yet
AST20105 Data Structure and Algorithms: Chapter 9 - Hash Table
39 pages
MODULE 5_BCS304_HASHING_Leftisht trees_OBST_Notes
No ratings yet
MODULE 5_BCS304_HASHING_Leftisht trees_OBST_Notes
32 pages
DSA MK Lect2 PDF
No ratings yet
DSA MK Lect2 PDF
92 pages
Hash Tables: Dr. Dibakar Saha
No ratings yet
Hash Tables: Dr. Dibakar Saha
26 pages
Chapter 11 Hashing
No ratings yet
Chapter 11 Hashing
42 pages
Dsa Lecture 13 Hash Tables
No ratings yet
Dsa Lecture 13 Hash Tables
15 pages
Unit-5
No ratings yet
Unit-5
50 pages
CO4 - Hashing in Data Structure
No ratings yet
CO4 - Hashing in Data Structure
13 pages
C++ Programming For Beginners
From Everand
C++ Programming For Beginners
Artur Kalls
No ratings yet
Lisp Programming Language
From Everand
Lisp Programming Language
Faiz ul haque Zeya
No ratings yet
Quiz 2 Solutions
No ratings yet
Quiz 2 Solutions
12 pages
Module4 Monitoring Reporting 04 PDF
No ratings yet
Module4 Monitoring Reporting 04 PDF
21 pages
Laboratory Module: Control Systems (EMT 364/4) Semester 2 (2011/2012)
No ratings yet
Laboratory Module: Control Systems (EMT 364/4) Semester 2 (2011/2012)
4 pages
Unit 2 Clock-Driven Scheduling: 5.1 Notations and Assumptions
No ratings yet
Unit 2 Clock-Driven Scheduling: 5.1 Notations and Assumptions
18 pages
Basic To Algebra
No ratings yet
Basic To Algebra
15 pages
Week 3-Primary Vs
No ratings yet
Week 3-Primary Vs
19 pages
Exponent
No ratings yet
Exponent
17 pages
Trial Final Exam - Sem 1 - 2019-2020 PDF
No ratings yet
Trial Final Exam - Sem 1 - 2019-2020 PDF
13 pages
Smoke Free Campus
No ratings yet
Smoke Free Campus
28 pages
UVW 312 - English For Technical Communication WEEK 3: Primary vs. Secondary Information Exercise Exercise 1: Answer The Following Questions
No ratings yet
UVW 312 - English For Technical Communication WEEK 3: Primary vs. Secondary Information Exercise Exercise 1: Answer The Following Questions
3 pages
BKAL1013 Syllabus A191 - Students
No ratings yet
BKAL1013 Syllabus A191 - Students
6 pages
Architecture and Instruction Set: User's Manual, July 2000
No ratings yet
Architecture and Instruction Set: User's Manual, July 2000
118 pages
Syllabus
No ratings yet
Syllabus
4 pages
004 TFDJ 042 Yiedqzosd 0 Aglsj 4 Py
No ratings yet
004 TFDJ 042 Yiedqzosd 0 Aglsj 4 Py
9 pages
CAPE Formula Sheet
No ratings yet
CAPE Formula Sheet
12 pages
Mechanics of Balsa (Ochroma Pyramidale) Wood: MIT Open Access Articles
No ratings yet
Mechanics of Balsa (Ochroma Pyramidale) Wood: MIT Open Access Articles
49 pages
R2 Cable
No ratings yet
R2 Cable
1 page
Datasheet BW 90 AD-5
No ratings yet
Datasheet BW 90 AD-5
4 pages
Syntrum Guideline
No ratings yet
Syntrum Guideline
5 pages
Measurement of Viscosity and Sucrose Concentration
No ratings yet
Measurement of Viscosity and Sucrose Concentration
7 pages
The Keys To Successful Investing
No ratings yet
The Keys To Successful Investing
4 pages
Improving Electric Energy Result by Using Composition Wavelength
No ratings yet
Improving Electric Energy Result by Using Composition Wavelength
5 pages
Myanmar Companies Online
No ratings yet
Myanmar Companies Online
2 pages
National Case Management System Framework
No ratings yet
National Case Management System Framework
61 pages
Language Control in Aldous Huxleys Brave New Worl
No ratings yet
Language Control in Aldous Huxleys Brave New Worl
13 pages
Motivation and Personality
No ratings yet
Motivation and Personality
38 pages
Rubric For Games
50% (4)
Rubric For Games
1 page
From (Abiha Zaidi (Abihazaidi - Nliu@gmail - Com) ) - ID (27) - My Digital Signatures - Final
No ratings yet
From (Abiha Zaidi (Abihazaidi - Nliu@gmail - Com) ) - ID (27) - My Digital Signatures - Final
21 pages
EF4e Beg Filetest 02B AK
No ratings yet
EF4e Beg Filetest 02B AK
3 pages
Kverneland OPTIMA PDF
100% (1)
Kverneland OPTIMA PDF
318 pages
Nuance Paperport
No ratings yet
Nuance Paperport
4 pages
A Comprehensive Analysis of The Android Permissions System: Iman Almomani, (Senior, IEEE) and AALA AL KHAYER
No ratings yet
A Comprehensive Analysis of The Android Permissions System: Iman Almomani, (Senior, IEEE) and AALA AL KHAYER
19 pages
Api Standard
No ratings yet
Api Standard
6 pages
Sbi Clerk Pyq
No ratings yet
Sbi Clerk Pyq
365 pages
SQUARE D - I LINE Busbar Insulation
No ratings yet
SQUARE D - I LINE Busbar Insulation
4 pages
The Entry Level Professional Resume
No ratings yet
The Entry Level Professional Resume
1 page
Lecture Presentation TMHM 005 Macro Perspective of Tourism and Hospitality Chapter 2 The Meaning of Travel Tourism Tourist and Hospitality
No ratings yet
Lecture Presentation TMHM 005 Macro Perspective of Tourism and Hospitality Chapter 2 The Meaning of Travel Tourism Tourist and Hospitality
18 pages
Bangalore Accounts
100% (1)
Bangalore Accounts
311 pages
Rising Disc
No ratings yet
Rising Disc
8 pages
PROJECT
No ratings yet
PROJECT
42 pages