0% found this document useful (0 votes)

53 views26 pages

DS Lecture - 6 (Hashing)

Uploaded by

omvati343

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

53 views26 pages

DS Lecture - 6 (Hashing)

Uploaded by

omvati343

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 26

Hashing

• Mathematical concept
– To define any number as set of numbers in
given interval
– To cut down part of number
– Used in discreet maths, e.g graph theory, set
theory
– Used in Searching technique
– Used in encryption methods

1
Hash Functions and Hash
Tables
• Hashing has 2 major components
– Hash function h
– Hash Table Data Structure of size N
• A hash function h maps keys (a identifying element of
record set) to hash value or hash key which refers to specific
location in Hash table
• Example:
h(x) = x mod N
is a hash function for integer keys
• The integer h(x) is called the hash value of key x

2
Hash Functions and Hash Tables
• A hash table data structure is an array or array
type ADTof some fixed size, containing the keys.
• An array in which records are not stored
consecutively - their place of storage is
calculated using the key and a hash function

hash array
Key index
function

3
• Hashed key: the result of applying a hash function to a
key
• Keys and entries are scattered throughout the array
• Contains the main advantages of both Arrays and Trees
• Mainly the topic of hashing depends upon the two main
factors / parts
(a) Hash Function (b) Collision Resolution
• Table Size is also an factor (miner) in Hashing, which is
0 to tablesize-1.

4
Table Size
• Hash table size
– Should be appropriate for the hash function used

– Too big will waste memory; too small will increase

collisions and may eventually force rehashing
(copying into a larger table)

5
Example
• We design a hash table for
a dictionary storing items 0 
(SSN, Name), where SSN 1 025-612-0001
(social security number) is a 2 981-101-0002

nine-digit positive integer 3 

4 451-229-0004
• The actual data is not

…
stored in hash table
• Pin points the location of 9997 
9998 200-751-9998
actual data or set of data 9999 
• Our hash table uses an
array of size N = 10,000
and the hash function
h(x) = last four digits of x
6
Hash Function
• The mapping of keys into the table is called Hash
Function
• A hash function,
– Ideally, it should distribute keys and entries evenly
throughout the table
– It should be easy and quick to compute.
– It should minimize collisions, where the position
given by the hash function is already occupied
– It should be applicable to all objects

7
• Different types of hash functions are used for the
mapping of keys into tables.

(a) Division Method

(b) Mid-square Method
(c) Folding Method

8
1. Division Method
• Choose a number m larger than the number n of keys
in k.
• The number m is usually chosen to be a prime no.
• The hash function H is defined as,
H(k) = k(mod m) or H(k) = k(mod m) + 1
• Denotes the remainder, when k is divided by m
• 2nd formula is used when range is from 1 to m.

9
• Example:
Elements are: 3205, 7148, 2345

Table size: 0 – 99 (prime)

m = 97 (prime)

H(3205)= 4, H(7148)=67, H(2345)=17

• For 2nd formula add 1 into the remainders.

10
2. Folding Method
• The key k is partitioned into no. of parts
• Then add these parts together and ignoring the
last carry.
• One can also reverse the first part before
adding (right or left justified. Mostly right)
H(k) = k1 + k2 + ………. + kn

11
• Example:

H(3205)=32+05=37 or
H(3250)=32+50=82
H(7148)=71+43=19 or
H(7184)=71+84=55
H(2345)=23+45=77 or
H(2354)=23+54=68

12
3. Mid-Square Method
• The key k is squared. Then the hash function H is
defined as
H(k) = l
• The l is obtained by deleting the digits from both
ends of K2.

• The same position must be used for all the keys.

13
• Example:
k: 3205 7148 2345
k2: 10272025 51093904 5499025
H(k): 72 93 99

• 4th and 5th digits have been selected. From the

right side.

14
Collision Resolution Strategies
• If two keys map on the same hash table index then we
have a collision.
• As the number of elements in the table increases, the
likelihood of a collision increases - so make the table
as large as practical
• Collisions may still happen, so we need a collision
resolution strategy

15
• Two approaches are used to resolve collisions.
(a) Separate chaining: chain together several keys/entries
in each position.
(b) Open addressing: store the key/entry in a different
position.
• Probing: If the table position given by the hashed
key is already occupied, increase the position by
some amount, until an empty position is found

16
Open Addressing

• Types of open addressing are

1. Linear Probing
2. Quadratic Probing
3. Double Hashing.

17
1. Linear Probing
• Locations are checked from the hash location k to the
end of the table and the element is placed in the first
empty slot
• If the bottom of the table is reached, checking “wraps
around” to the start of the table. Modulus is used for
this purpose
• Thus, if linear probing is used, these routines must
continue down the table until a match or empty location
is found

18
• Linear probing is guaranteed to find a slot for the
insertion if there still an empty slot in the table .
• Even though the hash table size is a prime number is
probably not an appropriate size; the size should be at
least 30% larger than the maximum number of elements
ever to be stored in the table.

• If the load factor is greater than 50% - 70% then the

time to search or to add a record will increase.

19
H(k)=h, h+1, h+2, h+3,……, h+I

• However, linear probing also tends to promote

clustering within the table.

1 2 3 4 5 6 7 8

20
2. Quadratic Probing
• Quadratic probing is a solution to the clustering
problem
– Linear probing adds 1, 2, 3, etc. to the original
hashed key
– Quadratic probing adds 12, 22, 32 etc. to the original
hashed key
• However, whereas linear probing guarantees that all
empty positions will be examined if necessary,
quadratic probing does not

21
• If the table size is prime, this will try approximately
half the table slots.
• More generally, with quadratic probing, insertion may
be impossible if the table is more than half-full!

H(k) = h, h+1, h+4, h+5, h+6,……, h+i2

22
3. Double Hashing
• 2nd hash function H’ is used to resolve the collision.
• Here H’(k) = h’ ≠ m
• Therefore we can search the locations with addresses,
H’(k) = h, h+h’, h+2h’, h+3h’,…….
• If m is prime, then this sequence access all the
locations.

23
Double Hashing
• Double hashing uses a
secondary hash function • Common choice of
d(k) and handles collisions compression map for the
by placing an item in the secondary hash function:
first available cell of the d2(k) = k mod q
series
where
(h + jd(k)) mod N
– q<N
for j = 0, 1, … , N - 1
– q is a prime
• The secondary hash
function d(k) cannot have • The possible values for
zero values d2(k) are
• The table size N must be a 1, 2, … , q
prime to allow probing of
all the cells 24
Example of Double Hashing
k h (k ) d (k ) Probes
• Consider a hash 18 5 9 5
table storing integer 41 2 8 2
22 9 10 9
keys that handles 44 5 5 5 7
collision with double 59 7 10 7 10 0
32 6 4 6
hashing 31 5 8 5 8
– N = 13 73 8 11 8 11
– h(k) = k mod 13
– d(k) = k mod 7
0 1 2 3 4 5 6 7 8 9 10 11 12
• Insert keys 18, 41,
22, 44, 59, 32, 31,
73, in this order 59 41 183244 8 224411
0 1 2 3 4 5 6 7 8 9 10 11 12
25
Applications of Hashing
• Compilers use hash tables to keep track of declared
variables
• A hash table can be used for on-line spelling checkers
— if misspelling detection (rather than correction) is
important, an entire dictionary can be hashed and
words checked in constant time
• Game playing programs use hash tables to store seen
positions, thereby saving computation time if the
position is encountered again
• Hash functions can be used to quickly check for
inequality — if two elements hash to different values
they must be different

DS Lecture - 6 (Hashing)
No ratings yet
DS Lecture - 6 (Hashing)
27 pages
SORTING PROGRAMS - Counting + Bucket + Heap
No ratings yet
SORTING PROGRAMS - Counting + Bucket + Heap
27 pages
Hashing Techniques Explained
No ratings yet
Hashing Techniques Explained
32 pages
Hashing PDF
No ratings yet
Hashing PDF
56 pages
Hashing
No ratings yet
Hashing
56 pages
Understanding Hashing Techniques and Methods
No ratings yet
Understanding Hashing Techniques and Methods
33 pages
Vtucode - in Module 5 DS 2022 Scheme
No ratings yet
Vtucode - in Module 5 DS 2022 Scheme
24 pages
Hashing
No ratings yet
Hashing
20 pages
Understanding Hashing in Data Structures
No ratings yet
Understanding Hashing in Data Structures
53 pages
Hashing: Collision Handling Methods
No ratings yet
Hashing: Collision Handling Methods
52 pages
3 Hashing
No ratings yet
3 Hashing
20 pages
Hashing
No ratings yet
Hashing
33 pages
What Is Hashing
No ratings yet
What Is Hashing
11 pages
Lecture 08 - Hash Tables
No ratings yet
Lecture 08 - Hash Tables
21 pages
Direct Addressing in Hash Tables
No ratings yet
Direct Addressing in Hash Tables
26 pages
Hashing Techniques & Functions
No ratings yet
Hashing Techniques & Functions
30 pages
Hashing New
No ratings yet
Hashing New
48 pages
Hash Tables: A Programmer's Guide
No ratings yet
Hash Tables: A Programmer's Guide
26 pages
Unit 5
No ratings yet
Unit 5
50 pages
Dictionary and Hash Table Overview
No ratings yet
Dictionary and Hash Table Overview
9 pages
HASHING
No ratings yet
HASHING
63 pages
Hashing Methods
No ratings yet
Hashing Methods
20 pages
Hash Tables and Collision Resolution
No ratings yet
Hash Tables and Collision Resolution
47 pages
DS 5
No ratings yet
DS 5
23 pages
Hash Tables: Concepts & Implementations
No ratings yet
Hash Tables: Concepts & Implementations
53 pages
Hashing
No ratings yet
Hashing
30 pages
Unit2 Hashing DSA
No ratings yet
Unit2 Hashing DSA
55 pages
Hashing
No ratings yet
Hashing
34 pages
Hashing Algorithms
No ratings yet
Hashing Algorithms
22 pages
Hashing Techniques Explained
No ratings yet
Hashing Techniques Explained
20 pages
Open vs Closed Hashing Techniques
No ratings yet
Open vs Closed Hashing Techniques
16 pages
Lect Hashing
No ratings yet
Lect Hashing
36 pages
Hashing Techniques Explained
No ratings yet
Hashing Techniques Explained
23 pages
Understanding Hashing Techniques and Functions
No ratings yet
Understanding Hashing Techniques and Functions
13 pages
Understanding Hash Tables in Python
No ratings yet
Understanding Hash Tables in Python
33 pages
Hashing
No ratings yet
Hashing
16 pages
Week13 1
No ratings yet
Week13 1
16 pages
Hash Table
No ratings yet
Hash Table
9 pages
Hashing
No ratings yet
Hashing
25 pages
Primary Clustering in Hashing
No ratings yet
Primary Clustering in Hashing
61 pages
Hash Table Fundamentals and Techniques
No ratings yet
Hash Table Fundamentals and Techniques
39 pages
06 - APS - Hash Table
No ratings yet
06 - APS - Hash Table
28 pages
University Institute of Engineering CSE-2 Year: Advanced Data Structures and Algorithms
No ratings yet
University Institute of Engineering CSE-2 Year: Advanced Data Structures and Algorithms
26 pages
11 Hashing
No ratings yet
11 Hashing
60 pages
2,2 Hashing
No ratings yet
2,2 Hashing
30 pages
Hashing Techniques Explained
No ratings yet
Hashing Techniques Explained
44 pages
Module 5
No ratings yet
Module 5
22 pages
UNIT 8 Hashing
No ratings yet
UNIT 8 Hashing
24 pages
VND - Openxmlformats Officedocument - Wordprocessingml.document&rendition 1
No ratings yet
VND - Openxmlformats Officedocument - Wordprocessingml.document&rendition 1
9 pages
Cse373 10 Hashing
No ratings yet
Cse373 10 Hashing
36 pages
Dsa 4
No ratings yet
Dsa 4
55 pages
2 - Hashing
No ratings yet
2 - Hashing
21 pages
Hashing Techniques Overview
No ratings yet
Hashing Techniques Overview
23 pages
Hash Tables and Collision Handling Techniques
No ratings yet
Hash Tables and Collision Handling Techniques
25 pages
Done DS GTU Study Material Presentations Unit-4 13032021035653AM
No ratings yet
Done DS GTU Study Material Presentations Unit-4 13032021035653AM
24 pages
Hashing Techniques in Data Structures
No ratings yet
Hashing Techniques in Data Structures
21 pages
Unit 4 - Query Processing
No ratings yet
Unit 4 - Query Processing
49 pages
Graph Algorithms-Final
No ratings yet
Graph Algorithms-Final
158 pages
Unit 4 - Multimedia Databases
No ratings yet
Unit 4 - Multimedia Databases
26 pages
Is Lecture 1
No ratings yet
Is Lecture 1
37 pages
Linked List
No ratings yet
Linked List
69 pages
Recursion
No ratings yet
Recursion
12 pages
Nnai Bai-205 Unit 1
No ratings yet
Nnai Bai-205 Unit 1
107 pages
CSUnit 1
No ratings yet
CSUnit 1
124 pages
Communication Channels
No ratings yet
Communication Channels
7 pages
Maths 1A Narayana (T100) Study Material
89% (18)
Maths 1A Narayana (T100) Study Material
183 pages
Mep460 - Using Professional Ees - S
100% (1)
Mep460 - Using Professional Ees - S
79 pages
MATH 115: Lecture II Notes
No ratings yet
MATH 115: Lecture II Notes
3 pages
Lovely Professional University, Punjab
No ratings yet
Lovely Professional University, Punjab
8 pages
Detailed Lesson Plan
No ratings yet
Detailed Lesson Plan
4 pages
Using Fmincon As An Optimization Tool For Calibrations
No ratings yet
Using Fmincon As An Optimization Tool For Calibrations
21 pages
SAT Suite Question Bank - Results
No ratings yet
SAT Suite Question Bank - Results
153 pages
Expanded One One Not Onto Project
No ratings yet
Expanded One One Not Onto Project
24 pages
CHAPTER 1 First-Order Differential Equat PDF
No ratings yet
CHAPTER 1 First-Order Differential Equat PDF
13 pages
CBSE Class XII Maths Case Study Guide
No ratings yet
CBSE Class XII Maths Case Study Guide
10 pages
UPCAT Daily Study Schedule (April 16 To August 1)
No ratings yet
UPCAT Daily Study Schedule (April 16 To August 1)
6 pages
CEng 6001 Part II
No ratings yet
CEng 6001 Part II
109 pages
QandA (Questions of Abnitio
No ratings yet
QandA (Questions of Abnitio
5 pages
Full Download Advanced Calculus For Economics and Finance: Theory and Methods 1st Edition Giulio Bottazzi PDF
No ratings yet
Full Download Advanced Calculus For Economics and Finance: Theory and Methods 1st Edition Giulio Bottazzi PDF
20 pages
Evaluating Difficult Integrals Techniques
No ratings yet
Evaluating Difficult Integrals Techniques
4 pages
Applications of Integration in Mathematics and Real
No ratings yet
Applications of Integration in Mathematics and Real
5 pages
Marked Questions May Have More Than One Correct Option. 1.: X Ƒ X X 2 - X - Sin 4 Æ Ö - + Ç ÷ È Ø
No ratings yet
Marked Questions May Have More Than One Correct Option. 1.: X Ƒ X X 2 - X - Sin 4 Æ Ö - + Ç ÷ È Ø
1 page
Applied Maths 2
No ratings yet
Applied Maths 2
2 pages
1 General: MC050 - 110 - NL HW Description - Application Interface Embedded Operating Systems 2016-04-19 70089370 1
No ratings yet
1 General: MC050 - 110 - NL HW Description - Application Interface Embedded Operating Systems 2016-04-19 70089370 1
20 pages
Calculus Guide for Students
100% (1)
Calculus Guide for Students
166 pages
BCA-122 Mathematics & Statistics PDF
100% (8)
BCA-122 Mathematics & Statistics PDF
242 pages
Introduction to Linear Programming Basics
No ratings yet
Introduction to Linear Programming Basics
49 pages
MATLAB Polynomial & LTI Systems Guide
No ratings yet
MATLAB Polynomial & LTI Systems Guide
12 pages
AP Calculus BC Syllabus
No ratings yet
AP Calculus BC Syllabus
7 pages
12 Python Practical List File 2021-22
No ratings yet
12 Python Practical List File 2021-22
32 pages
General Mathematics: Answer Sheets
No ratings yet
General Mathematics: Answer Sheets
4 pages
MATH 1302 - Unit 2 Discussion Assignment
No ratings yet
MATH 1302 - Unit 2 Discussion Assignment
4 pages
UPSC Exam Syllabus Overview
No ratings yet
UPSC Exam Syllabus Overview
3 pages
OSU Undergraduate Math Courses 2022-2023
No ratings yet
OSU Undergraduate Math Courses 2022-2023
174 pages
Quadratics Cheat Sheet Edexcel Pure Year 1: Function 3 + 2 11
100% (1)
Quadratics Cheat Sheet Edexcel Pure Year 1: Function 3 + 2 11
1 page

DS Lecture - 6 (Hashing)

Uploaded by

DS Lecture - 6 (Hashing)

Uploaded by

Hashing

– Too big will waste memory; too small will increase

nine-digit positive integer 3 

(a) Division Method

Table size: 0 – 99 (prime)

H(3205)= 4, H(7148)=67, H(2345)=17

• For 2nd formula add 1 into the remainders.

• The same position must be used for all the keys.

• 4th and 5th digits have been selected. From the

• Types of open addressing are

• If the load factor is greater than 50% - 70% then the

• However, linear probing also tends to promote

H(k) = h, h+1, h+4, h+5, h+6,……, h+i2

You might also like