Understanding Hash Tables in Python

Hash tables store data in an array format where each data value has a unique index value. Data access is very fast if the index is known. Hashing is used to convert key values into array indexes by using a hash function. Collisions occur if two keys map to the same index, and are resolved using chaining or open addressing techniques like linear probing, quadratic probing, or double hashing.

Hash Table

DIDIH RIZKI CHANDRANEGARA

Hash Table
A hash table is a data structure that stores data in an associative manner, that is, as key-value pairs.
In a hash table, data is stored in an array, where each data value has its own unique index.
Access to data is very fast if we know the index of the desired item.
As a result, insertion and search are very fast regardless of how much data is stored.
A hash table uses an array as its storage medium and uses a hashing technique to generate the index at which an element is inserted or located.
This technique is called hashing.
Hashing
Hashing is a technique to convert a range of key values into a range of indexes of an array.

We will use the modulo operator to map key values into the range of valid indexes.
Hashing has the following advantages:
1. Data does not need to be sorted to be searched.
2. Without collisions or overflow, a search takes only O(1) time, regardless of the data size.
3. Security: without knowing the hash function, a reader cannot tell where a given key is stored.
Hash Table
A hash table is an array of fixed size TableSize.
Array elements are indexed by a key, which is mapped to an array index (0...TableSize-1).
The mapping from key to index is the hash function h.
E.g., h("john") = 3 means that the key "john" is stored at index 3 of the table/array.
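As a minimal sketch of such a mapping, the snippet below turns a string key into an index by summing its character codes and taking the remainder modulo the table size. The function name `simple_hash` and the table size of 7 are assumptions made for illustration; with this particular function, "john" does not necessarily land at index 3 as in the example above.

```python
def simple_hash(key: str, table_size: int) -> int:
    """Map a string key to an index in 0..table_size-1 (illustrative only)."""
    # Sum the character codes, then reduce the sum with the modulo operator.
    return sum(ord(ch) for ch in key) % table_size


table_size = 7                      # assumed size, chosen for the example
table = [None] * table_size         # the fixed-size array behind the table

index = simple_hash("john", table_size)
table[index] = "john"               # store the key at its computed index
```

Any function that always returns the same index for the same key, and stays within 0...TableSize-1, could take the place of `simple_hash` here.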
Basic Operations
The basic operations of a hash table are:
Search: searches for an element in the hash table.
Insert: inserts an element into the hash table.
Delete: deletes an element from the hash table.
Search Operation
To search for an element, compute the hash code of the given key and use that hash code as an index into the array to locate the element.
If the element is not found at the computed index, use linear probing to check the cells that follow.
Insert Operation
To insert an element, compute the hash code of the given key and use that hash code as an index into the array.
If that index is already occupied, use linear probing to find an empty location.
Delete Operation
To delete an element, compute the hash code of the given key and use that hash code as an index into the array.
If the element is not found at the computed index, use linear probing to check the cells that follow. When the element is found, store a dummy item in its place so that later searches which probe past this cell still work.
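The three operations above can be sketched as a small class. This is an illustrative sketch under stated assumptions, not a reference implementation: the class name `LinearProbingTable`, the sentinel objects `_EMPTY` and `_DELETED`, and the use of Python's built-in `hash()` as the hash function are all choices made for this example.

```python
_EMPTY = object()    # marks a slot that has never been used
_DELETED = object()  # the dummy item left behind by delete


class LinearProbingTable:
    def __init__(self, size=8):
        self.size = size
        self.keys = [_EMPTY] * size
        self.values = [None] * size

    def _probe(self, key):
        """Yield slot indexes starting at the hash code, wrapping around."""
        start = hash(key) % self.size
        for i in range(self.size):
            yield (start + i) % self.size

    def insert(self, key, value):
        first_free = None
        for idx in self._probe(key):
            slot = self.keys[idx]
            if slot is _EMPTY:
                if first_free is None:
                    first_free = idx
                break                      # key cannot be further along
            if slot is _DELETED:
                if first_free is None:
                    first_free = idx       # remember the slot, keep looking
            elif slot == key:
                self.values[idx] = value   # key already present: update
                return
        if first_free is None:
            raise OverflowError("hash table is full")
        self.keys[first_free] = key
        self.values[first_free] = value

    def search(self, key):
        for idx in self._probe(key):
            slot = self.keys[idx]
            if slot is _EMPTY:
                break                      # untouched slot: key is absent
            if slot is not _DELETED and slot == key:
                return self.values[idx]
        return None

    def delete(self, key):
        for idx in self._probe(key):
            slot = self.keys[idx]
            if slot is _EMPTY:
                return False
            if slot is not _DELETED and slot == key:
                self.keys[idx] = _DELETED  # dummy item keeps probes working
                self.values[idx] = None
                return True
        return False
```

The dummy item matters because a search stops at the first never-used slot. If delete simply emptied the cell, any key stored beyond it in the same probe run would become unreachable.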
Hash Function

Three hash functions are covered here:

1. Division
2. Mid-square
3. Folding
Division
Map a key k into one of m slots by taking the remainder of k divided by m:

h(k) = k mod m

Ex. m = 12, k = 100, then h(k) = 100 mod 12 = 4

A prime number is often a good choice for m.
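A short sketch of the division method follows, together with one way to see why a prime m can help. The function name `division_hash` and the sample keys (multiples of 4) are assumptions chosen for illustration.

```python
def division_hash(k: int, m: int) -> int:
    """Division method: the remainder of k divided by m."""
    return k % m


# Keys that share a common factor with m cluster into few slots.
keys = list(range(0, 52, 4))   # 13 keys: 0, 4, 8, ..., 48

slots_m12 = {division_hash(k, 12) for k in keys}   # m = 12 (not prime)
slots_m13 = {division_hash(k, 13) for k in keys}   # m = 13 (prime)
```

With m = 12 these keys only ever reach slots 0, 4 and 8, while with m = 13 the same keys spread over all 13 slots.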
Mid Square
Map a key k into one of m slots by taking some middle digits of k^2:

h(k) = middle (log10 m) digits of k^2

Ex. m = 10000, k = 113586, log10 m = 4

h(k) = middle 4 digits of 113586^2
= middle 4 digits of 12901779396
= 1779
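The mid-square method can be sketched as below. Two assumptions are made for this sketch: m is a power of 10, so the number of digits to keep is the number of zeros in m, and when the square has an uneven number of digits on either side of the middle, the extra digit is dropped on the left, which matches the worked example above. The function name `mid_square` is hypothetical.

```python
def mid_square(k: int, m: int) -> int:
    """Mid-square method: keep the middle log10(m) digits of k squared.

    Assumes m is a power of 10 (10, 100, 1000, ...).
    """
    r = len(str(m)) - 1                 # log10(m) for a power of 10
    squared = str(k * k).zfill(r)       # pad so there are at least r digits
    # Round the start position up so an uneven split drops more on the left.
    start = (len(squared) - r + 1) // 2
    return int(squared[start:start + r])


index = mid_square(113586, 10000)
```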
Folding
Divide k into sections that all have the same length, except possibly the last one. Then add the sections together, using one of two methods:
1. Shift folding
2. Folding at the boundaries
H(k) = ∑ (sections of k), combined by method 1 or 2
Ex. k = 12320324111220, section length = 3
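Both folding variants can be sketched for the example key. The sketch assumes the usual textbook reading of the two methods: shift folding adds the sections as they are, while folding at the boundaries reverses the digits of every second section before adding. The function names are hypothetical.

```python
def split_sections(k: int, length: int):
    """Cut the decimal digits of k into sections of the given length."""
    digits = str(k)
    return [digits[i:i + length] for i in range(0, len(digits), length)]


def shift_fold(k: int, length: int) -> int:
    """Shift folding: add the sections as they are."""
    return sum(int(section) for section in split_sections(k, length))


def boundary_fold(k: int, length: int) -> int:
    """Folding at the boundaries: reverse every second section, then add."""
    total = 0
    for n, section in enumerate(split_sections(k, length)):
        total += int(section[::-1]) if n % 2 == 1 else int(section)
    return total


sections = split_sections(12320324111220, 3)   # ['123', '203', '241', '112', '20']
shifted = shift_fold(12320324111220, 3)        # 123 + 203 + 241 + 112 + 20
folded = boundary_fold(12320324111220, 3)      # 123 + 302 + 241 + 211 + 20
```

If the resulting sum can exceed the table size, it would still need to be reduced, for example with the division method.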
Why Array and not Linked List for Hash Table
The array in a hash table could be replaced by a linked list, but indexing would become a problem.
A linked list has no sequential indexing, whereas an array does. With a linked list, we need extra time to walk to the index.
With an array, once the index is known, the value is obtained without iteration, so access is faster.
Collision
If the hash function produces the same index for multiple keys, a conflict arises. This situation is called a collision.
Collisions are resolved using chaining or open addressing.
Chaining
In this technique, if the hash function produces the same index for multiple elements, these elements are stored at that index in a doubly linked list.
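A minimal sketch of chaining is shown below. As a simplifying assumption, each slot holds a plain Python list instead of a doubly linked list; the collision handling is the same idea, only the container differs. The class name `ChainedTable` and the use of the built-in `hash()` are also choices made for this example.

```python
class ChainedTable:
    def __init__(self, size=8):
        # One bucket (chain) per slot; colliding keys share a bucket.
        self.buckets = [[] for _ in range(size)]

    def _bucket(self, key):
        return self.buckets[hash(key) % len(self.buckets)]

    def insert(self, key, value):
        bucket = self._bucket(key)
        for i, (existing_key, _) in enumerate(bucket):
            if existing_key == key:
                bucket[i] = (key, value)   # key already present: update
                return
        bucket.append((key, value))

    def search(self, key):
        for existing_key, value in self._bucket(key):
            if existing_key == key:
                return value
        return None

    def delete(self, key):
        bucket = self._bucket(key)
        for i, (existing_key, _) in enumerate(bucket):
            if existing_key == key:
                del bucket[i]
                return True
        return False
```

Unlike open addressing, deletion here needs no dummy item, because removing an entry from one chain does not affect how other keys are found.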
Open Addressing
Open addressing is a collision resolution technique in which colliding elements are placed in other slots of the array itself. Some of the methods used by open addressing are:
1. Linear Probing
2. Quadratic Probing
3. Double Hashing
Linear Probing
The hash function may produce an index of the array that is already in use.
In that case, we search for the next empty location by looking at the following cells one by one until we find an empty cell. If there is no empty cell, the table has overflowed.
This technique is called linear probing.
For the division method:
h(k, i) = (k + i) mod m, or equivalently
h(k, i) = ((k mod m) + i) mod m
i : 0, 1, ... , m-1
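The probe function can be sketched directly from the formula, assuming the division method as the base hash. The function names and the sample values k = 58, m = 10 are chosen for illustration.

```python
def linear_probe(k: int, i: int, m: int) -> int:
    """i-th probe position for key k in a table of size m."""
    return (k % m + i) % m


def linear_probe_sequence(k: int, m: int):
    """All m positions in the order linear probing would visit them."""
    return [linear_probe(k, i, m) for i in range(m)]


sequence = linear_probe_sequence(58, 10)
```

The final reduction modulo m is what makes the sequence wrap around to the start of the array, so every slot is visited exactly once.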
Quadratic Probing
For the division method:
h(k, i) = (k + i^2) mod m, or equivalently
h(k, i) = ((k mod m) + i^2) mod m
i : 0, 1, ... , m-1
Probe sequence (i = 0,1,2,3,4): +0, +1, +4, +9, +16,...
This method works much better than linear probing, but to make full use of the hash table, [...] should never evaluate to 0 or 1 (i > 1).
Example (m = 10; X marks a slot that is already occupied):
h(58,0) = (58+0^2) mod 10 = 8 (X)
h(58,1) = (58+1^2) mod 10 = 9 (X)
h(58,2) = (58+2^2) mod 10 = 2
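The example can be reproduced with a short sketch. It assumes the division method as the base hash and represents the occupied slots as a plain set; the function names are hypothetical.

```python
def quadratic_probe(k: int, i: int, m: int) -> int:
    """i-th probe position for key k, using an offset of i squared."""
    return (k % m + i * i) % m


def first_free_slot(k: int, occupied: set, m: int):
    """Return the first probe position not in `occupied`, or None."""
    for i in range(m):
        idx = quadratic_probe(k, i, m)
        if idx not in occupied:
            return idx
    return None


probes = [quadratic_probe(58, i, 10) for i in range(3)]   # positions tried
slot = first_free_slot(58, {8, 9}, 10)                    # slots 8, 9 taken
```

Note that `first_free_slot` can return None even when free slots remain, because a quadratic probe sequence does not always visit every slot of the table.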
Double Hashing
Double hashing is one of the best methods available for open addressing, because the permutations it produces have many of the characteristics of randomly chosen permutations.
h(k, i) = (h1(k) + i*h2(k)) mod m
h1 : first hash function
h2 : second hash function
i : 0, 1, ... , m-1
Good choices for h2(k)?
- It should never evaluate to 0; otherwise the probe would not advance for i > 0.
- Ex. for division: h2(k) = R - (k mod R), where R is a prime number less than TableSize.
Ex. R = 7, m = 10, with h1(k) = k mod m:
h(49,0) = (h1(49) + 0*h2(49)) mod 10 = 9
h(49,1) = (h1(49) + 1*h2(49)) mod 10
= (9 + 1*(7 - (49 mod 7))) mod 10
= 16 mod 10
= 6
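The worked example can be checked with a small sketch, assuming h1 is the division method and h2(k) = R - (k mod R) as given above. The function names are chosen for illustration.

```python
def h1(k: int, m: int) -> int:
    """First hash function: division method."""
    return k % m


def h2(k: int, r: int) -> int:
    """Second hash function: always in 1..r, so it is never 0."""
    return r - (k % r)


def double_hash(k: int, i: int, m: int, r: int) -> int:
    """i-th probe position for key k with table size m and prime r < m."""
    return (h1(k, m) + i * h2(k, r)) % m


first = double_hash(49, 0, 10, 7)    # h1(49) = 9
second = double_hash(49, 1, 10, 7)   # (9 + 7) mod 10
```

Because k mod R is at most R - 1, h2 never returns 0, so each further probe moves by a fixed non-zero step that depends on the key.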
EMAIL
didihrizki@[Link] (work)
diedieh02@[Link] (personal)
PHONE : 081349254787
