CS2040S: Data Structures and Algorithms
Discussion Group Problems for Week 8
For: March 11–March 15
Problem 1. Hashing Basics
Problem 1.a.
Try hashing these items [42, 24, 18, 36, 52, 0, 47, 45, 60, 27, 32, 7] with the following hash function
h(x) = x mod 7. Each row in the table corresponds to the bucket of h(x). Fill in the table below
with your answer!
Assume that we are using chaining to handle hash collisions!
h(x) | x1 | x2 | x3 | x4
  0  |    |    |    |
  1  |    |    |    |
  2  |    |    |    |
  3  |    |    |    |
  4  |    |    |    |
  5  |    |    |    |
  6  |    |    |    |
Solution: First, we work out the hashed values of the items above. They are
[0, 3, 4, 1, 3, 0, 5, 3, 4, 6, 4, 0].
Next, we iterate through these hashed values one by one and fill up our table!
h(x) | x1 | x2 | x3 | x4
  0  | 42 |  0 |  7 |
  1  | 36 |    |    |
  2  |    |    |    |
  3  | 24 | 52 | 45 |
  4  | 18 | 60 | 32 |
  5  | 47 |    |    |
  6  | 27 |    |    |
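As a sanity check, the table above can also be reproduced mechanically. Below is a minimal Java sketch (the class name ChainingDemo is ours, not part of the problem) that hashes each item with h(x) = x mod 7 and appends it to the corresponding bucket's list, i.e. chaining.

import java.util.ArrayList;
import java.util.List;

// A minimal sketch: filling the chained hash table from Problem 1.a
// with h(x) = x mod 7.
public class ChainingDemo {
    public static void main(String[] args) {
        int[] items = {42, 24, 18, 36, 52, 0, 47, 45, 60, 27, 32, 7};
        int m = 7; // number of buckets

        List<List<Integer>> buckets = new ArrayList<>();
        for (int i = 0; i < m; i++) {
            buckets.add(new ArrayList<>());
        }

        // Chaining: append each item to the end of its bucket's list.
        for (int x : items) {
            buckets.get(x % m).add(x);
        }

        for (int i = 0; i < m; i++) {
            System.out.println(i + ": " + buckets.get(i));
        }
        // The output matches the table above, e.g. bucket 0 holds [42, 0, 7].
    }
}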
Problem 1.b.
We typically use Linked Lists to store the items in a bucket! But... what if instead of a Linked
List, we use an AVL Tree to store the items in a bucket?
What are the advantages or disadvantages of such a solution?
Solution:
Time Complexity
– Consider the theoretical worst case for searching a Hash Table containing n elements,
where every element is hashed into the same bucket. This bucket’s container would then
hold n elements, so the time complexity of searching for an element depends heavily on
the type of container chosen.
– Searching through a Linked List of size n is O(n).
– Searching through an AVL Tree of size n is O(log n).
– Thus, the time complexity when using an AVL Tree would be better in the worst case!
Overhead
– Each node in an AVL Tree has to store more information than a Linked List node.
– An AVL Tree node has to store the balance factor and two child pointers.
– Meanwhile, a Linked List node only has to store a single pointer to the next node.
Complexity of Algorithm
– Additionally, AVL Trees are more complex to implement and maintain than Linked Lists!
In summary, using an AVL Tree would improve the worst case scenario, but at the cost of more
overhead and complexity!
However, if the hash function chosen is good enough (i.e. under the simple uniform hashing
assumption) and the number of buckets is at least the number of elements, the expected search
time is O(1) in both cases. This worst-case scenario only occurs when the hash function is poor.
Bonus information: Java’s implementation of HashMap converts a bucket from a Linked List to
a Red-Black Tree (a type of self-balancing Binary Search Tree) once the size of that bucket and
the number of buckets exceed certain thresholds. This switch to Red-Black Trees is a fallback
that is expected to be rare if the hash function chosen is good enough.
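To make the trade-off concrete, here is a small sketch of our own using Java's built-in LinkedList and TreeSet; the latter is backed by a Red-Black Tree, standing in for the AVL Tree discussed above. Searching the list bucket scans up to all of its nodes, while the tree bucket answers contains() with O(log n) comparisons.

import java.util.LinkedList;
import java.util.TreeSet;

// A sketch of the two bucket choices discussed above.
// TreeSet is backed by a Red-Black Tree, used here as a stand-in for an
// AVL Tree: both are balanced BSTs with O(log n) search.
public class BucketComparison {
    public static void main(String[] args) {
        int n = 100_000; // pretend every element hashed into this one bucket

        LinkedList<Integer> listBucket = new LinkedList<>();
        TreeSet<Integer> treeBucket = new TreeSet<>();
        for (int i = 0; i < n; i++) {
            listBucket.add(i);
            treeBucket.add(i);
        }

        // Searching the list bucket scans up to n nodes: O(n).
        System.out.println(listBucket.contains(n - 1));
        // Searching the balanced tree needs only O(log n) comparisons.
        System.out.println(treeBucket.contains(n - 1));
    }
}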
Problem 1.c.
The goal of a Hash Table is to store (key, value) pairs. Here’s a question: at each bucket, is
storing just the value sufficient, or do we need to store the entire (key, value) pair? Why do
you think so?
For example, consider the (key, value) pair (17, 200). At the bucket h(17), is storing just 200
sufficient, or do you need to store (17, 200)?
Solution: Storing just the value is not sufficient! Instead, the entire (key, value) pair needs to
be stored. Consider two pairs (x1, y1), (x2, y2) and let h(x1) = h(x2). We would then store only
the values y1 and y2 at the bucket h(x1).
When we search for the value of x1 in this hash table, we would reach the bucket h(x1), which
contains the two values y1 and y2. However, there would be no way to tell whether y1 or y2
belongs to x1!
If we had stored the entire (key, value) pair, it would be simple to tell that y1 was the correct value!
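Here is a minimal sketch of that argument in code. The class SimpleChainedMap, its Entry helper, and the fixed 7 buckets are our own illustration, not a prescribed implementation: get() must compare the stored keys to decide which entry in a colliding bucket belongs to the query key.

import java.util.ArrayList;
import java.util.List;

// A minimal sketch of why the key must be stored alongside the value.
public class SimpleChainedMap {
    private static class Entry {
        int key;
        int value;
        Entry(int key, int value) { this.key = key; this.value = value; }
    }

    private final List<List<Entry>> buckets = new ArrayList<>();

    public SimpleChainedMap() {
        for (int i = 0; i < 7; i++) buckets.add(new ArrayList<>());
    }

    public void put(int key, int value) {
        buckets.get(Math.floorMod(key, 7)).add(new Entry(key, value));
    }

    public Integer get(int key) {
        // Without the stored key, we could not tell which entry in this
        // bucket belongs to the query key.
        for (Entry e : buckets.get(Math.floorMod(key, 7))) {
            if (e.key == key) return e.value;
        }
        return null; // key not present
    }

    public static void main(String[] args) {
        SimpleChainedMap map = new SimpleChainedMap();
        map.put(17, 200);
        map.put(24, 300); // 24 mod 7 == 17 mod 7 == 3, so these two collide
        System.out.println(map.get(17)); // 200
        System.out.println(map.get(24)); // 300
    }
}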
Problem 2. Coupon Chaos!
Mr. Nodle has some coupons that he wishes to spend at his favourite cafe on campus, but there
are different types of coupons. In particular, there are t distinct coupon types, and he can have
any number of each type (including 0). He has n coupons in total.
He wishes to use one coupon a day, starting from day 1. He will use his coupons in ascending
order of type, using up all coupons of a lower type before moving on to the next type. Nodle
wishes to build a calendar that states which coupon he will use on each day.
The list of coupons will be given in an array. An example of a possible input is: [5, 20, 5, 20, 3, 20, 3, 20].
Here, t = 3, and n = 8. The output here would be [3, 3, 5, 5, 20, 20, 20, 20].
Since the menu at the cafe that he frequents is not very diverse, there aren’t many different
types of coupons. So we’ll say that t is much smaller than n.
Give as efficient an algorithm as you can, to build his calendar for him.
Solution: There are a few possible solutions to this problem, but it boils down to sorting an
array with many duplicates.
One possible way is to modify a height-balanced BST (such as an AVL tree) to also store
the count of each item. Insert all the items in the input, then do an in-order traversal,
outputting the value at each node as many times as it was inserted into the tree. This
takes O(n log t) time, since the size of the tree is at most t and we perform n insertion
operations.
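A sketch of this approach, where Java's TreeMap (a Red-Black Tree) stands in for a hand-modified AVL tree with counts; the method name buildCalendar is our own.

import java.util.Map;
import java.util.TreeMap;

// A sketch of the balanced-BST-with-counts approach described above.
public class CalendarViaBst {
    static int[] buildCalendar(int[] coupons) {
        // The tree holds at most t distinct keys, so each update is O(log t).
        TreeMap<Integer, Integer> counts = new TreeMap<>();
        for (int c : coupons) {
            counts.merge(c, 1, Integer::sum); // O(n log t) over all coupons
        }

        // In-order traversal of the keys, emitting each key count times.
        int[] calendar = new int[coupons.length];
        int day = 0;
        for (Map.Entry<Integer, Integer> e : counts.entrySet()) {
            for (int i = 0; i < e.getValue(); i++) {
                calendar[day++] = e.getKey();
            }
        }
        return calendar;
    }

    public static void main(String[] args) {
        int[] input = {5, 20, 5, 20, 3, 20, 3, 20};
        System.out.println(java.util.Arrays.toString(buildCalendar(input)));
        // [3, 3, 5, 5, 20, 20, 20, 20]
    }
}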
Another solution, which works better on average given reasonable hashing assumptions, is to
hash the coupon types and keep track of how many of each type we’ve seen. Additionally,
for every new coupon type that we hash into the table, we add it to a list. In expectation,
this should take at most O(n) time.
After processing the input, we’ll sort the list of keys that we had obtained previously, which
will take O(t log t) time.
Then, in sorted order, we look up the count of each key in the hash table and add the key to
the calendar that many times. Since there are a total of n items, this should take at most
O(n) time.
In total, this solution takes O(n + t log t) time, which is equivalent to O(n) since t << n.
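A sketch of this hash-then-sort-the-keys approach (again, buildCalendar is just an illustrative name):

import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;

// A sketch of the hashing approach described above.
public class CalendarViaHashing {
    static int[] buildCalendar(int[] coupons) {
        HashMap<Integer, Integer> counts = new HashMap<>();
        List<Integer> types = new ArrayList<>();

        // Expected O(n): count each type, recording new types as they appear.
        for (int c : coupons) {
            if (!counts.containsKey(c)) {
                types.add(c);
            }
            counts.merge(c, 1, Integer::sum);
        }

        Collections.sort(types); // O(t log t)

        // O(n): replay each type, in sorted order, as many times as it was counted.
        int[] calendar = new int[coupons.length];
        int day = 0;
        for (int type : types) {
            int count = counts.get(type);
            for (int i = 0; i < count; i++) {
                calendar[day++] = type;
            }
        }
        return calendar;
    }

    public static void main(String[] args) {
        int[] input = {5, 20, 5, 20, 3, 20, 3, 20};
        System.out.println(java.util.Arrays.toString(buildCalendar(input)));
        // [3, 3, 5, 5, 20, 20, 20, 20]
    }
}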
Finally, we can also consider quicksort with 3-way partitioning! Among comparison-based
sorting algorithms, this is the best suited to the situation where we have many duplicates
and t << n. After each partitioning step, all duplicates of the pivot are already in their
correct positions, and we are left with two significantly smaller subarrays to recurse on,
since the duplicates are not included in the recursive calls to quicksort. The sort uses at
most t distinct pivot values, hence the expected runtime is O(n log t).
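A sketch of quicksort with 3-way (Dutch national flag) partitioning; elements equal to the pivot land in their final positions and are excluded from both recursive calls.

import java.util.Arrays;
import java.util.concurrent.ThreadLocalRandom;

// A sketch of quicksort with 3-way partitioning.
public class ThreeWayQuicksort {
    static void sort(int[] a, int lo, int hi) {
        if (lo >= hi) return;

        // Random pivot for an expected O(n log t) running time.
        int pivot = a[ThreadLocalRandom.current().nextInt(lo, hi + 1)];

        int lt = lo, i = lo, gt = hi;
        while (i <= gt) {
            if (a[i] < pivot)      swap(a, lt++, i++);
            else if (a[i] > pivot) swap(a, i, gt--);
            else                   i++;
        }
        // a[lt..gt] now equals the pivot; recurse only on the two sides.
        sort(a, lo, lt - 1);
        sort(a, gt + 1, hi);
    }

    private static void swap(int[] a, int i, int j) {
        int tmp = a[i]; a[i] = a[j]; a[j] = tmp;
    }

    public static void main(String[] args) {
        int[] coupons = {5, 20, 5, 20, 3, 20, 3, 20};
        sort(coupons, 0, coupons.length - 1);
        System.out.println(Arrays.toString(coupons)); // [3, 3, 5, 5, 20, 20, 20, 20]
    }
}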
Problem 3. The Missing Element (OPTIONAL)
Let’s revisit the same old problem that we discussed at the beginning of the semester: finding
missing items in an array. Given n items in no particular order, but this time possibly with
duplicates, find the first missing number (if we were to start counting from 1), or output “all
present” if all values 1 to n are present in the input.
For example, given [8, 5, 3, 3, 2, 1, 5, 4, 2, 3, 3, 2, 1, 9], the first missing number here is 6.
Bonus (no need for hash functions): Can we do the same thing using O(1) extra space, i.e. in-place?
Problem 4. Data Structure 2.0 (OPTIONAL)
Implement a data structure RandomizedSet with the following operations:
1. RandomizedSet() which initializes the data structure.
2. Insert(val) which inserts an item val into the set if not present.
3. Remove(val) which removes the item val from the set if present.
4. GetRandom() which returns a random element from the current set of elements. Every
element must have an equal probability of being returned.
All these operations must work in expected O(1) time! Hint: a Hash Table might come in handy!
Assume that the maximum number of elements present in the RandomizedSet will never exceed
a reasonable number n.
Problem 5. Data Structure 3.0 (OPTIONAL)
Let’s try to improve a little on the data structures we’ve been using so far. Implement
a data structure with the following operations:
1. Insert in O(log n) time
2. Delete in O(log n) time
3. Lookup in O(1) time
4. Find successor and predecessor in O(1) time