Max Heap & Tree Structures Guide

This guide describes how to initialize a max heap from an input array by building the heap bottom-up. It shows the steps of moving to the lowest non-leaf node, comparing it to its children and swapping if needed, then continuing toward the root, heapifying at each stop along the way. The time complexity of this heap initialization is O(n), where n is the number of elements.


Initializing A Max Heap

input array = [-, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11]

[figures: the complete binary tree for this array, redrawn after each heapify step]

Start at the rightmost array position that has a child; its index is n/2. Heapify.
Move to the next lower array position and Heapify.
Continue moving down one array position at a time, heapifying at each stop (e.g., find a home for 2, then find a home for 1).
Done when the root has been heapified.
Initializing A Max Heap: Time Complexity

[figure: the initialized max heap]

Height of heap = h.
Number of subtrees with root at level j is <= 2^(j-1).
Time to heapify one such subtree is O(h-j+1).
Time for level j subtrees is <= 2^(j-1)(h-j+1) = t(j).
Total time is t(1) + t(2) + ... + t(h-1) = O(n).
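
A minimal C++ sketch of this bottom-up construction (the heap occupies heap[1..n], with the children of node i at 2i and 2i+1, matching the slides; the function names are my own):

#include <vector>
using std::vector;

// Sift heap[root] down until the max-heap property holds in its subtree.
void heapify(vector<int>& heap, int root, int n)
{
    int value = heap[root];
    int child = 2 * root;                 // left child of root
    while (child <= n) {
        if (child < n && heap[child + 1] > heap[child])
            ++child;                      // child is now the larger child
        if (value >= heap[child]) break;  // subtree is a max heap
        heap[root] = heap[child];         // move the larger child up
        root = child;
        child = 2 * root;
    }
    heap[root] = value;
}

// Bottom-up construction: heapify from the last non-leaf (n/2) to the root.
void initializeMaxHeap(vector<int>& heap, int n)
{
    for (int i = n / 2; i >= 1; --i)
        heapify(heap, i, n);              // O(n) total
}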
Leftist Trees
Linked binary tree.
Can do everything a heap can do and in the
same asymptotic complexity.
Can meld two leftist tree priority queues in
O(log n) time.
Extended Binary Trees

Start with any binary tree and add an external node wherever there is an empty subtree. The result is an extended binary tree.

[figures: a binary tree and the corresponding extended binary tree]

If the tree has n internal nodes, the number of external nodes is n+1.
The Function s()

For any node x in an extended binary tree, let s(x) be the length of a shortest path from x to an external node in the subtree rooted at x.

s() Values Example

[figure: extended binary tree with s() values at each node; external nodes carry 0]
Properties Of s()

If x is an external node, then s(x) = 0.
Otherwise, s(x) = min{s(leftChild(x)), s(rightChild(x))} + 1.
Height Biased Leftist Trees

A binary tree is a (height biased) leftist tree iff for every internal node x,
s(leftChild(x)) >= s(rightChild(x)).

A Leftist Tree

[figure: leftist tree labeled with s() values]
Leftist Trees--Property 1

In a leftist tree, the rightmost path is a shortest root-to-external-node path, and the length of this path is s(root).

[figure: the example leftist tree; the length of its rightmost path is 2]

Leftist Trees—Property 2

The number of internal nodes is at least 2^s(root) - 1, because levels 1 through s(root) have no external nodes. So, s(root) <= log2(n+1).

[figure: the example leftist tree, in which levels 1 and 2 have no external nodes]


Leftist Trees—Property 3

The length of the rightmost path is O(log n), where n is the number of nodes in a leftist tree.
Follows from Properties 1 and 2.


Leftist Trees As Priority Queues

Min leftist tree: a leftist tree that is a min tree. Used as a min priority queue.
Max leftist tree: a leftist tree that is a max tree. Used as a max priority queue.
A Min Leftist Tree

[figure: min leftist tree with root 2]
Some Min Leftist Tree Operations
empty()
size()
top()
push()
pop()
meld()
initialize()
push() and pop() use meld().
Push Operation

push(7)

[figures: the min leftist tree with root 2, before and during the push]

Create a single-node min leftist tree containing 7.
Meld the two min leftist trees.


Remove Min (pop)

[figures: the min leftist tree with root 2]

Remove the root.
Meld the two subtrees.
Meld Two Min Leftist Trees

[figures: two min leftist trees melded step by step]

Traverse only the rightmost paths so as to get logarithmic performance.
Repeatedly meld the right subtree of the tree with the smaller root and all of the other tree.
When the right subtree of the tree with the smaller root is empty, the result of that meld is simply the other tree.
Unwinding, make each melded subtree the right subtree of the smaller root, and swap the left and right subtrees whenever s(left) < s(right).
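
The meld just traced can be sketched recursively in C++; the node layout and names below are assumptions for illustration, not code from the slides:

#include <algorithm>

// Node of a min leftist tree; s is the length of a shortest path
// from the node to an external node.
struct Node
{
    int key, s;
    Node *left, *right;
    Node(int k) : key(k), s(1), left(nullptr), right(nullptr) {}
};

int sValue(Node* t) { return t ? t->s : 0; }

// Meld two min leftist trees, traversing only the rightmost paths.
Node* meld(Node* a, Node* b)
{
    if (!a) return b;
    if (!b) return a;                      // empty right subtree case
    if (b->key < a->key) std::swap(a, b);  // a has the smaller root
    a->right = meld(a->right, b);          // meld right subtree and other tree
    if (sValue(a->left) < sValue(a->right))
        std::swap(a->left, a->right);      // restore the leftist property
    a->s = sValue(a->right) + 1;
    return a;
}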
Initializing In O(n) Time
• create n single node min leftist trees
and place them in a FIFO queue
• repeatedly remove two min leftist trees
from the FIFO queue, meld them, and
put the resulting min leftist tree into the
FIFO queue
• the process terminates when only 1 min
leftist tree remains in the FIFO queue
• analysis is the same as for heap
initialization
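
A sketch of that queue-based initialization, reusing the Node and meld of the previous sketch (again an assumption, not the text's code):

#include <queue>
#include <vector>

// Build a min leftist tree from n keys in O(n) time.
Node* initialize(const std::vector<int>& keys)
{
    std::queue<Node*> q;
    for (int k : keys) q.push(new Node(k));  // n single-node trees
    while (q.size() > 1) {
        Node* a = q.front(); q.pop();        // remove two trees,
        Node* b = q.front(); q.pop();
        q.push(meld(a, b));                  // meld them, put the result back
    }
    return q.empty() ? nullptr : q.front();
}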
Tournament Trees

Winner trees.
Loser trees.
Winner Trees
Complete binary tree with n external
nodes and n - 1 internal nodes.
External nodes represent tournament
players.
Each internal node represents a match
played between its two children;
the winner of the match is stored at
the internal node.
Root has overall winner.
Winner Tree For 16 Players

[figure: winner tree; the 16 players are external nodes, each internal node is a match node]

Smaller element wins => min winner tree.
Height is log2 n (excludes player level).


Complexity Of Initialize

• O(1) time to play match at each match node.


• n - 1 match nodes.
• O(n) time to initialize n player winner tree.
Applications

Sorting.

Insert elements to be sorted into a winner tree.
Repeatedly extract the winner and replace by a large value.
Sort 16 Numbers

[figures: the winner tree after each extraction; the sorted array grows 1, 2, 2, 3, ...]

Repeatedly extract the winner into the sorted array, replace it in the tree by a large value, and replay the matches on the path to the root.
Time To Sort

• Initialize winner tree.


 O(n) time
• Remove winner and replay.
 O(log n) time
• Remove winner and replay n times.
 O(n log n) time
• Total sort time is O(n log n).
• Actually Theta(n log n).
Winner Tree Operations

• Initialize
 O(n) time
• Get winner
 O(1) time
• Remove/replace winner and replay
 O(log n) time
 more precisely Theta(log n)
Replace Winner And Replay

[figures: the winner tree before and after the replacement]

Replace the winner with 6.
Replay matches on the path to the root.
The opponent at a match node is the player who lost the last match played at that node.
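
A possible array-based sketch of a min winner tree with this replay (n a power of 2; the representation and names are my assumptions):

#include <vector>
using std::vector;

// Match nodes are node[1..n-1]; player i conceptually sits at position n+i.
struct WinnerTree
{
    int n;
    vector<int> node;    // node[j] = index of the winner at match node j
    vector<int> value;   // value[i] = current value of player i

    WinnerTree(const vector<int>& v)
        : n((int)v.size()), node(v.size()), value(v)
    {
        for (int j = n - 1; j >= 1; --j) play(j);
    }
    void play(int j)     // replay the match at node j
    {
        int l = 2 * j, r = 2 * j + 1;
        int lw = l >= n ? l - n : node[l];   // winner coming from left child
        int rw = r >= n ? r - n : node[r];   // winner coming from right child
        node[j] = value[lw] <= value[rw] ? lw : rw;
    }
    int winner() const { return node[1]; }

    // Replace player i's value and replay matches on the path to the root.
    void replace(int i, int newValue)
    {
        value[i] = newValue;
        for (int j = (n + i) / 2; j >= 1; j /= 2) play(j);
    }
};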


Loser Tree

Each match node stores the match loser rather than the match winner.
Min Loser Tree For 16 Players

[figures: the loser tree built level by level from the 16 players; the overall winner (1) is recorded above the root]
Complexity Of Loser Tree
Initialize

• One match at each match node.


• One store of a left child winner.
• Total time is O(n).
• More precisely Theta(n).
[figure: the loser tree after replacing the winner (1) with 9; replaying the matches on the path to the root makes 2 the new winner]

Replace winner with 9 and replay matches.


Complexity Of Replay

• One match at each level that has a match node.
• O(log n)
• More precisely Theta(log n).
More Tournament Tree Applications

• k-way merging of runs during an external merge sort
• Truck loading
Truck Loading

 n packages to be loaded into trucks


 each package has a weight
 each truck has a capacity of c tons
 minimize number of trucks
Truck Loading

n = 5 packages
weights [2, 5, 6, 3, 4]
truck capacity c = 10

Load packages from left to right. If a package doesn't fit into the current truck, start loading a new truck.
Truck Loading

n = 5 packages
weights [2, 5, 6, 3, 4]
truck capacity c = 10

truck1 = [2, 5]
truck2 = [6, 3]
truck3 = [4]
uses 3 trucks when 2 trucks suffice
Truck Loading

n = 5 packages
weights [2, 5, 6, 3, 4]
truck capacity c = 10

truck1 = [2, 5, 3]
truck2 = [6, 4]
Bin Packing

• n items to be packed into bins


• each item has a size
• each bin has a capacity of c
• minimize number of bins
Bin Packing

Truck loading is the same as bin packing: a truck is a bin that is to be packed (loaded), and a package is an item/element.
Bin packing to minimize the number of bins is NP-hard.
Several fast heuristics have been proposed.
Bin Packing Heuristics

• First Fit.
 Bins are arranged in left to right order.
 Items are packed one at a time in given order.
 Current item is packed into leftmost bin into
which it fits.
 If there is no bin into which current item fits,
start a new bin.
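
A direct C++ sketch of first fit (O(n·b) with b bins used; the tournament-tree speedup discussed later reduces this to O(n log n)); the names are illustrative assumptions:

#include <vector>
using std::vector;

// Pack each item into the leftmost bin it fits in; return the bins used.
int firstFit(const vector<int>& size, int capacity)
{
    vector<int> space;                    // remaining capacity of each bin
    for (int s : size) {
        bool packed = false;
        for (int& room : space)
            if (room >= s) { room -= s; packed = true; break; }
        if (!packed) space.push_back(capacity - s);   // start a new bin
    }
    return (int)space.size();
}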
First Fit

n = 4
weights = [4, 7, 3, 6]
capacity = 10

[figures: the bins after each packing step]

Pack the red item (4) into the first bin.
Pack the blue item (7) next; it doesn't fit, so start a new bin.
Pack the yellow item (3) into the first bin.
Pack the green item (6); it needs a new bin.
Not optimal: 2 bins suffice.
Bin Packing Heuristics

• First Fit Decreasing.


 Items are sorted into decreasing order.
 Then first fit is applied.
Bin Packing Heuristics

• Best Fit.
 Items are packed one at a time in given order.
 To determine the bin for an item, first
determine set S of bins into which the item fits.
 If S is empty, then start a new bin and put item
into this new bin.
 Otherwise, pack into bin of S that has least
available capacity.
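
Best fit can be sketched with an ordered multiset of remaining capacities, giving O(n log n); this sketch and its names are assumptions:

#include <set>
#include <vector>

// Pack each item into the fitting bin with the least available capacity.
int bestFit(const std::vector<int>& size, int capacity)
{
    std::multiset<int> space;              // remaining capacities, ordered
    for (int s : size) {
        auto it = space.lower_bound(s);    // least capacity >= s (the set S)
        if (it == space.end())
            space.insert(capacity - s);    // S is empty: start a new bin
        else {
            int room = *it - s;
            space.erase(it);               // erase this single occurrence
            space.insert(room);
        }
    }
    return (int)space.size();
}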
Bin Packing Heuristics

• Best Fit Decreasing.


 Items are sorted into decreasing order.
 Then best fit is applied.
Performance

• For first fit and best fit:
  Heuristic Bins <= (17/10)(Minimum Bins) + 2
• For first fit decreasing and best fit decreasing:
  Heuristic Bins <= (11/9)(Minimum Bins) + 4
Complexity Of First Fit

Use a max tournament tree in which the players are n bins and the value of a player is the available capacity in the bin.

O(n log n), where n is the number of items.
Binary Search Trees

• Dictionary Operations:
 find(key)
 insert(key, value)
 erase(key)
• Additional operations:
 ascend()
 get(index) (indexed binary search tree)
 delete(index) (indexed binary search tree)
Complexity Of Dictionary Operations
find(), insert() and erase()

Data Structure                Worst Case   Expected
Hash Table                    O(n)         O(1)
Binary Search Tree            O(n)         O(log n)
Balanced Binary Search Tree   O(log n)     O(log n)

n is the number of elements in the dictionary
Complexity Of Other Operations
ascend(), get(index), delete(index)

Data Structure         ascend           get and delete
Hash Table             O(D + n log n)   O(D + n log n)
Indexed BST            O(n)             O(n)
Indexed Balanced BST   O(n)             O(log n)

D is the number of buckets
Definition Of Binary Search Tree

• A binary tree.
• Each node has a (key, value) pair.
• For every node x, all keys in the left
subtree of x are smaller than that in x.
• For every node x, all keys in the right
subtree of x are greater than that in x.
Example Binary Search Tree

[figure: binary search tree with root 20]

Only keys are shown.


The Operation ascend()

[figure: the example binary search tree]

Do an inorder traversal. O(n) time.


The Operation find()

[figure: the example binary search tree]

Complexity is O(height) = O(n), where n is the number of nodes/elements.
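
A minimal sketch of find(); the node layout is assumed:

// Binary search tree node; only keys are shown, as in the figures.
struct BSTNode { int key; BSTNode *left, *right; };

// Return the node containing key, or nullptr. O(height) time.
BSTNode* find(BSTNode* t, int key)
{
    while (t && t->key != key)
        t = key < t->key ? t->left : t->right;
    return t;
}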
The Operation insert()

[figures: the tree after inserting keys 35, 7, and 18 in turn]

Insert a pair whose key is 35; then one whose key is 7; then one whose key is 18.
Complexity of insert() is O(height).


The Operation erase()

Three cases:
 Element is in a leaf.
 Element is in a degree 1 node.
 Element is in a degree 2 node.
Erase From A Leaf

[figures] Erase a leaf element, e.g., key = 7 or key = 35: simply remove the leaf.

Erase From A Degree 1 Node

[figures] Erase from a degree 1 node, e.g., key = 40 or key = 15: the lone subtree takes the erased node's place.

Erase From A Degree 2 Node

[figures] Erase from a degree 2 node, e.g., key = 10 or key = 20. Replace the erased element with the largest key in the left subtree (or the smallest in the right subtree). That largest key must be in a leaf or degree 1 node, so removing it reduces to one of the earlier cases.

Complexity is O(height).
Indexed Binary Search Tree

• Binary search tree.


• Each node has an additional field.
 leftSize = number of nodes in its left subtree
Example Indexed Binary Search Tree

[figure: the binary search tree with root 20; leftSize values are shown in red at each node]


leftSize And Rank
Rank of an element is its position in inorder
(inorder = ascending key order).
[2,6,7,8,10,15,18,20,25,30,35,40]
rank(2) = 0
rank(15) = 5
rank(20) = 7
leftSize(x) = rank(x) with respect to elements in
subtree rooted at x
leftSize And Rank

[figure: the indexed binary search tree]

sorted list = [2,6,7,8,10,15,18,20,25,30,35,40]


get(index) And delete(index)

[figure: the indexed binary search tree]

sorted list = [2,6,7,8,10,15,18,20,25,30,35,40]


get(index) And delete(index)

• if index = x.leftSize, the desired element is x.element
• if index < x.leftSize, the desired element is the index'th element in the left subtree of x
• if index > x.leftSize, the desired element is the (index - x.leftSize - 1)'th element in the right subtree of x
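
These three cases translate directly into a short iterative search; the node fields are assumptions:

// Indexed BST node: leftSize = number of nodes in its left subtree.
struct INode { int key, leftSize; INode *left, *right; };

// Return the node holding the index'th smallest key (index from 0).
INode* get(INode* x, int index)
{
    while (x) {
        if (index == x->leftSize) return x;    // found by rank
        if (index < x->leftSize)
            x = x->left;                       // rank lies in left subtree
        else {
            index -= x->leftSize + 1;          // skip left subtree and x
            x = x->right;
        }
    }
    return nullptr;
}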
Applications
(Complexities Are For Balanced Trees)

Best-fit bin packing in O(n log n) time.


Representing a linear list so that get(index),
insert(index, element), and erase(index)
run in O(log(list size)) time (uses an
indexed binary tree, not indexed binary
search tree).
Can’t use hash tables for either of these
applications.
Linear List As Indexed Binary Tree

[figure: indexed binary tree with leftSize values, root h, storing list = [a,b,c,d,e,f,g,h,i,j,k,l]]
insert(5,'m')

[figures: the tree before and after the insert; the list becomes [a,b,c,d,e,m,f,g,h,i,j,k,l]]

Find the node with element 4 (e).
Either add m as the right child of e, making the former right subtree of e the right subtree of m, or add m as the leftmost node in the right subtree of e.
insert(5,’m’)

• Other possibilities exist.


• Must update some leftSize values on path
from root to new node.
• Complexity is O(height).
Balanced Binary Search Trees

• height is O(log n), where n is the


number of elements in the tree
• AVL (Adelson-Velsky and Landis)
trees
• red-black trees
• find, insert, and erase take O(log n)
time
Balanced Binary Search Trees
• Indexed AVL trees
• Indexed red-black trees
• Indexed operations also take
O(log n) time
Balanced Search Trees
• weight balanced binary search trees
• 2-3 & 2-3-4 trees
• AA trees
• B-trees
• BBST
• etc.
AVL Tree
• binary tree
• More specifically it’s a “self-balancing
binary search tree”
• for every node x, define its balance factor
balance factor of x = height of left subtree of x
- height of right subtree of x
• balance factor of every node x is -1, 0, or 1
• rebalancing if necessary
Balance Factors

[figure: binary tree with a balance factor at each node]

This is an AVL tree.


Height

The height of an AVL tree that has n nodes is at most 1.44 log2(n+2).
The height of every n-node binary tree is at least log2(n+1).
AVL Search Tree

[figure: AVL search tree with root 10 and balance factors at each node]

insert(9)

[figure: the tree after inserting 9; balance factors are updated and no rebalancing is needed]

insert(29)

[figure: the tree after inserting 29; one node now has balance factor -2]

RR imbalance => the new node is in the right subtree of the right subtree of the node with bf = -2. An RR rotation restores balance.
AVL Rotations

• RR
• LL
• RL
• LR
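
As one concrete case, the RR imbalance in the insert(29) example is fixed by a single left rotation about the node with bf = -2; this sketch and its names are mine, and the resulting balance factors hold for the pure RR insertion case:

// AVL node with an explicit balance factor (height(left) - height(right)).
struct ANode { int key, bf; ANode *left, *right; };

// RR rotation: A has bf = -2 and the new node is in the right subtree of
// A's right child B. Returns the new subtree root B.
ANode* rotateRR(ANode* A)
{
    ANode* B = A->right;
    A->right = B->left;      // B's left subtree becomes A's right subtree
    B->left = A;             // A becomes B's left child
    A->bf = B->bf = 0;       // balance factors after a pure RR insertion
    return B;
}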
Red Black Trees

Colored Nodes Definition


• Binary search tree.
• Each node is colored red or black.
• Root and all external nodes are black.
• No root-to-external-node path has two
consecutive red nodes.
• All root-to-external-node paths have the
same number of black nodes
Example Red Black Tree

[figure: red-black tree with root 10]
Red Black Trees

Colored Edges Definition


• Binary search tree.
• Child pointers are colored red or black.
• Pointer to an external node is black.
• No root to external node path has two
consecutive red pointers.
• Every root to external node path has the
same number of black pointers.
Example Red Black Tree

[figure: the same tree with colored pointers]
Red Black Tree

• The height of a red-black tree that has n (internal) nodes is between log2(n+1) and 2 log2(n+1).
• C++ STL class map => red-black tree
• Java standard library TreeMap => red-black tree
Graphs

• G = (V,E)
• V is the vertex set.
• Vertices are also called nodes and points.
• E is the edge set.
• Each edge connects two different vertices.
• Edges are also called arcs and lines.
• Directed edge has an orientation (u,v).
u v
Graphs

• Undirected edge has no orientation (u,v).


u v

• Undirected graph => no oriented edge.


• Directed graph => every edge has an
orientation.
Undirected Graph

[figure: undirected graph on vertices 1-11]

Directed Graph (Digraph)

[figure: directed graph on vertices 1-11]
Applications—Communication Network

[figure: graph on vertices 1-11]

• Vertex = city, edge = communication link.


Driving Distance/Time Map

[figure: weighted graph on vertices 1-11]

• Vertex = city, edge weight = driving distance/time.
Street Map

[figure: directed graph on vertices 1-11]

• Some streets are one way.


Complete Undirected Graph

Has all possible edges.

[figures: complete undirected graphs for n = 1, 2, 3, 4]


Number Of Edges—Undirected Graph

• Each edge is of the form (u,v), u != v.


• Number of such pairs in an n vertex graph is
n(n-1).
• Since edge (u,v) is the same as edge (v,u),
the number of edges in a complete
undirected graph is n(n-1)/2.
• Number of edges in an undirected graph is
<= n(n-1)/2.
Number Of Edges—Directed Graph

• Each edge is of the form (u,v), u != v.


• Number of such pairs in an n vertex graph is
n(n-1).
• Since edge (u,v) is not the same as edge
(v,u), the number of edges in a complete
directed graph is n(n-1).
• Number of edges in a directed graph is <=
n(n-1).
Vertex Degree

[figure: undirected graph on vertices 1-11]

Number of edges incident to the vertex.
degree(2) = 2, degree(5) = 3, degree(3) = 1
Sum Of Vertex Degrees

[figure]

Sum of degrees = 2e (e is the number of edges).


In-Degree Of A Vertex

[figure: digraph on vertices 1-11]

In-degree is the number of incoming edges.
indegree(2) = 1, indegree(8) = 0
Out-Degree Of A Vertex

[figure: the same digraph]

Out-degree is the number of outbound edges.
outdegree(2) = 1, outdegree(8) = 2
Sum Of In- And Out-Degrees

Each edge contributes 1 to the in-degree of some vertex and 1 to the out-degree of some other vertex.

Sum of in-degrees = sum of out-degrees = e, where e is the number of edges in the digraph.
Graph Operations And
Representation
Sample Graph Problems

• Path problems.
• Connectedness problems.
• Spanning tree problems.
• Graph coloring problems.
• Flow network problems.
Path Finding

[figure: weighted graph] A path between 1 and 8; path length is 20.

Another Path Between 1 and 8

[figure] Path length is 28.

Example Of No Path

[figure] No path between 2 and 9.


Connected Graph

• Undirected graph.
• There is a path between every pair of
vertices.
Example Of Not Connected

[figure: graph with two components]

Connected Graph Example

[figure: a connected graph]
Connected Components

• A subgraph in which any two vertices are


connected to each other by paths
• It is connected to no additional vertices in
the supergraph
• A vertex with no incident edges is itself a
connected component
Connected Components

[figure: graph on vertices 1-11 with its connected components highlighted]
Connected Component

• A maximal subgraph that is connected.


 Cannot add vertices and edges from original
graph and retain connectedness.
• A connected graph has exactly 1
component.
Not A Component

[figure: a connected subgraph that is not maximal]
Connected Components

• Several algorithms exists


• A straightforward approach:
– To find all the connected components of a
graph, loop through its vertices, starting a new
breadth first or depth first search whenever the
loop reaches a vertex that has not already been
included in a previously found connected
component.
Communication Network

[figure: graph on vertices 1-11]

Each edge is a link that can be constructed (i.e., a feasible link).
Communication Network Problems

• Is the network connected?


 Can we communicate between every pair of
cities?
• Find the components.
• Want to construct smallest number of
feasible links so that resulting network is
connected.
Cycles And Connectedness

[figure] Removal of an edge that is on a cycle does not affect connectedness.

[figure] A connected subgraph with all vertices and a minimum number of edges has no cycles.
Tree

• Connected graph that has no cycles.


• n vertex connected graph with n-1 edges.
Spanning Tree

• Subgraph that includes all vertices of the


original graph.
• Subgraph is a tree.
 If original graph has n vertices, the spanning
tree has n vertices and n-1 edges.
Minimum Cost Spanning Tree

[figure: weighted graph on 8 vertices]

• Tree cost is the sum of edge weights/costs.

A Spanning Tree

[figure] Spanning tree cost = 51.

Minimum Cost Spanning Tree

[figure] Spanning tree cost = 41.

A Wireless Broadcast Tree

[figure] Source = 1, weights = needed power.
Cost = 4 + 8 + 5 + 6 + 7 + 8 + 3 = 41.
Graph Representation

• Adjacency Matrix
• Adjacency Lists
 Linked Adjacency Lists
 Array Adjacency Lists
Adjacency Matrix
• 0/1 n x n matrix, where n = # of vertices
• A(i,j) = 1 iff (i,j) is an edge

[figure: undirected graph on vertices 1-5 and its adjacency matrix]

     1  2  3  4  5
 1   0  1  0  1  0
 2   1  0  0  0  1
 3   0  0  0  0  1
 4   1  0  0  0  1
 5   0  1  1  1  0
Adjacency Matrix Properties

[figure: the same graph and matrix]

• Diagonal entries are zero.
• The adjacency matrix of an undirected graph is symmetric: A(i,j) = A(j,i) for all i and j.
Adjacency Matrix (Digraph)

     1  2  3  4  5
 1   0  0  0  1  0
 2   1  0  0  0  1
 3   0  0  0  0  0
 4   0  0  0  0  1
 5   0  1  1  0  0

• Diagonal entries are zero.
• The adjacency matrix of a digraph need not be symmetric.
Adjacency Matrix

• n^2 bits of space
• For an undirected graph, may store only the lower or upper triangle (exclude diagonal): (n-1)n/2 bits
• O(n) time to find the vertex degree and/or the vertices adjacent to a given vertex.
Adjacency Lists
• The adjacency list for vertex i is a linear list of vertices adjacent from vertex i.
• An array of n adjacency lists.

[figure: graph on vertices 1-5]

aList[1] = (2,4)
aList[2] = (1,5)
aList[3] = (5)
aList[4] = (5,1)
aList[5] = (2,4,3)
Linked Adjacency Lists
• Each adjacency list is a chain.

aList[1] -> 2 -> 4
aList[2] -> 1 -> 5
aList[3] -> 5
aList[4] -> 5 -> 1
aList[5] -> 2 -> 4 -> 3

Array length = n
# of chain nodes = 2e (undirected graph)
# of chain nodes = e (digraph)
Array Adjacency Lists
• Each adjacency list is an array list.

aList[1] = [2,4], aList[2] = [1,5], aList[3] = [5], aList[4] = [5,1], aList[5] = [2,4,3]

Array length = n
# of list elements = 2e (undirected graph)
# of list elements = e (digraph)
Weighted Graphs

• Cost adjacency matrix.


 C(i,j) = cost of edge (i,j)
• Adjacency lists => each list element is a
pair (adjacent vertex, edge weight)
Number Of C++ Classes Needed
• Graph representations
 Adjacency Matrix
 Adjacency Lists
Linked Adjacency Lists
Array Adjacency Lists
 3 representations
• Graph types
 Directed and undirected.
 Weighted and unweighted.
 2 x 2 = 4 graph types
• 3 x 4 = 12 C++ classes
Abstract Class Graph
template<class T>
class graph
{
public:
// ADT methods come here

// implementation independent methods come here


};
Abstract Methods Of Graph

// ADT methods
virtual ~graph() {}
virtual int numberOfVertices() const = 0;
virtual int numberOfEdges() const = 0;
virtual bool existsEdge(int, int) const = 0;
virtual void insertEdge(edge<T>*) = 0;
virtual void eraseEdge(int, int) = 0;
virtual int degree(int) const = 0;
virtual int inDegree(int) const = 0;
virtual int outDegree(int) const = 0;
Abstract Methods Of Graph

// ADT methods (continued)


virtual bool directed() const = 0;
virtual bool weighted() const = 0;
virtual vertexIterator<T>* iterator(int) = 0;
virtual void output(ostream&) const = 0;
Graph Search Methods
• A vertex u is reachable from vertex v iff there is a path from v to u.

[figure: graph on vertices 1-11]

• A search method starts at a given vertex v and visits/labels/marks every vertex that is reachable from v.

[figure]
Graph Search Methods
• Many graph problems solved using a search
method.
 Path from one vertex to another.
 Is the graph connected?
 Find a spanning tree.
 Etc.
• Commonly used search methods:
 Breadth-first search. (BFS)
 Depth-first search. (DFS)
Breadth-First Search

• Visit start vertex and put into a FIFO queue.


• Repeatedly remove a vertex from the queue, visit
its unvisited adjacent vertices, put newly visited
vertices into the queue.
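
A sketch of this procedure on adjacency lists (container choices and names are assumptions):

#include <queue>
#include <vector>
using std::vector;

// Breadth-first search from v; marks every vertex reachable from v.
void bfs(int v, const vector<vector<int>>& adj, vector<bool>& visited)
{
    std::queue<int> q;
    visited[v] = true;             // visit the start vertex
    q.push(v);
    while (!q.empty()) {
        int u = q.front(); q.pop();
        for (int w : adj[u])       // vertices adjacent from u
            if (!visited[w]) {
                visited[w] = true; // newly visited vertex
                q.push(w);
            }
    }
}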
Breadth-First Search Example

[figures: graph on vertices 1-11, with the FIFO queue shown after each step]

Start the search at vertex 1. Visit/mark/label the start vertex and put it in a FIFO queue.
Repeatedly remove a vertex from the queue (here 1, 2, 4, 5, 3, 6, 9, 7, 8 in turn), visit its adjacent unvisited vertices, and put them in the queue.
When the queue is empty, the search terminates.


Breadth-First Search Property

All vertices reachable from the start vertex (including the start vertex) are visited.
Time Complexity
• Each visited vertex is put on (and so removed from) the queue exactly once.
• When a vertex is removed from the queue, we examine its adjacent vertices.
   O(n) if adjacency matrix used
   O(vertex degree) if adjacency lists used
• Total time
   O(mn), where m is the number of vertices in the component that is searched (adjacency matrix)
   O(n + sum of component vertex degrees) = O(n + number of edges in component) (adjacency lists)

In other words: O(|V| + |E|)
Path From Vertex v To Vertex u

• Start a breadth-first search at vertex v.
• Terminate when vertex u is visited or when Q becomes empty (whichever occurs first).
• Time
   O(n^2) when adjacency matrix used
   O(n+e) when adjacency lists used (e is the number of edges)
   Finds the shortest hop-count path (unweighted graph)
Is The Graph Connected?
• Start a breadth-first search at any vertex of the graph.
• The graph is connected iff all n vertices get visited.
• Time
   O(n^2) when adjacency matrix used
   O(n+e) when adjacency lists used (e is the number of edges)
Connected Components
• Start a breadth-first search at any as yet
unvisited vertex of the graph.
• Newly visited vertices (plus edges between
them) define a component.
• Repeat until all vertices are visited.
Connected Components

[figure: graph on vertices 1-11 with two components]
Time Complexity
 O(n^2) when adjacency matrix used
 O(n+e) when adjacency lists used (e is the number of edges)
Spanning Tree

[figure: breadth-first search from vertex 1 and the resulting breadth-first spanning tree]
Spanning Tree
• Start a breadth-first search at any vertex of the graph.
• If the graph is connected, the n-1 edges used to get to unvisited vertices define a spanning tree (breadth-first spanning tree).
• Time
   O(n^2) when adjacency matrix used
   O(n+e) when adjacency lists used (e is the number of edges)
Depth-First Search

// reached[] marks visited vertices; adj[] holds the adjacency lists
void depthFirstSearch(int v)
{
    reached[v] = true;            // label vertex v as reached
    for (int u : adj[v])          // each vertex adjacent from v
        if (!reached[u])          // not yet reached
            depthFirstSearch(u);
}
Depth-First Search Example

[figures: graph on vertices 1-11, labeled step by step]

Start the search at vertex 1: label 1 and do a depth-first search from either 2 or 4; suppose vertex 2 is selected.
Label 2 and recurse from 5 (one of 3, 5, 6); label 5 and recurse from 9 (one of 3, 7, 9); label 9 and recurse from 8 (one of 6, 8).
Label 8 and return to 9; from 9 do dfs(6); label 6 and recurse from 4 (one of 4, 7); label 4 and return to 6; from 6 do dfs(7); label 7 and return to 6, then to 9, then to 5.
From 5 do dfs(3); label 3 and return to 5, then to 2, then to 1, and finally return to the invoking method.


Depth-First Search Properties
• Same complexity as BFS.
• Same properties with respect to path
finding, connected components, and
spanning trees.
• Edges used to reach unlabeled vertices
define a depth-first spanning tree when the
graph is connected.
• There are problems for which bfs is better
than dfs and vice versa.
Algorithm Design Methods

• Greedy method.
• Divide and conquer.
• Dynamic Programming.
• Backtracking.
• Branch and bound.
Some Methods Not Covered

• Linear Programming.
• Integer Programming.
• Simulated Annealing.
• Neural Networks.
• Genetic Algorithms.
• Tabu Search.
Optimization Problem

A problem in which some function (called the


optimization or objective function) is to be
optimized (usually minimized or
maximized) subject to some constraints.
Machine Scheduling

Find a schedule that minimizes the finish time.


• optimization function … finish time
• constraints
 each job is scheduled continuously on a single machine
for an amount of time equal to its processing requirement
 no machine processes more than one job at a time
Bin Packing

Pack items into bins using the fewest number of


bins.
• optimization function … number of bins
• constraints
 each item is packed into a single bin
 the capacity of no bin is exceeded
Min Cost Spanning Tree

Find a spanning tree that has minimum cost.


• optimization function … sum of edge costs
• constraints
 must select n-1 edges of the given n vertex graph
 the selected edges must form a tree
Feasible And Optimal Solutions
A feasible solution is a solution that satisfies
the constraints.

An optimal solution is a feasible solution that


optimizes the objective/optimization
function.
e.g. Earliest finish time, fewest number of
bins, lowest cost, etc.
Greedy Method

• Solve problem by making a sequence of


decisions.
• Decisions are made one by one in some
order.
• Each decision is made using a greedy
criterion.
• A decision, once made, is (usually) not
changed later.
Greedy Method
• Make the locally optimal choice at each stage
– Greedy algorithms are “short-sighted”
• Hoping to find the global optimum
• Can result in locally optimal solutions:

[figure: two intervals, Start1..end1 and Start2..end2]


Greedy Method
• Optimal substructure:
– A problem exhibits optimal substructure if an
optimal solution to the problem contains
optimal solutions to the sub-problems. *

• Greedy algorithms usually fail to find the


optimal solution
– But still Useful:
• Usually fast and simple
• Often produce good approximations
Machine Scheduling
LPT Scheduling.
• Schedule jobs one by one and in decreasing order
of processing time.
• Each job is scheduled on the machine on which it
finishes earliest.
• Scheduling decisions are made serially using a
greedy criterion (minimize finish time of this job).
• LPT scheduling is an application of the greedy
method.
LPT Schedule
• LPT rule does not guarantee minimum finish
time schedules.
• (LPT Finish Time)/(Minimum Finish Time) <= 4/3 - 1/(3m)
where m is number of machines
• Minimum finish time scheduling is NP-hard.
• In this case, the greedy method does not work.
• The greedy method does, however, give us a
good heuristic for machine scheduling.
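
An LPT sketch using a min heap of machine finish times (a plausible implementation, not the text's code):

#include <algorithm>
#include <functional>
#include <queue>
#include <vector>
using std::vector;

// Schedule jobs in decreasing processing time, each on the machine that
// currently finishes earliest. Returns the overall finish time.
int lptFinishTime(vector<int> time, int m)
{
    std::sort(time.begin(), time.end(), std::greater<int>());
    std::priority_queue<int, vector<int>, std::greater<int>> finish;
    for (int i = 0; i < m; ++i) finish.push(0);  // all machines start idle
    for (int t : time) {
        int f = finish.top(); finish.pop();      // earliest-finishing machine
        finish.push(f + t);                      // greedily place this job
    }
    int makespan = 0;                            // last popped = max finish
    while (!finish.empty()) { makespan = finish.top(); finish.pop(); }
    return makespan;
}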
Container Loading

• Ship has capacity c.


• m containers are available for loading.
• Weight of container i is wi.
• Each weight is a positive number.
• Sum of container weights > c.
• Load as many containers as is possible
without sinking the ship.
Greedy Solution

• Load containers in increasing order of


weight until we get to a container that
doesn’t fit.
• Does this greedy algorithm always load the
maximum number of containers?
• Yes. May be proved using a proof by
induction (see text).
Container Loading With 2 Ships

Can all containers be loaded into 2 ships whose


capacity is c (each)?
• Same as bin packing with 2 bins.
 Are 2 bins sufficient for all items?
• Same as machine scheduling with 2 machines.
 Can all jobs be completed by 2 machines in c time
units?
• NP-hard.
0/1 Knapsack Problem
• Hiker wishes to take n items on a trip.
• The weight of item i is wi.
• The items are to be carried in a knapsack whose
weight capacity is c.
• When sum of item weights <= c, all n items can
be carried in the knapsack.
• When sum of item weights > c, some items must
be left behind.
• Which items should be taken/left?
0/1 Knapsack Problem
• Hiker assigns a profit/value pi to item i.
• All weights and profits are positive numbers.
• Hiker wants to select a subset of the n items to take.
 The weight of the subset should not exceed the
capacity of the knapsack. (constraint)
 Cannot select a fraction of an item. (constraint)
 The profit/value of the subset is the sum of the
profits of the selected items. (optimization function)
 The profit/value of the selected subset should be
maximum. (optimization criterion)
0/1 Knapsack Problem
Let x_i = 1 when item i is selected and let x_i = 0 when item i is not selected.

maximize   sum_{i=1..n} p_i x_i
subject to sum_{i=1..n} w_i x_i <= c
           and x_i = 0 or 1 for all i
Greedy Attempt 1
Be greedy on capacity utilization.
 Select items in increasing order of weight.
n = 2, c = 7
w = [3, 6]
p = [2, 10]
only item 1 is selected
profit (value) of selection is 2
not best selection!
Greedy Attempt 2
Be greedy on profit earned.
 Select items in decreasing order of profit.
n = 3, c = 7
w = [7, 3, 2]
p = [10, 8, 6]
only item 1 is selected
profit (value) of selection is 10
not best selection!
Greedy Attempt 3
Be greedy on profit density (p/w).
 Select items in decreasing order of profit density.
n = 2, c = 7
w = [1, 7]
p = [10, 20]
only item 1 is selected
profit (value) of selection is 10
not best selection!
Greedy Attempt 3
Be greedy on profit density (p/w).
 Works when selecting a fraction of an item is
permitted
 Select items in decreasing order of profit density, if
next item doesn’t fit take a fraction so as to fill
knapsack.
n = 2, c = 7
w = [1, 7]
p = [10, 20]
item 1 and 6/7 of item 2 are selected
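
A sketch of this fractional (continuous) knapsack greedy, which is optimal when fractions are allowed; the names are mine:

#include <algorithm>
#include <vector>
using std::vector;

// Greedy on profit density p/w, taking a fraction of the last item if needed.
double continuousKnapsack(const vector<double>& w, const vector<double>& p,
                          double c)
{
    int n = (int)w.size();
    vector<int> order(n);
    for (int i = 0; i < n; ++i) order[i] = i;
    std::sort(order.begin(), order.end(),             // decreasing p/w
              [&](int a, int b) { return p[a] / w[a] > p[b] / w[b]; });
    double profit = 0;
    for (int i : order) {
        if (c <= 0) break;                            // knapsack is full
        double take = std::min(w[i], c);              // whole item or fraction
        profit += p[i] * take / w[i];
        c -= take;
    }
    return profit;
}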
0/1 Knapsack Greedy Heuristics

• Select a subset with <= k items.


• If the weight of this subset is > c, discard
the subset.
• If the subset weight is <= c, fill as much of
the remaining capacity as possible by being
greedy on profit density.
• Try all subsets with <= k items and select
the one that yields maximum profit.
0/1 Knapsack Greedy Heuristics
• (best value - greedy value)/(best value) <= 1/(k+1)

 k    0%    1%    5%    10%   25%
 0    239   390   528   583   600
 1    360   527   598   600
 2    483   581   600

Number of solutions (out of 600) within x% of best.


0/1 Knapsack Greedy Heuristics

• First sort into decreasing order of profit density.
• There are O(n^k) subsets with at most k items.
• Trying a subset takes O(n) time.
• Total time is O(n^(k+1)) when k > 0.
• (best value - greedy value)/(best value) <= 1/(k+1)
Shortest Path Problems

• Directed weighted graph.


• Path length is sum of weights of edges on path.
• The vertex at which the path begins is the
source vertex.
• The vertex at which the path ends is the
destination vertex.
Optimal Substructure
• If the shortest route from Seattle to Los
Angeles passes through Portland and then
Sacramento, then the shortest route from
Portland to Los Angeles must pass through
Sacramento too
• The problem of how to get from Portland to
Los Angeles is nested inside the problem of
how to get from Seattle to Los Angeles.
Example

[figure: weighted digraph on vertices 1-7]

A path from 1 to 7; path length is 14.

Another path from 1 to 7; path length is 11.
Shortest Path Problems

• Single source single destination.


• Single source all destinations.
• All pairs (every vertex is a source
and destination).
Single Source Single Destination

Possible greedy algorithm:


 Leave source vertex using cheapest/shortest edge.
 Leave new vertex using cheapest edge subject to the
constraint that a new vertex is reached.
 Continue until destination is reached.
Greedy Shortest 1 To 7 Path

[figure: the path chosen by the greedy rule]

Path length is 12. Not the shortest path; the algorithm doesn’t work!
Single Source All Destinations
Need to generate up to n (n is number of vertices)
paths (including path from source to itself).
Greedy method:
 Construct these up to n paths in order of increasing
length.
 Assume edge costs (lengths) are >= 0.
 So, no path has length < 0.
 First shortest path is from the source vertex to itself.
The length of this path is 0.
Greedy Single Source All Destinations

[figure: the weighted digraph]

Path         Length
1            0
1 3          2
1 3 5        5
1 2          6
1 3 5 4      9
1 3 6        10
1 3 6 7      11

• Each path (other than the first) is a one-edge extension of a previous path.
• The next shortest path is the shortest one-edge extension of an already generated shortest path.
Greedy Single Source All Destinations

• Let d(i) (distanceFromSource(i)) be the length of


a shortest one edge extension of an already
generated shortest path, the one edge extension
ends at vertex i.
• The next shortest path is to an as yet unreached
vertex for which the d() value is least.
• Let p(i) (predecessor(i)) be the vertex just before
vertex i on the shortest one edge extension to i.
Greedy Single Source All Destinations

[figures: the digraph with the d() and p() arrays after each step]

With source 1, initially d = [0, 6, 2, 16, -, -, 14] and p = [-, 1, 1, 1, -, -, 1].
Each time the unreached vertex with the least d() value is selected (3, 5, 2, 4, 6 in turn), the d() and p() values of its unreached neighbors are updated, ending with d = [0, 6, 2, 9, 5, 10, 11].

https://en.wikipedia.org/wiki/Dijkstra%27s_algorithm
Single Source Single Destination

Terminate single source all destinations


greedy algorithm as soon as shortest path to
desired vertex has been generated.
Data Structures For Dijkstra’s Algorithm
• The greedy single source all destinations
algorithm is known as Dijkstra’s algorithm.
• Implement d() and p() as 1D arrays.
• Keep a linear list L of reachable vertices to
which shortest path is yet to be generated.
• Select and remove vertex v in L that has smallest
d() value.
• Update d() and p() values of vertices adjacent to
v.
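
A linear-list sketch matching that description (adjacency-matrix costs, d() and p() as arrays, O(n^2) overall); the interface is an assumption:

#include <vector>
using std::vector;
const int INF = 1 << 30;   // stands in for "no edge" / "unreached"

// cost[u][v] = length of edge (u,v), INF if absent; vertices 0..n-1.
void dijkstra(const vector<vector<int>>& cost, int s,
              vector<int>& d, vector<int>& p)
{
    int n = (int)cost.size();
    d.assign(n, INF); p.assign(n, -1);
    vector<bool> reached(n, false);
    d[s] = 0;
    for (int step = 0; step < n; ++step) {
        int v = -1;                          // unreached vertex with least d()
        for (int i = 0; i < n; ++i)
            if (!reached[i] && (v == -1 || d[i] < d[v])) v = i;
        if (v == -1 || d[v] == INF) break;   // remaining vertices unreachable
        reached[v] = true;
        for (int u = 0; u < n; ++u)          // update one-edge extensions
            if (!reached[u] && cost[v][u] < INF && d[v] + cost[v][u] < d[u]) {
                d[u] = d[v] + cost[v][u];
                p[u] = v;
            }
    }
}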
Note on Dijkstra’s
• Directed weighted graph.
• Does not work with a negative cycle.
• In fact, Dijkstra’s prohibits negative edges!
• Shortest path on a graph with a negative cycle is undefined.
Complexity
• O(n) to select the next destination vertex.
• O(out-degree) to update d() and p() values when adjacency lists are used.
• O(n) to update d() and p() values when the adjacency matrix is used.
• Selection and update are done once for each vertex to which a shortest path is found.
• Total time is O(n^2 + e) = O(n^2).
Complexity
• When a min heap of d() values is used in place of the linear list L of reachable vertices, the total time is O((n+e) log n), because O(n) remove-min operations and O(e) change-key (d() value) operations are done.
• When e is O(n^2), using a min heap is worse than using a linear list.
• When a Fibonacci heap is used, the total time is O(n log n + e).
– This is asymptotically the fastest known single-source shortest-path algorithm for arbitrary directed graphs with unbounded non-negative weights. *
Minimum-Cost Spanning Tree

• weighted connected undirected graph


• spanning tree
• cost of spanning tree is sum of edge costs
• find spanning tree that has minimum cost
Example

[figure: weighted graph on 8 vertices with 10 edges]

• The network has 10 edges.
• A spanning tree has only n - 1 = 7 edges.
• Need to either select 7 edges or discard 3.
Edge Selection Greedy Strategies
• Start with an n-vertex 0-edge forest.
Consider edges in ascending order of cost.
Select edge if it does not form a cycle
together with already selected edges.
 Kruskal’s method.
• Start with a 1-vertex tree and grow it into an
n-vertex tree by repeatedly adding a vertex
and an edge. When there is a choice, add a
least cost edge.
 Prim’s method.
Edge Selection Greedy Strategies
• Start with an n-vertex forest. Each
component/tree selects a least cost edge to
connect to another component/tree.
Eliminate duplicate selections and possible
cycles. Repeat until only 1 component/tree
is left.
 Sollin’s method.
Edge Rejection Greedy Strategies
• Start with the connected graph. Repeatedly
find a cycle and eliminate the highest cost
edge on this cycle. Stop when no cycles
remain.
• Consider edges in descending order of cost.
Eliminate an edge provided this leaves
behind a connected graph.
Kruskal’s Method

[figures: the weighted graph and the growing forest after each edge is considered]

• Start with a forest that has no edges; consider edges in ascending order of cost.
• Edge (1,2) is considered first and added to the forest; edges (7,8), (3,4), (5,6), and (2,3) are considered next and added.
• Edge (1,3) is considered next and rejected because it creates a cycle; edge (2,4) is likewise rejected.
• Edge (3,5) is considered next and added; edge (3,6) is rejected; edge (5,7) is added.
• n - 1 edges have been selected and no cycle formed, so we have a spanning tree. Its cost is 46.
• The min-cost spanning tree is unique when all edge costs are different.
Prim’s Method

[figures]

• Start with any single-vertex tree.
• Get a 2-vertex tree by adding a cheapest edge; get a 3-vertex tree by adding a cheapest edge.
• Grow the tree one edge at a time until the tree has n - 1 edges (and hence has all n vertices).
Sollin’s Method

[figures]

• Start with a forest that has no edges.
• Each component selects a least-cost edge with which to connect to another component.
• Duplicate selections are eliminated; cycles are possible when the graph has some edges of equal cost.
• Each component that remains again selects a least-cost edge with which to connect to another component; beware of duplicate selections and cycles.
Greedy Minimum-Cost Spanning Tree Methods

• Can prove that all result in a minimum-cost spanning tree.
• Prim’s method is fastest:
   O(n^2) using an implementation similar to that of Dijkstra’s shortest-path algorithm.
   O(e + n log n) using a Fibonacci heap.
• Kruskal’s uses union-find trees to run in O(n + e log e) time.
Pseudocode For Kruskal’s Method

Start with an empty set T of edges.


while (E is not empty && |T| != n-1)
{
Let (u,v) be a least-cost edge in E.
E = E - {(u,v)}. // delete edge from E
if ((u,v) does not create a cycle in T)
Add edge (u,v) to T.
}
if (| T | == n-1) T is a min-cost spanning tree.
else Network has no spanning tree.
Data Structures For Kruskal’s Method

Edge set E.
Operations are:
 Is E empty?
 Select and remove a least-cost edge.
Use a min heap of edges.
 Initialize. O(e) time.
 Remove and return least-cost edge. O(log e) time.
Data Structures For Kruskal’s Method

Set of selected edges T.


Operations are:
 Does T have n - 1 edges?
 Does the addition of an edge (u, v) to T result in a
cycle?
 Add an edge to T.
Data Structures For Kruskal’s Method
Use an array linear list for the edges of T.
 Does T have n - 1 edges?
• Check size of linear list. O(1) time.
 Does the addition of an edge (u, v) to T result in a
cycle?
• Not easy.
 Add an edge to T.
• Add at right end of linear list. O(1) time.
Just use an array rather than arrayList.
Data Structures For Kruskal’s Method
Does the addition of an edge (u, v) to T result in a cycle?

[figure: the forest T]

• Each component of T is a tree.
• When u and v are in the same component, the addition of the edge (u,v) creates a cycle.
• When u and v are in different components, the addition of the edge (u,v) does not create a cycle.
Data Structures For Kruskal’s Method

[figure: the forest T]

• Each component of T is defined by the vertices in the component.
• Represent each component as a set of vertices: {1, 2, 3, 4}, {5, 6}, {7, 8}.
• Two vertices are in the same component iff they are in the same set of vertices.
Data Structures For Kruskal’s Method

[figure]

• When an edge (u, v) is added to T, the two components that have vertices u and v combine to become a single component.
• In our set representation of components, the set that has vertex u and the set that has vertex v are united: {1, 2, 3, 4} + {5, 6} => {1, 2, 3, 4, 5, 6}.
Data Structures For Kruskal’s Method
• Initially, T is empty.

[figure: 8 isolated vertices]

• Initial sets are {1} {2} {3} {4} {5} {6} {7} {8}.
• Does the addition of an edge (u, v) to T result in a cycle? If not, add the edge to T.
  s1 = find(u); s2 = find(v);
  if (s1 != s2) union(s1, s2);
Data Structures For Kruskal’s Method
• Use fastUnionFind.
• Initialize.
 O(n) time.
• At most 2e finds and n-1 unions.
 Very close to O(n + e).
• Min heap operations to get edges in
increasing order of cost take O(e log e).
• Overall complexity of Kruskal’s method is
O(n + e log e).
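
A compact sketch of the union-find structure this relies on (union by size plus path compression, the usual "fastUnionFind" behavior; the details are assumptions):

#include <utility>
#include <vector>
using std::vector;

struct UnionFind
{
    vector<int> parent, size;
    UnionFind(int n) : parent(n), size(n, 1)
    {
        for (int i = 0; i < n; ++i) parent[i] = i;   // n singleton sets
    }
    int find(int x)                  // root of x's set, compressing the path
    {
        while (parent[x] != x) {
            parent[x] = parent[parent[x]];
            x = parent[x];
        }
        return x;
    }
    bool unite(int u, int v)         // false => u,v already in one component
    {
        int a = find(u), b = find(v);
        if (a == b) return false;    // edge (u,v) would create a cycle
        if (size[a] < size[b]) std::swap(a, b);
        parent[b] = a;               // attach the smaller tree to the larger
        size[a] += size[b];
        return true;
    }
};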
Divide And Conquer

• Distinguish between small and large instances.


• Small instances solved differently from large ones.
Small And Large Instance

• Small instance.
 Sort a list that has n <= 10 elements.
 Find the minimum of n <= 2 elements.
• Large instance.
 Sort a list that has n > 10 elements.
 Find the minimum of n > 2 elements.
Solving A Small Instance
• A small instance is solved using some
direct/simple strategy.
 Sort a list that has n <= 10 elements.
• Use count, insertion, bubble, or selection sort.
 Find the minimum of n <= 2 elements.
• When n = 0, there is no minimum element.
• When n = 1, the single element is the minimum.
• When n = 2, compare the two elements and
determine which is smaller.
Solving A Large Instance
• A large instance is solved as follows:
 Divide the large instance into k >= 2 smaller instances.
 Solve the smaller instances somehow.
 Combine the results of the smaller instances to obtain
the result for the original large instance.
Sort A Large List

• Sort a list that has n > 10 elements.


 Sort 15 elements by dividing them into 2 smaller lists.
One list has 7 elements and the other has 8.
 Sort these two lists using the method for small lists.
 Merge the two sorted lists into a single sorted list.
Find The Min Of A Large List

• Find the minimum of 20 elements.


 Divide into two groups of 10 elements each.
 Find the minimum element in each group somehow.
 Compare the minimums of each group to determine
the overall minimum.
Recursion In Divide And Conquer
• Often the smaller instances that result from the
divide step are instances of the original problem
(true for our sort and min problems). In this case,
 If the new instance is a small instance, it is solved
using the method for small instances.
 If the new instance is a large instance, it is solved using
the divide-and-conquer method recursively.
• Generally, performance is best when the smaller
instances that result from the divide step are of
approximately the same size.
Recursive Find Min

• Find the minimum of 20 elements.


 Divide into two groups of 10 elements each.
 Find the minimum element in each group
recursively. The recursion terminates when the
number of elements is <= 2. At this time the
minimum is found using the method for small
instances.
 Compare the minimums of the two groups to
determine the overall minimum.
Tiling A Defective Chessboard
Our Definition Of A Chessboard
A chessboard is an n x n grid, where n is a
power of 2.

1x1 2x2 4x4 8x8


A defective chessboard is a chessboard that
has one unavailable (defective) position.

1x1 2x2 4x4 8x8


A Triomino
A triomino is an L shaped object that can
cover three squares of a chessboard.

A triomino has four orientations.


Tiling A Defective Chessboard
Place (n^2 - 1)/3 triominoes on an n x n
defective chessboard so that all n^2 - 1
nondefective positions are covered.

1x1 2x2 4x4 8x8


Tiling A Defective Chessboard

Divide into four smaller chessboards. 4 x 4

One of these is a defective 4 x 4 chessboard.


Tiling A Defective Chessboard

Make the other three 4 x 4 chessboards defective


by placing a triomino at their common corner.
Recursively tile the four defective 4 x 4
chessboards.
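A minimal recursive sketch of this strategy follows; the names board, tileBoard, and tileCount are illustrative, not from the text, and board is assumed to be an n x n grid with the defective square pre-marked.

#include <vector>
using namespace std;

vector<vector<int> > board;  // board[i][j] = number of the triomino covering (i,j)
int tileCount = 0;           // triominoes are numbered 1, 2, 3, ...

// Tile the size x size subboard whose top-left corner is (top, left);
// (dr, dc) is its defective (or already covered) square.
void tileBoard(int top, int left, int dr, int dc, int size) {
    if (size == 1) return;
    int t = ++tileCount;                 // triomino placed at this step
    int half = size / 2;
    int midR = top + half, midC = left + half;
    // In each quadrant, either recurse on the real defect or cover the
    // quadrant's corner square nearest the board center with triomino t.
    if (dr < midR && dc < midC) tileBoard(top, left, dr, dc, half);
    else { board[midR - 1][midC - 1] = t; tileBoard(top, left, midR - 1, midC - 1, half); }
    if (dr < midR && dc >= midC) tileBoard(top, midC, dr, dc, half);
    else { board[midR - 1][midC] = t; tileBoard(top, midC, midR - 1, midC, half); }
    if (dr >= midR && dc < midC) tileBoard(midR, left, dr, dc, half);
    else { board[midR][midC - 1] = t; tileBoard(midR, left, midR, midC - 1, half); }
    if (dr >= midR && dc >= midC) tileBoard(midR, midC, dr, dc, half);
    else { board[midR][midC] = t; tileBoard(midR, midC, midR, midC, half); }
}

Exactly one quadrant contains the defect, so triomino t covers exactly three squares, one in each of the other three quadrants.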
Tiling A Defective Chessboard
Complexity

• Let n = 2^k.
• Let t(k) be the time taken to tile a 2^k x 2^k
defective chessboard.
• t(0) = d, where d is a constant.
• t(k) = 4t(k-1) + c, when k > 0. Here c is a
constant.
• Recurrence equation for t().
Substitution Method
t(k) = 4t(k-1) + c
     = 4[4t(k-2) + c] + c
     = 4^2 t(k-2) + 4c + c
     = 4^2 [4t(k-3) + c] + 4c + c
     = 4^3 t(k-3) + 4^2 c + 4c + c
     = ...
     = 4^k t(0) + 4^(k-1) c + 4^(k-2) c + ... + 4^2 c + 4c + c
     = 4^k d + 4^(k-1) c + 4^(k-2) c + ... + 4^2 c + 4c + c
     = Theta(4^k)
     = Theta(number of triominoes placed)
Min And Max
Find the lightest and heaviest of n elements
using a balance that allows you to compare
the weight of 2 elements.

Minimize the number of comparisons.


Max Element
• Find element with max weight from
w[0:n-1].

maxElement = 0;
for (int i = 1; i < n; i++)
if (w[maxElement] < w[i]) maxElement = i;

• Number of comparisons of w values is n-1.


Min And Max

• Find the max of n elements making n-1


comparisons.
• Find the min of the remaining n-1 elements
making n-2 comparisons.
• Total number of comparisons is 2n-3.
Divide And Conquer

• Small instance.
 n <= 2.
 Find the min and max element making at most
one comparison.
Large Instance Min And Max
 n > 2.
 Divide the n elements into 2 groups A and B
with floor(n/2) and ceil(n/2) elements,
respectively.
 Find the min and max of each group
recursively.
 Overall min is min{min(A), min(B)}.
 Overall max is max{max(A), max(B)}.
Min And Max Example
• Find the min and max of {3,5,6,2,4,9,3,1}.
• Large instance.
• A = {3,5,6,2} and B = {4,9,3,1}.
• min(A) = 2, min(B) = 1.
• max(A) = 6, max(B) = 9.
• min{min(A),min(B)} = 1.
• max{max(A), max(B)} = 9.
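A sketch of the recursive strategy on an int array (the name minMax is illustrative, not the text's code):

#include <algorithm>
using namespace std;

// Sets theMin and theMax to the min and max of a[lo..hi], lo <= hi.
void minMax(int a[], int lo, int hi, int& theMin, int& theMax) {
    if (hi - lo <= 1) {                  // small instance: n <= 2
        if (a[lo] <= a[hi]) { theMin = a[lo]; theMax = a[hi]; }
        else { theMin = a[hi]; theMax = a[lo]; }  // one comparison decides both
        return;
    }
    int mid = (lo + hi) / 2;             // divide into groups A and B
    int minA, maxA, minB, maxB;
    minMax(a, lo, mid, minA, maxA);      // solve A recursively
    minMax(a, mid + 1, hi, minB, maxB);  // solve B recursively
    theMin = min(minA, minB);            // combine the two mins
    theMax = max(maxA, maxB);            // combine the two maxs
}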
Dividing Into Smaller Instances
{8,2,6,3,9,1,7,5,4,2,8}

{8,2,6,3,9} {1,7,5,4,2,8}

{8,2} {6,3,9} {1,7,5} {4,2,8}

{6} {3,9} {1} {7,5} {4} {2,8}


Solve Small Instances And Combine
{1,9}

{2,9} {1,8}

{8,2} {3,9} {1,7} {2,8}

{2,8} {6} {3,9} {1} {7,5} {4} {2,8}

{6,6} {3,9} {1,1} {5,7} {4,4} {2,8}


Time Complexity
• Let c(n) be the number of comparisons made
when finding the min and max of n elements.
• c(0) = c(1) = 0.
• c(2) = 1.
• When n > 2,
c(n) = c(floor(n/2)) + c(ceil(n/2)) + 2
• To solve the recurrence, assume n is a power of 2
and use repeated substitution.
• c(n) = ceil(3n/2) - 2.
Interpretation Of Recursive Version

• The working of a recursive divide-and-conquer algorithm


can be described by a tree—recursion tree.
• The algorithm moves down the recursion tree dividing large
instances into smaller ones.
• Leaves represent small instances.
• The recursive algorithm moves back up the tree combining
the results from the subtrees.
• The combining finds the min of the mins computed at
leaves and the max of the leaf maxs.
Iterative Version

• Start with n/2 groups with 2 elements each


and possibly 1 group that has just 1 element.
• Find the min and max in each group.
• Find the min of the mins.
• Find the max of the maxs.
Iterative Version Example

• {2,8,3,6,9,1,7,5,4,2,8}
• {2,8}, {3,6}, {9,1}, {7,5}, {4,2}, {8}
• mins = {2,3,1,5,2,8}
• maxs = {8,6,9,7,4,8}
• minOfMins = 1
• maxOfMaxs = 9
Comparison Count
• Start with n/2 groups with 2 elements each
and possibly 1 group that has just 1 element.
 No compares.
• Find the min and max in each group.
 floor(n/2) compares.
• Find the min of the mins.
 ceil(n/2) - 1 compares.
• Find the max of the maxs.
 ceil(n/2) - 1 compares.
• Total is ceil(3n/2) - 2 compares.
Initialize A Heap
• n > 1:
 Initialize left subtree and right subtree recursively.
 Then do a trickle down operation at the root.
• t(n) = c, n <= 1.
• t(n) = 2t(n/2) + d * height, n > 1.
• c and d are constants.
• Solve to get t(n) = O(n).
• Implemented iteratively in Chapter 12.
Initialize A Loser Tree
• n > 1:
 Initialize left subtree.
 Initialize right subtree.
 Compare winners from left and right subtrees.
 Loser is saved in root and winner is returned.
• t(n) = c, n <= 1.
• t(n) = 2t(n/2) + d, n > 1.
• c and d are constants.
• Solve to get t(n) = O(n).
• Implemented iteratively in Chapter 13.
Divide-And-Conquer Sorting
• Small instance.
 n <= 1 elements.
 n <= 10 elements.
 We’ll use n <= 1 for now.
• Large instance.
 Divide into k >= 2 smaller instances.
 k = 2, 3, 4, … ?
 What does each smaller instance look like?
 Sort smaller instances recursively.
 How do you combine the sorted smaller instances?
Insertion Sort
a[0] a[n-2] a[n-1]

• k=2
• First n - 1 elements (a[0:n-2]) define one of the
smaller instances; last element (a[n-1]) defines
the second smaller instance.
• a[0:n-2] is sorted recursively.
• a[n-1] is a small instance.
Insertion Sort
a[0] a[n-2] a[n-1]

• Combining is done by inserting a[n-1] into the


sorted a[0:n-2].
• Complexity is O(n2).
• Usually implemented nonrecursively.
Divide And Conquer
• Divide-and-conquer algorithms generally have
best complexity when a large instance is divided
into smaller instances of approximately the same
size.
• When k = 2 and n = 24, divide into two smaller
instances of size 12 each.
• When k = 2 and n = 25, divide into two smaller
instances of size 13 and 12, respectively.
Merge Sort
• k=2
• First ceil(n/2) elements define one of the smaller
instances; remaining floor(n/2) elements define
the second smaller instance.
• Each of the two smaller instances is sorted
recursively.
• The sorted smaller instances are combined using
a process called merge.
• Complexity is O(n log n).
• Usually implemented nonrecursively.
Merge Two Sorted Lists
• A = (2, 5, 6)
B = (1, 3, 8, 9, 10)
C = ()
• Compare smallest elements of A and B and
merge smaller into C.
• A = (2, 5, 6)
B = (3, 8, 9, 10)
C = (1)
Merge Two Sorted Lists
• A = (5, 6)
B = (3, 8, 9, 10)
C = (1, 2)
• A = (5, 6)
B = (8, 9, 10)
C = (1, 2, 3)
• A = (6)
B = (8, 9, 10)
C = (1, 2, 3, 5)
Merge Two Sorted Lists
• A = ()
B = (8, 9, 10)
C = (1, 2, 3, 5, 6)
• When one of A and B becomes empty, append
the other list to C.
• O(1) time needed to move an element into C.
• Total time is O(n + m), where n and m are,
respectively, the number of elements initially in
A and B.
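The same process on arrays, as an illustrative sketch (names are not the text's code): merge sorted a[0..n-1] and b[0..m-1] into c in O(n + m) time.

// Merge two sorted arrays a (n elements) and b (m elements) into c.
void merge(int a[], int n, int b[], int m, int c[]) {
    int i = 0, j = 0, k = 0;
    while (i < n && j < m)                          // both lists nonempty
        c[k++] = (a[i] <= b[j]) ? a[i++] : b[j++];  // move the smaller front element
    while (i < n) c[k++] = a[i++];                  // append the rest of a
    while (j < m) c[k++] = b[j++];                  // append the rest of b
}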
Merge Sort

[8, 3, 13, 6, 2, 14, 5, 9, 10, 1, 7, 12, 4]

[8, 3, 13, 6, 2, 14, 5] [9, 10, 1, 7, 12, 4]

[8, 3, 13, 6] [2, 14, 5] [9, 10, 1] [7, 12, 4]

[8, 3] [13, 6] [2, 14] [5] [9, 10] [1] [7, 12] [4]
[8] [3] [13] [6] [2] [14] [9] [10] [7] [12]
Merge Sort

[1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 13,14]

[2, 3, 5, 6, 8, 13, 14] [1, 4, 7, 9, 10,12]

[3, 6, 8, 13] [2, 5, 14] [1, 9, 10] [4, 7, 12]

[3, 8] [6, 13] [2, 14] [5] [9, 10] [1] [7, 12] [4]
[8] [3] [13] [6] [2] [14] [9] [10] [7] [12]
Time Complexity
• Let t(n) be the time required to sort n elements.
• t(0) = t(1) = c, where c is a constant.
• When n > 1,
t(n) = t(ceil(n/2)) + t(floor(n/2)) + dn,
where d is a constant.
• To solve the recurrence, assume n is a power of 2
and use repeated substitution.
• t(n) = O(n log n).
Nonrecursive Version

• Eliminate downward pass.


• Start with sorted lists of size 1 and do
pairwise merging of these sorted lists as in
the upward pass.
Nonrecursive Merge Sort

[8] [3] [13] [6] [2] [14] [5] [9] [10] [1] [7] [12] [4]

[3, 8] [6, 13] [2, 14] [5, 9] [1, 10] [7, 12] [4]

[3, 6, 8, 13] [2, 5, 9, 14] [1, 7, 10, 12] [4]

[2, 3, 5, 6, 8, 9, 13, 14] [1, 4, 7, 10, 12]

[1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 13, 14]


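One possible sketch of this organization, reusing the merge() sketch shown earlier; mergePass and mergeSort are illustrative names, not the text's code.

// One pass: merge adjacent sorted segments of length segSize from a into c.
void mergePass(int a[], int c[], int n, int segSize) {
    int i = 0;
    while (i + 2 * segSize <= n) {     // two full segments remain
        merge(a + i, segSize, a + i + segSize, segSize, c + i);
        i += 2 * segSize;
    }
    if (i + segSize < n)               // one full and one partial segment
        merge(a + i, segSize, a + i + segSize, n - i - segSize, c + i);
    else                               // at most one segment left: copy it over
        for (; i < n; i++) c[i] = a[i];
}

void mergeSort(int a[], int n) {
    int* b = new int[n];
    for (int segSize = 1; segSize < n; segSize *= 2) {
        mergePass(a, b, n, segSize);   // merge from a into b
        segSize *= 2;
        if (segSize >= n) {            // sorted result is in b; copy back
            for (int i = 0; i < n; i++) a[i] = b[i];
            break;
        }
        mergePass(b, a, n, segSize);   // merge from b back into a
    }
    delete [] b;
}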
Complexity
• Sorted segment size is 1, 2, 4, 8, …
• Number of merge passes is ceil(log2n).
• Each merge pass takes O(n) time.
• Total time is O(n log n).
• Need O(n) additional space for the merge.
• Merge sort is slower than insertion sort when n
<= 15 (approximately). So define a small instance
to be an instance with n <= 15.
• Sort small instances using insertion sort.
• Start with segment size = 15.
Natural Merge Sort
• Initial sorted segments are the naturally occurring
sorted segments in the input.
• Input = [8, 9, 10, 2, 5, 7, 9, 11, 13, 15, 6, 12, 14].
• Initial segments are:
[8, 9, 10] [2, 5, 7, 9, 11, 13, 15] [6, 12, 14]
• 2 (instead of 4) merge passes suffice.
• Segment boundaries have a[i] > a[i+1].
Quick Sort
• Small instance has n <= 1. Every small instance is a
sorted instance.
• To sort a large instance, select a pivot element from out
of the n elements.
• Partition the n elements into 3 groups left, middle and
right.
• The middle group contains only the pivot element.
• All elements in the left group are <= pivot.
• All elements in the right group are >= pivot.
• Sort left and right groups recursively.
• Answer is sorted left group, followed by middle group
followed by sorted right group.
Example

6 2 8 5 11 10 4 1 9 7 3

Use 6 as the pivot.

2 5 4 1 3 6 7 9 10 11 8

Sort left and right groups recursively.


Choice Of Pivot

• Pivot is leftmost element in list that is to be sorted.


 When sorting a[6:20], use a[6] as the pivot.
 Text implementation does this.
• Randomly select one of the elements to be sorted
as the pivot.
 When sorting a[6:20], generate a random number r in
the range [6, 20]. Use a[r] as the pivot.
Choice Of Pivot
• Median-of-Three rule. From the leftmost, middle,
and rightmost elements of the list to be sorted,
select the one with median key as the pivot.
 When sorting a[6:20], examine a[6], a[13] ((6+20)/2),
and a[20]. Select the element with median (i.e., middle)
key.
 If a[6].key = 30, a[13].key = 2, and a[20].key = 10,
a[20] becomes the pivot.
 If a[6].key = 3, a[13].key = 2, and a[20].key = 10, a[6]
becomes the pivot.
Choice Of Pivot

 If a[6].key = 30, a[13].key = 25, and a[20].key = 10,


a[13] becomes the pivot.
• When the pivot is picked at random or when the
median-of-three rule is used, we can use the quick
sort code of the text provided we first swap the
leftmost element and the chosen pivot.
Partitioning Example Using
Additional Array
a 6 2 8 5 11 10 4 1 9 7 3

b 2 5 4 1 3 6 7 9 10 11 8

Sort left and right groups recursively.


In-Place Partitioning Example
a 6 2 8 5 11 10 4 1 9 7 3

a 6 2 3 5 11 10 4 1 9 7 8

a 6 2 3 5 1 10 4 11 9 7 8

a 6 2 3 5 1 4 10 11 9 7 8

bigElement is not to left of smallElement,


terminate process. Swap pivot and smallElement.
a 4 2 3 5 1 6 10 11 9 7 8
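A sketch of quick sort with this in-place partitioning, using the leftmost element as the pivot; names are illustrative, not the text's code.

#include <algorithm>
using namespace std;

void quickSort(int a[], int lo, int hi) {
    if (lo >= hi) return;                  // small instance: n <= 1
    int pivot = a[lo];                     // leftmost element is the pivot
    int i = lo, j = hi + 1;
    while (true) {
        do { i++; } while (i <= hi && a[i] < pivot);  // leftmost bigElement
        do { j--; } while (a[j] > pivot);             // rightmost smallElement
        if (i >= j) break;    // bigElement not to left of smallElement: stop
        swap(a[i], a[j]);
    }
    swap(a[lo], a[j]);                     // put pivot between the two groups
    quickSort(a, lo, j - 1);               // sort left group
    quickSort(a, j + 1, hi);               // sort right group
}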
Complexity
• O(n) time to partition an array of n elements.
• Let t(n) be the time needed to sort n elements.
• t(0) = t(1) = c, where c is a constant.
• When n > 1,
t(n) = t(|left|) + t(|right|) + dn,
where d is a constant.
• t(n) is maximum when either |left| = 0 or |right| =
0 following each partitioning.
Complexity
• This happens, for example, when the pivot is
always the smallest element.
• For the worst-case time,
t(n) = t(n-1) + dn, n > 1
• Use repeated substitution to get t(n) = O(n2).
• The best case arises when |left| and |right| are
equal (or differ by 1) following each partitioning.
• For the best case, the recurrence is the same as
for merge sort.
Complexity Of Quick Sort
• So the best-case complexity is O(n log n).
• Average complexity is also O(n log n).
• To help get partitions with almost equal size,
change in-place swap rule to:
 Find leftmost element (bigElement) >= pivot.
 Find rightmost element (smallElement) <= pivot.
 Swap bigElement and smallElement provided
bigElement is to the left of smallElement.
• O(n) space is needed for the recursion stack. May
be reduced to O(log n) (see Exercise 18.22).
Complexity Of Quick Sort

• To improve performance, define a small instance


to be one with n <= 15 (say) and sort small
instances using insertion sort.
C++ STL sort Function

• Quick sort.
 Switch to heap sort when the number of subdivisions
exceeds some constant times log2n.
 Switch to insertion sort when segment size becomes
small.
C++ STL stable_sort Function

• Merge sort is stable (relative order of elements


with equal keys is not changed).
• Quick sort is not stable.
• STL’s stable_sort is a merge sort that switches to
insertion sort when segment size is small.
Rank
Rank of an element is its position in ascending key
order.
[2,6,7,8,10,15,18,20,25,30,35,40]
rank(2) = 0
rank(15) = 5
rank(20) = 7
Selection Problem
• Given n unsorted elements, determine the
k’th smallest element. That is, determine the
element whose rank is k-1.
• Applications
 Median score on a test.
• k = ceil(n/2).
 Median salary of Computer Scientists.
 Identify people whose salary is in the bottom
10%. First find salary at the 10% rank.
Selection By Sorting

• Sort the n elements.


• Pick up the element with desired rank.
• O(n log n) time.
D&C Selection Example
Find the k'th smallest element of:
a 3 2 8 0 11 10 1 2 9 7 1

Use 3 as the pivot and partition.


a 1 2 1 0 2 3 10 11 9 7 8

rank(pivot) = 5. So pivot is the 6’th


smallest element.
D&C Selection Example
a 1 2 1 0 2 3 10 11 9 7 8

• If k = 6 (k-1 = rank(pivot)), pivot is the


element we seek.
• If k < 6 (k-1 < rank(pivot)), find k’th
smallest element in left partition.
• If k > 6 (k-1 > rank(pivot)), find (k-
rank(pivot)-1)’th smallest element in right
partition.
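A sketch of the resulting selection algorithm (often called quickselect), written iteratively with the same in-place partitioning; names are illustrative. It returns the k'th smallest (k >= 1) element of a[lo..hi].

#include <algorithm>
using namespace std;

int select(int a[], int lo, int hi, int k) {
    while (true) {
        if (lo >= hi) return a[lo];        // one element left
        int pivot = a[lo], i = lo, j = hi + 1;
        while (true) {                     // in-place partition
            do { i++; } while (i <= hi && a[i] < pivot);
            do { j--; } while (a[j] > pivot);
            if (i >= j) break;
            swap(a[i], a[j]);
        }
        swap(a[lo], a[j]);                 // pivot's rank within a[lo..hi] is j - lo
        if (k - 1 == j - lo) return a[j];  // pivot is the k'th smallest
        if (k - 1 < j - lo) hi = j - 1;    // answer is in the left partition
        else { k -= j - lo + 1; lo = j + 1; }  // answer is in the right partition
    }
}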
Time Complexity
• Worst case arises when the partition to be
searched always has all but the pivot.
 O(n2)
• Expected performance is O(n).
• Worst case becomes O(n) when the pivot is
chosen carefully.
 Partition into n/9 groups with 9 elements each
(last group may have a few more)
 Find the median element in each group.
 pivot is the median of the group medians.
 This median is found using select recursively.
Closest Pair Of Points
• Given n points in 2D, find the pair that are
closest.
Applications

• We plan to drill holes in a metal sheet.


• If the holes are too close, the sheet will tear during
drilling.
• Verify that no two holes are closer than a
threshold distance (e.g., holes are at least 1 inch
apart).
Air Traffic Control

• 3D -- Locations of airplanes flying in the


neighborhood of a busy airport are known.
• Want to be sure that no two planes get closer than
a given threshold distance.
Simple Solution

• For each of the n(n-1)/2 pairs of points,


determine the distance between the points in
the pair.
• Determine the pair with the minimum
distance.
• O(n2) time.
Divide-And-Conquer Solution
• When n is small, use simple solution.
• When n is large
 Divide the point set into two roughly equal parts A
and B.
 Determine the closest pair of points in A.
 Determine the closest pair of points in B.
 Determine the closest pair of points such that one
point is in A and the other in B.
 From the three closest pairs computed, select the one
with least distance.
Example

A B
• Divide so that points in A have x-coordinate
<= that of points in B.
Example

d1

A B
• Find closest pair in A.
• Let d1 be the distance between the points in
this pair.
Example

d1

d2

A B
• Find closest pair in B.
• Let d2 be the distance between the points in
this pair.
Example

d1

d2

A B
• Let d = min{d1, d2}.
• Is there a pair with one point in A, the other in
B and distance < d?
Example

A RA RB B
• Candidates lie within d of the dividing line.
• Call these regions RA and RB, respectively.
Example

A RA RB B
• Let q be a point in RA.
• q need be paired only with those points in RB that are
within d of q.y.
Example

q 2d
d

A RA RB B
• Points that are to be paired with q are in a d x 2d
rectangle of RB (comparing region of q).
• Points in this rectangle are at least d apart.
Example

q 2d
d

A RA RB B
• So the comparing region of q has at most 6 points.
• So number of pairs to check is <= 6| RA | = O(n).
Time Complexity

• Create a sorted by x-coordinate list of points.


 O(n log n) time.
• Create a sorted by y-coordinate list of points.
 O(n log n) time.
• Using these two lists, the required pairs of points
from RA and RB can be constructed in O(n) time.
• Let n < 4 define a small instance.
Time Complexity
• Let t(n) be the time to find the closest pair (excluding the
time to create the two sorted lists).
• t(n) = c, n < 4, where c is a constant.
• When n >= 4,
t(n) = t(ceil(n/2)) + t(floor(n/2)) + an,
where a is a constant.
• To solve the recurrence, assume n is a power of 2
and use repeated substitution.
• t(n) = O(n log n).
Dynamic Programming

• Sequence of decisions.
• Problem state.
• Principle of optimality.
• Dynamic Programming Recurrence
Equations.
• Solution of recurrence equations.
Sequence Of Decisions

• As in the greedy method, the solution to a


problem is viewed as the result of a
sequence of decisions.
• Unlike the greedy method, decisions are not
made in a greedy and binding manner.
0/1 Knapsack Problem
Let xi = 1 when item i is selected and let xi = 0
when item i is not selected.
maximize Σ (i = 1 to n) pi xi
subject to Σ (i = 1 to n) wi xi <= c
and xi = 0 or 1 for all i
All profits and weights are positive.
Sequence Of Decisions

• Decide the xi values in the order x1, x2, x3, …, xn.


• Decide the xi values in the order xn, xn-1, xn-2, …,
x1.
• Decide the xi values in the order x1, xn, x2, xn-1, …
• Or any other order.
Problem State
• The state of the 0/1 knapsack problem is given by
 the weights and profits of the available items
 the capacity of the knapsack
• When a decision on one of the xi values is made,
the problem state changes.
 item i is no longer available
 the remaining knapsack capacity may be less
Problem State
• Suppose that decisions are made in the order x1, x2, x3,
…, xn.
• The initial state of the problem is described by the pair
(1, c).
 Items 1 through n are available (the weights, profits and n are
implicit).
 The available knapsack capacity is c.
• Following the first decision the state becomes one of the
following:
 (2, c) … when the decision is to set x1= 0.
 (2, c-w1) … when the decision is to set x1= 1.
Problem State
• Suppose that decisions are made in the order xn, xn-1, xn-2,
…, x1.
• The initial state of the problem is described by the pair
(n, c).
 Items 1 through n are available (the weights, profits and first
item index are implicit).
 The available knapsack capacity is c.
• Following the first decision the state becomes one of the
following:
 (n-1, c) … when the decision is to set xn= 0.
 (n-1, c-wn) … when the decision is to set xn= 1.
Principle Of Optimality
• An optimal solution satisfies the following
property:
 No matter what the first decision, the remaining
decisions are optimal with respect to the state that
results from this decision.

• Dynamic programming may be used only when


the principle of optimality holds.
0/1 Knapsack Problem
• Suppose that decisions are made in the order x1,
x2, x3, …, xn.
• Let x1= a1, x2 = a2, x3 = a3, …, xn = an be an
optimal solution.
• If a1 = 0, then following the first decision the state
is (2, c).
• a2, a3, …, an must be an optimal solution to the
knapsack instance given by the state (2,c).
x1 = a1 = 0
maximize Σ (i = 2 to n) pi xi
subject to Σ (i = 2 to n) wi xi <= c
and xi = 0 or 1 for all i

• If not, this instance has a better solution b2, b3,


…, bn.
Σ (i = 2 to n) pi bi > Σ (i = 2 to n) pi ai
x1 = a1 = 1
maximize Σ (i = 2 to n) pi xi
subject to Σ (i = 2 to n) wi xi <= c - w1
and xi = 0 or 1 for all i

• If not, this instance has a better solution b2, b3,


…, bn.
Σ (i = 2 to n) pi bi > Σ (i = 2 to n) pi ai
0/1 Knapsack Problem
• Therefore, no matter what the first decision, the
remaining decisions are optimal with respect to
the state that results from this decision.
• The principle of optimality holds and dynamic
programming may be applied.
Dynamic Programming Recurrence
• Let f(i,y) be the profit value of the optimal solution to
the knapsack instance defined by the state (i,y).
 Items i through n are available.
 Available capacity is y.
• For the time being assume that we wish to determine
only the value of the best solution.
 Later we will worry about determining the xis that yield this
maximum value.
• Under this assumption, our task is to determine f(1,c).
Dynamic Programming Recurrence
• f(n,y) is the value of the optimal solution to the
knapsack instance defined by the state (n,y).
 Only item n is available.
 Available capacity is y.
• If wn <= y, f(n,y) = pn.
• If wn > y, f(n,y) = 0.
Dynamic Programming Recurrence
• Suppose that i < n.
• f(i,y) is the value of the optimal solution to the
knapsack instance defined by the state (i,y).
 Items i through n are available.
 Available capacity is y.
• Suppose that in the optimal solution for the state
(i,y), the first decision is to set xi= 0.
• From the principle of optimality (we have
shown that this principle holds for the knapsack
problem), it follows that f(i,y) = f(i+1,y).
Dynamic Programming Recurrence
• The only other possibility for the first decision
is xi= 1.
• The case xi= 1 can arise only when y >= wi.
• From the principle of optimality, it follows that
f(i,y) = f(i+1,y-wi) + pi.
• Combining the two cases, we get
 f(i,y) = f(i+1,y) whenever y < wi.
 f(i,y) = max{f(i+1,y), f(i+1,y-wi) + pi}, y >= wi.
Recursive Code
int f(int i, int y)
{// Return f(i, y).
if (i == n) return (y < w[n]) ? 0 : p[n];
if (y < w[i]) return f(i + 1, y);
return max(f(i + 1, y), f(i + 1, y - w[i]) + p[i]);
}
Recursion Tree

f(1,c)

f(2,c) f(2,c-w1)

f(3,c) f(3,c-w2) f(3,c-w1) f(3,c-w1 –w2)

f(4,c) f(4,c-w3) f(4,c-w2) f(4,c-w1 –w3)

f(5,c)
f(5,c-w1 –w3 –w4)
Time Complexity
• Let t(n) be the time required when n items are
available.
• t(0) = t(1) = a, where a is a constant.
• When n > 1,
t(n) <= 2t(n-1) + b,
where b is a constant.
• t(n) = O(2^n).

Solving dynamic programming recurrences


recursively can be hazardous to run time.
Reducing Run Time

f(1,c)

f(2,c) f(2,c-w1)

f(3,c) f(3,c-w2) f(3,c-w1) f(3,c-w1 –w2)

f(4,c) f(4,c-w3) f(4,c-w2) f(4,c-w1 –w3)

f(5,c)
f(5,c-w1 –w3 –w4)
Integer Weights Dictionary

• Use an array fArray[][] as the dictionary.


• fArray[1:n][0:c]
• fArray[i][y] = -1 iff f(i,y) not yet computed.
• This initialization is done before the recursive method
is invoked.
• The initialization takes O(cn) time.
No Recomputation Code
int f(int i, int y)
{
if (fArray[i][y] >= 0) return fArray[i][y];
if (i == n) {fArray[i][y] = (y < w[n]) ? 0 : p[n];
return fArray[i][y];}
if (y < w[i]) fArray[i][y] = f(i + 1, y);
else fArray[i][y] = max(f(i + 1, y),
f(i + 1, y - w[i]) + p[i]);
return fArray[i][y];
}
Time Complexity
• t(n) = O(cn).
• Analysis done in text.
• Good when cn is small relative to 2^n.
• n = 3, c = 1010101
w = [100102, 1000321, 6327]
p = [102, 505, 5]
• 2^n = 8
• cn = 3030303
Dynamic Programming
• Steps.
View the problem solution as the result of a sequence
of decisions.
Obtain a formulation for the problem state.
Verify that the principle of optimality holds.
Set up the dynamic programming recurrence
equations.
Solve these equations for the value of the optimal
solution.
 Perform a traceback to determine the optimal
solution.
Dynamic Programming

• When solving the dynamic programming


recurrence recursively, be sure to avoid the
recomputation of the optimal value for the
same problem state.
• To minimize run time overheads, and hence
to reduce actual run time, dynamic
programming recurrences are almost always
solved iteratively (no recursion).
0/1 Knapsack Recurrence
• If wn <= y, f(n,y) = pn.
• If wn > y, f(n,y) = 0.
• When i < n
 f(i,y) = f(i+1,y) whenever y < wi.
 f(i,y) = max{f(i+1,y), f(i+1,y-wi) + pi}, y >= wi.
• Assume the weights and capacity are integers.
• Only f(i,y)s with 1 <= i <= n and 0 <= y <= c
are of interest.
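A sketch of the iterative computation; w[1..n], p[1..n], and the (n+1) x (c+1) array f follow the slides' names, but the function itself is illustrative.

#include <algorithm>
using namespace std;

// Fill f[i][y] = f(i,y) for 1 <= i <= n, 0 <= y <= c.
void knapsack(int n, int c, int w[], int p[], int** f) {
    for (int y = 0; y <= c; y++)          // row n first
        f[n][y] = (y < w[n]) ? 0 : p[n];
    for (int i = n - 1; i >= 1; i--)      // then rows n-1 down to 1
        for (int y = 0; y <= c; y++)
            if (y < w[i]) f[i][y] = f[i + 1][y];
            else f[i][y] = max(f[i + 1][y], f[i + 1][y - w[i]] + p[i]);
    // f[1][c] is the value of an optimal solution
}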
Iterative Solution Example
• n = 5, c = 8, w = [4,3,5,6,2], p = [9,8,10,9,3]

0 1 2 3 4 5 6 7 8
f[i][y]
5
4 i
3
2
1
y
Compute f[5][*]
• n = 5, c = 8, w = [4,3,5,6,2], p = [9,8,10,9,3]

0 1 2 3 4 5 6 7 8
f[i][y]
5 0 0 3 3 3 3 3 3 3
4 i
3
2
1
y
Compute f[4][*]
• n = 5, c = 8, w = [4,3,5,6,2], p = [9,8,10,9,3]

0 1 2 3 4 5 6 7 8
f[i][y]
5 0 0 3 3 3 3 3 3 3
4 0 0 3 3 3 3 9 9 12 i
3
2
1
y
f(i,y) = max{f(i+1,y), f(i+1,y-wi) + pi}, y >= wi
Compute f[3][*]
• n = 5, c = 8, w = [4,3,5,6,2], p = [9,8,10,9,3]

0 1 2 3 4 5 6 7 8
f[i][y]
5 0 0 3 3 3 3 3 3 3
4 0 0 3 3 3 3 9 9 12 i
3 0 0 3 3 3 10 10 13 13
2
1
y
f(i,y) = max{f(i+1,y), f(i+1,y-wi) + pi}, y >= wi
Compute f[2][*]
• n = 5, c = 8, w = [4,3,5,6,2], p = [9,8,10,9,3]

0 1 2 3 4 5 6 7 8
f[i][y]
5 0 0 3 3 3 3 3 3 3
4 0 0 3 3 3 3 9 9 12 i
3 0 0 3 3 3 10 10 13 13
2 0 0 3 8 8 11 11 13 18
1
y
f(i,y) = max{f(i+1,y), f(i+1,y-wi) + pi}, y >= wi
Compute f[1][c]
• n = 5, c = 8, w = [4,3,5,6,2], p = [9,8,10,9,3]

0 1 2 3 4 5 6 7 8
f[i][y]
5 0 0 3 3 3 3 3 3 3
4 0 0 3 3 3 3 9 9 12 i
3 0 0 3 3 3 10 10 13 13
2 0 0 3 8 8 11 11 13 18
1 18
y
f(i,y) = max{f(i+1,y), f(i+1,y-wi) + pi}, y >= wi
Traceback
• n = 5, c = 8, w = [4,3,5,6,2], p = [9,8,10,9,3]

0 1 2 3 4 5 6 7 8
f[i][y]
5 0 0 3 3 3 3 3 3 3
4 0 0 3 3 3 3 9 9 12 i
3 0 0 3 3 3 10 10 13 13
2 0 0 3 8 8 11 11 13 18
1 18
y
f[1][8] = f[2][8] => x1 = 0
Traceback
• n = 5, c = 8, w = [4,3,5,6,2], p = [9,8,10,9,3]

0 1 2 3 4 5 6 7 8
f[i][y]
5 0 0 3 3 3 3 3 3 3
4 0 0 3 3 3 3 9 9 12 i
3 0 0 3 3 3 10 10 13 13
2 0 0 3 8 8 11 11 13 18
1 18
y
f[2][8] != f[3][8] => x2 = 1
Traceback
• n = 5, c = 8, w = [4,3,5,6,2], p = [9,8,10,9,3]

0 1 2 3 4 5 6 7 8
f[i][y]
5 0 0 3 3 3 3 3 3 3
4 0 0 3 3 3 3 9 9 12 i
3 0 0 3 3 3 10 10 13 13
2 0 0 3 8 8 11 11 13 18
1 18
y
f[3][5] != f[4][5] => x3 = 1
Traceback
• n = 5, c = 8, w = [4,3,5,6,2], p = [9,8,10,9,3]

0 1 2 3 4 5 6 7 8
f[i][y]
5 0 0 3 3 3 3 3 3 3
4 0 0 3 3 3 3 9 9 12 i
3 0 0 3 3 3 10 10 13 13
2 0 0 3 8 8 11 11 13 18
1 18
y
f[4][0] = f[5][0] => x4 = 0
Traceback
• n = 5, c = 8, w = [4,3,5,6,2], p = [9,8,10,9,3]

0 1 2 3 4 5 6 7 8
f[i][y]
5 0 0 3 3 3 3 3 3 3
4 0 0 3 3 3 3 9 9 12 i
3 0 0 3 3 3 10 10 13 13
2 0 0 3 8 8 11 11 13 18
1 18
y
f[5][0] = 0 => x5 = 0
Complexity Of Traceback

• O(n)
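A sketch of this traceback, continuing the knapsack sketch above (illustrative, not the text's code):

// Recover x[1..n] from the computed f values.
void traceback(int n, int c, int w[], int** f, int x[]) {
    int y = c;
    for (int i = 1; i < n; i++)
        if (f[i][y] == f[i + 1][y]) x[i] = 0;  // item i not in the solution
        else { x[i] = 1; y -= w[i]; }          // item i in the solution
    x[n] = (f[n][y] > 0) ? 1 : 0;              // profits are positive
}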
Matrix Multiplication Chains
• Multiply an m x n matrix A and an n x p matrix
B to get an m x p matrix C.
C(i,j) = Σ (k = 1 to n) A(i,k) * B(k,j)
• We shall use the number of multiplications as
our complexity measure.
• n multiplications are needed to compute one
C(i,j).
• mnp multiplications are needed to compute all
mp terms of C.
Matrix Multiplication Chains
• Suppose that we are to compute the product X*Y*Z of
three matrices X, Y and Z.
• The matrix dimensions are:
 X:(100 x 1), Y:(1 x 100), Z:(100 x 1)
• Multiply X and Y to get a 100 x 100 matrix T.
 100 * 1 * 100 = 10,000 multiplications.
• Multiply T and Z to get the 100 x 1 answer.
 100 * 100 * 1 = 10,000 multiplications.
• Total cost is 20,000 multiplications.
• 10,000 units of space are needed for T.
Matrix Multiplication Chains
• The matrix dimensions are:
 X:(100 x 1)
 Y:(1 x 100)
 Z:(100 x 1)
• Multiply Y and Z to get a 1 x 1 matrix T.
 1 * 100 * 1 = 100 multiplications.
• Multiply X and T to get the 100 x 1 answer.
 100 * 1 * 1 = 100 multiplications.
• Total cost is 200 multiplications.
• 1 unit of space is needed for T.
Product Of 5 Matrices

• Some of the ways in which the product of 5 matrices


may be computed.
 A*(B*(C*(D*E))) right to left
 (((A*B)*C)*D)*E left to right
 (A*B)*((C*D)*E)
 (A*B)*(C*(D*E))
 (A*(B*C))*(D*E)
 ((A*B)*C)*(D*E)
Find Best Multiplication Order

• Number of ways to compute the product of q
matrices is O(4^q / q^1.5).
• Evaluating all ways to compute the product
takes O(4^q / q^0.5) time.
An Application
• Registration of pre- and post-operative 3D brain
MRI images to determine volume of removed
tumor.
3D Registration
3D Registration
• Each image has 256 x 256 x 256 voxels.
• In each iteration of the registration algorithm, the
product of three matrices is computed at each
voxel … (12 x 3) * (3 x 3) * (3 x 1)
• Left to right computation => 12 * 3 * 3 + 12 * 3 * 1
= 144 multiplications per voxel per iteration.
• 100 iterations to converge.
3D Registration
• Total number of multiplications is about 2.4 * 10^11.
• Right to left computation => 3 * 3 * 1 + 12 * 3 * 1
= 45 multiplications per voxel per iteration.
• Total number of multiplications is about 7.5 * 10^10.
• With 10^8 multiplications per second, time is 40
min vs 12.5 min.
Matrix Multiplication Chains
• Determine the best way to compute the
matrix product M1x M2 x M3 x … x Mq.
• Let the dimensions of Mi be ri x ri+1.
• q-1 matrix multiplications are to be done.
• Decide the matrices involved in each of
these multiplications.
Decision Sequence

• M1 x M2 x M3 x … x Mq
• Determine the q-1 matrix products in
reverse order.
 What is the last multiplication?
 What is the next to last multiplication?
 And so on.
Problem State
• M1 x M2 x M3 x … x Mq
• The matrices involved in each multiplication are a
contiguous subset of the given q matrices.
• The problem state is given by a set of pairs of the
form (i, j), i <= j.
 The pair (i,j) denotes a problem in which the matrix
product Mix Mi+1 x … x Mj is to be computed.
 The initial state is (1,q).
 If the last matrix product is (M1 x M2 x … x Mk) x (Mk+1 x
Mk+2 x … x Mq), the state becomes {(1,k), (k+1,q)}.
Verify Principle Of Optimality
• Let Mij = Mi x Mi+1 x … x Mj, i <= j.
• Suppose that the last multiplication in the
best way to compute Mij is Mik x Mk+1,j for
some k, i <= k < j.
• Irrespective of what k is, a best computation
of Mij in which the last product is Mik x
Mk+1,j has the property that Mik and Mk+1,j
are computed in the best possible way.
• So the principle of optimality holds and
dynamic programming may be applied.
Recurrence Equations
• Let c(i,j) be the cost of an optimal (best) way to
compute Mij, i <= j.
• c(1,q) is the cost of the best way to multiply the
given q matrices.
• Let kay(i,j) = k be such that the last product in
the optimal computation of Mij is Mik x Mk+1,j.
• c(i,i) = 0, 1 <= i <= q. (Mii = Mi)
• c(i,i+1) = ri ri+1 ri+2, 1 <= i < q. (Mii+1 = Mi x Mi+1)
• kay(i,i+1) = i.
c(i, i+s), 1 < s < q
• The last multiplication in the best way to
compute Mi,i+s is Mik x Mk+1,i+s for some k, i <= k
< i+s.
• If we knew k, we could claim:
c(i,i+s) = c(i,k) + c(k+1,i+s) + ri rk+1 ri+s+1
• Since i <= k < i+s, we can claim
c(i,i+s) = min{c(i,k) + c(k+1,i+s) + ri rk+1 ri+s+1}, where
the min is taken over i <= k < i+s.
• kay(i,i+s) is the k that yields above min.
Recurrence Equations

• c(i,i+s) = min over i <= k < i+s of
{c(i,k) + c(k+1,i+s) + ri rk+1 ri+s+1}
• c(*,*) terms on right side involve fewer matrices
than does the c(*,*) term on the left side.
• So compute in the order s = 2, 3, …, q-1.
Recursive Implementation

• See text for recursive codes.


• Code that does not avoid recomputation of
already computed c(i,j)s runs in Omega(2^q)
time.
• Code that does not recompute already computed
c(i,j)s runs in O(q^3) time.
• Implement nonrecursively for best worst-case
efficiency.
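A nonrecursive sketch of the computation; the names are illustrative, r[1..q+1] holds the dimensions, and c and kay are (q+1) x (q+1) arrays.

// Compute c(i,j) and kay(i,j) for all 1 <= i <= j <= q in O(q^3) time.
void matrixChain(int q, int r[], int** c, int** kay) {
    for (int i = 1; i <= q; i++) c[i][i] = 0;  // s = 0
    for (int s = 1; s < q; s++)                // s = 1, 2, ..., q-1
        for (int i = 1; i + s <= q; i++) {
            c[i][i + s] = -1;                  // -1 means "not set yet"
            for (int k = i; k < i + s; k++) {  // try each possible last product
                int t = c[i][k] + c[k + 1][i + s]
                        + r[i] * r[k + 1] * r[i + s + 1];
                if (c[i][i + s] < 0 || t < c[i][i + s]) {
                    c[i][i + s] = t;
                    kay[i][i + s] = k;
                }
            }
        }
}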
Example
• q = 4, (10 x 1) * (1 x 10) * (10 x 1) * (1 x 10)
• r = [r1,r2 ,r3,r4,r5] = [10, 1, 10, 1, 10]
1 2 3 4
1
2
3
4

c(i,j), i <= j kay(i,j), i <= j


s=0
c(i,i) and kay(i,i) , 1 <= i <= 4 are to be computed.

0 0
0 0
0 0
0 0

c(i,j), i <= j kay(i,j), i <= j


s=1
c(i,i+1) and kay(i,i+1) , 1 <= i <= 3 are to be
computed.

0 0
0 0
0 0
0 0

c(i,j), i <= j kay(i,j), i <= j


s=1
• c(i,i+1) = ri ri+1 ri+2, 1 <= i < q. (Mii+1 = Mi x Mi+1)
• kay(i,i+1) = i.
• r = [r1,r2 ,r3,r4,r5] = [10, 1, 10, 1, 10]

0 100 0 1
0 10 0 2
0 100 0 3
0 0

c(i,j), i <= j kay(i,j), i <= j


s=2
• c(i,i+2) = min{c(i,i) + c(i+1,i+2) + ri ri+1 ri+3,
c(i,i+1) + c(i+2,i+2) + ri ri+2 ri+3}
• r = [r1,r2 ,r3,r4,r5] = [10, 1, 10, 1, 10]

0 100 0 1
0 10 0 2
0 100 0 3
0 0

c(i,j), i <= j kay(i,j), i <= j


s=2
• c(1,3) = min{c(1,1) + c(2,3) + r1 r2 r4,
c(1,2) + c(3,3) + r1 r3 r4}
• r = [r1,r2 ,r3,r4,r5] = [10, 1, 10, 1, 10]
• c(1,3) = min{0 + 10 + 10, 100 + 0 + 100}
0 100 20 0 1 1
0 10 0 2
0 100 0 3
0 0

c(i,j), i <= j kay(i,j), i <= j


s=2
• c(2,4) = min{c(2,2) + c(3,4) + r2 r3 r5,
c(2,3) + c(4,4) + r2 r4 r5}
• r = [r1,r2 ,r3,r4,r5] = [10, 1, 10, 1, 10]
• c(2,4) = min{0 + 100 + 100, 10 + 0 + 10}
0 100 20 0 1 1
0 10 20 0 2 3
0 100 0 3
0 0

c(i,j), i <= j kay(i,j), i <= j


s=3
• c(1,4) = min{c(1,1) + c(2,4) + r1 r2 r5,
c(1,2) + c(3,4) + r1 r3 r5, c(1,3) + c(4,4) + r1 r4 r5}
• r = [r1,r2 ,r3,r4,r5] = [10, 1, 10, 1, 10]
• c(1,4) = min{0+20+100, 100+100+1000, 20+0+100}
0 100 20 120 0 1 1 1
0 10 20 0 2 3
0 100 0 3
0 0

c(i,j), i <= j kay(i,j), i <= j


Determine The Best Way To Compute M14
• kay(1,4) = 1.
• So the last multiplication is M14 = M11 x M24.
• M11 involves a single matrix and no multiply.
• Find best way to compute M24.
0 100 20 120 0 1 1 1
0 10 20 0 2 3
0 100 0 3
0 0

c(i,j), i <= j kay(i,j), i <= j


Determine The Best Way To Compute M24
• kay(2,4) = 3 .
• So the last multiplication is M24 = M23 x M44.
• M23 = M22 x M33.
• M44 involves a single matrix and no multiply.
0 100 20 120 0 1 1 1
0 10 20 0 2 3
0 100 0 3
0 0

c(i,j), i <= j kay(i,j), i <= j


The Best Way To Compute M14

• The multiplications (in reverse order) are:


 M14 = M11 x M24
 M24 = M23 x M44
 M23 = M22 x M33
Time Complexity
1 2 3 4
1 0 100 20 120
2 0 10 20 c(i,j), i <= j
3 0 100
0
4
• O(q2) c(i,j) values are to be computed, where q is the
number of matrices.
• c(i,i+s) = min over i <= k < i+s of {c(i,k) + c(k+1,i+s) + ri rk+1 ri+s+1}.
• Each c(i,j) is the min of O(q) terms.
• Each of these terms is computed in O(1) time.
• So all c(i,j) are computed in O(q3) time.
Time Complexity
1 2 3 4
1 0 1 1 1
2 0 2 3 kay(i,j), i <= j
3 0 3
0
4

• The traceback takes O(1) time to determine


each matrix product that is to be done.
• q-1 products are to be done.
• Traceback time is O(q).
Dynamic Programming
1. Characterize the structure (problem state) of
optimal solution
2. Recursively define the value of optimal
solution
3. Compute the value of optimal solution
• Usually done in a bottom-up manner
All-Pairs Shortest Paths
• Given an n-vertex directed weighted graph,
find a shortest path from vertex i to vertex j
for each of the n^2 vertex pairs (i,j).
[Figure: the 8-vertex weighted digraph used in this section's examples]
Dijkstra’s Single Source Algorithm
• Use Dijkstra’s algorithm n times, once with
each of the n vertices as the source vertex.
[Figure: the example digraph]
Performance

• Time complexity is O(n^3).


• Works only when no edge has a cost < 0.
Dynamic Programming Solution
• Time complexity is Theta(n^3).
• Works so long as there is no cycle whose length is <
0.
• When there is a cycle whose length is < 0, some
shortest paths aren’t finite.
 If vertex 1 is on a cycle whose length is -2, each time you
go around this cycle once you get a 1 to 1 path that is 2
units shorter than the previous one.
• Simpler to code, smaller overheads.
• Known as the Floyd-Warshall shortest path algorithm.
Decision Sequence

i j

• First decide the highest intermediate vertex (i.e.,


largest vertex number) on the shortest path from i
to j.
• If the shortest path is i, 2, 6, 3, 8, 5, 7, j the first
decision is that vertex 8 is an intermediate vertex
on the shortest path and no intermediate vertex is
larger than 8.
• Then decide the highest intermediate vertex on the
path from i to 8, and so on.
Problem State
i j

• (i,j,k) denotes the problem of finding the shortest


path from vertex i to vertex j that has no
intermediate vertex larger than k. (i.e. using the set
{1, …, k} of vertices as intermediate vertices)
• (i,j,n) denotes the problem of finding the shortest
path from vertex i to vertex j (with no restrictions
on intermediate vertices).
Cost Function
i j

• Let c(i,j,k) be the length of a shortest path from


vertex i to vertex j that has no intermediate vertex
larger than k.
c(i,j,n)
• c(i,j,n) is the length of a shortest path from
vertex i to vertex j that has no intermediate
vertex larger than n.
• No vertex is larger than n.
• Therefore, c(i,j,n) is the length of a shortest
path from vertex i to vertex j.
[Figure: the example digraph]
c(i,j,0)
• c(i,j,0) is the length of a shortest path from vertex i
to vertex j that has no intermediate vertex larger
than 0.
 Every vertex is larger than 0.
 Therefore, c(i,j,0) is the length of a single-edge path
from vertex i to vertex j.
[Figure: the example digraph]
Recurrence For c(i,j,k), k > 0
• The shortest path from vertex i to vertex j that has
no intermediate vertex larger than k may or may
not go through vertex k.
• If this shortest path does not go through vertex k,
the largest permissible intermediate vertex is k-1.
So the path length is c(i,j,k-1).

<k
i j
Recurrence For c(i,j,k) ), k > 0
• Shortest path goes through vertex k.
k
i j

• We may assume that vertex k is not repeated


because no cycle has negative length.
• Largest permissible intermediate vertex on i to k
and k to j paths is k-1.
Recurrence For c(i,j,k) ), k > 0

k
i j

• i to k path must be a shortest i to k path that


goes through no vertex larger than k-1.
• If not, replace current i to k path with a shorter
path from i to k to get an even shorter i to j path.
Recurrence For c(i,j,k) ), k > 0

k
i j

• Similarly, k to j path must be a shortest k to j


path that goes through no vertex larger than k-1.
• Therefore, length of i to k path is c(i,k,k-1), and
length of k to j path is c(k,j,k-1).
• So, c(i,j,k) = c(i,k,k-1) + c(k,j,k-1).
Recurrence For c(i,j,k) ), k > 0

i j

• Combining the two equations for c(i,j,k), we get


• c(i, j, k-1) if shortest i to j path doesn’t pass through k
• c(i, k, k-1)+ c(k, j, k-1) if shortest i to j path passes through k
• c(i,j,k) = min{c(i,j,k-1), c(i,k,k-1) + c(k,j,k-1)}.
• Reminder: c(i,j,0) is the length of a single-edge path from vertex i to vertex j.

• We may compute the c(i,j,k)s in the order k = 1,


2, 3, …, n.
Floyd’s Shortest Paths Algorithm
for (int k = 1; k <= n; k++)
for (int i = 1; i <= n; i++)
for (int j = 1; j <= n; j++)
c(i,j,k) = min{c(i,j,k-1),
c(i,k,k-1) + c(k,j,k-1)};
• Time complexity is O(n^3).
 More precisely Theta(n^3).
 Theta(n^3) space is needed for c(*,*,*).
Space Reduction
• c(i,j,k) = min{c(i,j,k-1), c(i,k,k-1) + c(k,j,k-1)}
• When neither i nor j equals k, c(i,j,k-1) is used
only in the computation of c(i,j,k).
column k

(i,j)

row k

• So c(i,j,k) can overwrite c(i,j,k-1).


Space Reduction
• c(i,j,k) = min{c(i,j,k-1), c(i,k,k-1) + c(k,j,k-1)}
• When i equals k, c(i,j,k-1) equals c(i,j,k).
 c(k,j,k) = min{c(k,j,k-1), c(k,k,k-1) + c(k,j,k-1)}
= min{c(k,j,k-1), 0 + c(k,j,k-1)}
= c(k,j,k-1)
• So, when i equals k, c(i,j,k) can overwrite
c(i,j,k-1).
• Similarly when j equals k, c(i,j,k) can overwrite
c(i,j,k-1).
• So, in all cases c(i,j,k) can overwrite c(i,j,k-1).
Floyd’s Shortest Paths Algorithm
for (int k = 1; k <= n; k++)
for (int i = 1; i <= n; i++)
for (int j = 1; j <= n; j++)
c(i,j) = min{c(i,j), c(i,k) + c(k,j)};

 Initially, c(i,j) = c(i,j,0).


 Upon termination, c(i,j) = c(i,j,n).
 Time complexity is Theta(n^3).
 Theta(n^2) space is needed for c(*,*).
Building The Shortest Paths
• Let kay(i,j) be the largest vertex on the shortest
path from i to j.
• Initially, kay(i,j) = 0 (shortest path has no
intermediate vertex).

for (int k = 1; k <= n; k++)


for (int i = 1; i <= n; i++)
for (int j = 1; j <= n; j++)
if (c(i,j) > c(i,k) + c(k,j))
{kay(i,j) = k; c(i,j) = c(i,k) + c(k,j);}
Example
[Figure: the example digraph]
- 7 5 1 - - - -
- - - - 4 - - -
- 7 - - 9 9 - -
- 5 - - - - 16 - Initial Cost Matrix
- - - 4 - - - 1 c(*,*) = c(*,*,0)
- - - - - - 1 -
2 - - - - - - 4
- - - - - 2 4 -
Final Cost Matrix c(*,*) = c(*,*,n)
0 6 5 1 10 13 14 11
10 0 15 8 4 7 8 5
12 7 0 13 9 9 10 10
15 5 20 0 9 12 13 10
6 9 11 4 0 3 4 1
3 9 8 4 13 0 1 5
2 8 7 3 12 6 0 4
5 11 10 6 15 2 3 0
kay Matrix
0 4 0 0 4 8 8 5
8 0 8 5 0 8 8 5
7 0 0 5 0 0 6 5
8 0 8 0 2 8 8 5
8 4 8 0 0 8 8 0
7 7 7 7 7 0 0 7
0 4 1 1 4 8 0 0
7 7 7 7 7 0 6 0
Shortest Path

[Figure: the example digraph]

Shortest path from 1 to 7.


Path length is 14.
Build A Shortest Path
• The path is 1 4 2 5 8 6 7.
• kay(1,7) = 8
1 8 7
• kay(1,8) = 5
1 5 8 7
• kay(1,5) = 4
1 4 5 8 7
Build A Shortest Path
• The path is 1 4 2 5 8 6 7.
1 4 5 8 7
• kay(1,4) = 0
1 4 5 8 7
• kay(4,5) = 2
1 4 2 5 8 7
• kay(4,2) = 0
1 4 2 5 8 7
Build A Shortest Path
• The path is 1 4 2 5 8 6 7.
1 4 2 5 8 7
• kay(2,5) = 0
1 4 2 5 8 7
• kay(5,8) = 0
1 4 2 5 8 7
• kay(8,7) = 6
1 4 2 5 8 6 7
Build A Shortest Path
• The path is 1 4 2 5 8 6 7.
1 4 2 5 8 6 7
• kay(8,6) = 0
1 4 2 5 8 6 7
• kay(6,7) = 0
1 4 2 5 8 6 7
Output A Shortest Path

void outputPath(int i, int j)


{// does not output first vertex (i) on path
if (i == j) return;
if (kay[i][j] == 0) // no intermediate vertices on path
cout << j << " ";
else {// kay[i][j] is an intermediate vertex on the path
outputPath(i, kay[i][j]);
outputPath(kay[i][j], j);
}
}
Time Complexity Of outputPath
O(number of vertices on shortest path)
To-do challenges
• https://www.hackerrank.com/challenges/fibonacci-modified
• https://www.hackerrank.com/challenges/maxsubarray
• A handbook by a fellow gator on some
problems that can be solved using DP:
– https://github.com/chelseametcalf/DP
Single-Source All-Destinations
Shortest Paths With Negative Costs

• Directed weighted graph.


• Edges may have negative cost.
• No cycle whose cost is < 0.
• Find a shortest path from a given source vertex
s to each of the n vertices of the digraph.
Single-Source All-Destinations
Shortest Paths With Negative Costs

• Dijkstra’s O(n2) single-source greedy algorithm


doesn’t work when there are negative-cost
edges.
• Floyd’s Theta(n^3) all-pairs dynamic-programming
algorithm does work in this case.
Bellman-Ford Algorithm

• Single-source all-destinations shortest paths in


digraphs with negative-cost edges.
• Uses dynamic programming.
• Runs in O(n^3) time when adjacency matrices
are used.
• Runs in O(ne) time when adjacency lists are
used.
Decision Sequence
s w v

• To construct a shortest path from the source to


vertex v, decide on the max number of edges on the
path and on the vertex that comes just before v.

• Since the digraph has no cycle whose length is < 0,


we may limit ourselves to the discovery of cycle-
free (acyclic) shortest paths.
• A path that has no cycle has at most n-1 edges.
Problem State
s w v

• Problem state is given by (u,k), where u is the


destination vertex and k is the max number of
edges.
• (v,n-1) is the state in which we want the shortest
path to v that has at most n-1 edges.
Cost Function
s w v

• Let d(v,k) be the length of a shortest path from the


source vertex to vertex v under the constraint that
the path has at most k edges.
• d(v,n-1) is the length of a shortest unconstrained
path from the source vertex to vertex v.
• We want to determine d(v,n-1) for every vertex v.
Value Of d(*,0)
• d(v,0) is the length of a shortest path from the
source vertex to vertex v under the constraint that
the path has at most 0 edges.

• d(s,0) = 0.
• d(v,0) = infinity for v != s.
Recurrence For d(*,k), k > 0

• d(v,k) is the length of a shortest path from the


source vertex to vertex v under the constraint that
the path has at most k edges.
• If this constrained shortest path goes through no
edge, then d(v,k) = d(v,0).
Recurrence For d(*,k), k > 0
• If this constrained shortest path goes through at
least one edge, then let w be the vertex just before v
on this shortest path (note that w may be s).

s w v

• We see that the path from the source to w must be


a shortest path from the source vertex to vertex w
under the constraint that this path has at most k-1
edges.
• d(v,k) = d(w,k-1) + length of edge (w,v).
Recurrence For d(*,k), k > 0
• d(v,k) = d(w,k-1) + length of edge (w,v).
s w v
• We do not know what w is.
• We can assert
 d(v,k) = min{d(w,k-1) + length of edge (w,v)}, where
the min is taken over all w such that (w,v) is an edge of
the digraph.
• Combining the two cases considered yields:
 d(v,k) = min{d(v,0),
min{d(w,k-1) + length of edge (w,v)}}
Pseudocode To Compute d(*,*)
// initialize d(*,0)
d(s,0) = 0;
d(v,0) = infinity, v != s;
// compute d(*,k), 0 < k < n
for (int k = 1; k < n; k++)
{
d(v,k) = d(v,0), 1 <= v <= n;
for (each edge (u,v))
d(v,k) = min{d(v,k), d(u,k-1) + cost(u,v)}
}
p(*,*)

• Let p(v,k) be the vertex just before vertex v


on the shortest path for d(v,k).
• p(v,0) is undefined.
• Used to construct shortest paths.
Example
[Figure: the 6-vertex example digraph with a negative-cost edge]

Source vertex is 1.
Example
[Figure: the example digraph]
1 2 3 4 5 6 v
0 0 - - - - - - - - - - - k
1 0 3 - 7 - - - 1 - 1 - -
2 0 3 7 7 16 8 - 1 2 1 4 4
3 0 2 7 7 10 8 - 6 2 1 3 4
4 0 2 6 7 10 8 - 6 2 1 3 4
d(v,k) p(v,k)
Example
[Figure: the example digraph]
1 2 3 4 5 6 v
4 0 2 6 7 10 8 - 6 2 1 3 4 k
5 0 2 6 7 9 8 - 6 2 1 3 4

d(v,k) p(v,k)
Shortest Path From 1 To 5
[Figure: the example digraph]

1 2 3 4 5 6 1 2 3 4 5 6
5 0 2 6 7 9 8 - 6 2 1 3 4
d(v,5) p(v,5)
Observations
• d(v,k) = min{d(v,0),
min{d(w,k-1) + length of edge (w,v)}}
• d(s,k) = 0 for all k.
• If d(v,k) = d(v,k-1) for all v, then d(v,j) = d(v,k-1),
for all j >= k-1 and all v.
– i.e., every shortest path uses at most k-1 edges
• If we stop computing as soon as we have a d(*,k)
that is identical to d(*,k-1) the run time becomes
 O(n3) when adjacency matrix is used.
 O(ne) when adjacency lists are used.
Observations
• The computation may be done in-place.
d(v) = min{d(v), min{d(w) + length of edge (w,v)}}
instead of
d(v,k) = min{d(v,0),
min{d(w,k-1) + length of edge (w,v)}}
• Following iteration k, d(v,k+1) <= d(v) <= d(v,k)
• On termination d(v) = d(v,n-1).
• Space requirement becomes O(n) for d(*) and
p(*).
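A sketch that combines these observations — in-place d(*) and p(*) plus early termination; the edge-list representation and all names here are illustrative.

#include <vector>
#include <climits>
using namespace std;

struct Edge { int u, v, cost; };

// d[v] = shortest path length from s to v; p[v] = vertex just before v.
// Assumes vertices 1..n and no cycle of negative length.
void bellmanFord(int n, int s, const vector<Edge>& edges,
                 vector<int>& d, vector<int>& p) {
    const int INF = INT_MAX / 2;         // halved so INF + cost cannot overflow
    d.assign(n + 1, INF);
    p.assign(n + 1, 0);
    d[s] = 0;
    for (int k = 1; k < n; k++) {        // at most n-1 passes
        bool changed = false;
        for (const Edge& e : edges)
            if (d[e.u] + e.cost < d[e.v]) {   // relax edge (u,v)
                d[e.v] = d[e.u] + e.cost;
                p[e.v] = e.u;
                changed = true;
            }
        if (!changed) break;             // d(*,k) = d(*,k-1): terminate early
    }
}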
Applications

• Distance-vector routing protocols


– Routing Information Protocol (RIP)
– Interior Gateway Routing Protocol (IGRP)
Edit Distance*
• How similar are two strings?
– Applications in Spell correction, computational
biology, Machine Translation, etc.
• For example:
– The user types “Graffe”
– Which is closest?
• Graf
• Graft
• Grail
• Giraffe
Edit Distance*
• The minimum edit distance between two strings
• Editing operations are:
– Insertion
– Deletion
– Substitution

 Minimum number of editing operations to


transform one string into another
Edit Distance*
• Sequence of edits from one string to another
Edit Distance*
• Initial state: the word we’re transforming
• Operations: insert, delete, substitute
• Goal state: the word we’re trying to get to
• Path cost (target function to be minimized):
the number of edits
Edit Distance*
• All edit sequences are too many
– We cannot explore all of them (brute force)
• Many paths wind up at the same state
– Keep the shortest path to each of those revisited
states
Edit Distance*
• For two strings
– X of length n
– Y of length m
• Define d(i,j)
– The edit distance between X[1…i] and Y[1..j]
• i.e. the edit distance concerning the first i characters of X
and the first j characters of Y
– The goal state is d(n,m)
• i.e. the edit distance concerning all characters of X and all
characters of Y
Edit Distance*
• Initialization:
– d(i,0) = i
– d(0,j) = j
• Recurrence:
For each i = 1 … N
For each j = 1 … M
d(i,j) = min { d(i-1,j) + 1,
d(i,j-1) + 1,
d(i-1,j-1) if X(i) == Y(j), else d(i-1,j-1) + 1 }
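A bottom-up sketch of this recurrence with unit edit costs (illustrative, not from the slides):

#include <string>
#include <vector>
#include <algorithm>
using namespace std;

int editDistance(const string& X, const string& Y) {
    int n = X.size(), m = Y.size();
    vector<vector<int> > d(n + 1, vector<int>(m + 1));
    for (int i = 0; i <= n; i++) d[i][0] = i;   // delete all of X[1..i]
    for (int j = 0; j <= m; j++) d[0][j] = j;   // insert all of Y[1..j]
    for (int i = 1; i <= n; i++)
        for (int j = 1; j <= m; j++) {
            int sub = d[i - 1][j - 1] + (X[i - 1] == Y[j - 1] ? 0 : 1);
            d[i][j] = min(min(d[i - 1][j] + 1,   // delete X[i]
                              d[i][j - 1] + 1),  // insert Y[j]
                          sub);                  // substitute (or match)
        }
    return d[n][m];                              // O(nm) time and space
}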
Hard Problems

• Some problems are hard to solve.


 No polynomial time algorithm is known.
 E.g., NP-hard problems such as machine scheduling,
bin packing, 0/1 knapsack.
• Is this necessarily bad?
• Data encryption relies on difficult-to-solve
problems.
Reminder: NP-hard Problems
• Infamous class of problems for which no one
has developed a polynomial time algorithm.
• That is, no algorithm whose complexity is
O(n^k) for any constant k is known for any NP-
hard problem.
• The class includes thousands of real-world
problems.
• Highly unlikely that any NP-hard problem can
be solved by a polynomial time algorithm.
Reminder: NP-hard Problems
• Since even polynomial time algorithms with
degree k > 3 (say) are not practical for large n,
we must change our expectations of the
algorithm that is used.
• Usually develop fast heuristics for NP-hard
problems.
 Algorithm that gives a solution close to best.
 Approximation algorithms
 Runs in acceptable amount of time.
• LPT rule is a good heuristic for minimum finish
time scheduling.
NP (non-deterministic polynomial-time)
Cryptography

[Figure: message -> encryption algorithm (uses encryption key) -> transmission channel -> decryption algorithm (uses decryption key) -> message]
Public Key Cryptosystem (RSA)

• A public encryption method that relies on a public


encryption algorithm, a public decryption algorithm, and
a public encryption key.
• Using the public key and encryption algorithm, everyone
can encrypt a message.
• The decryption key is known only to authorized parties.
• Asymmetric method.
– Encryption and decryption keys are different; one is not easily
computed from the other.
Public Key Cryptosystem (RSA)
• p and q are two prime numbers.
• n = pq
• m = (p-1)(q-1)
• a is such that 1 < a < m and gcd(m,a) = 1.
• b is such that (ab) mod m = 1.
• a is computed by generating random positive
integers and testing gcd(m,a) = 1 using the
extended Euclid’s gcd algorithm.
• The extended Euclid’s gcd algorithm also
computes b when gcd(m,a) = 1.
RSA Encryption And Decryption

• Message M < n.
• Encryption key = (a,n).
• Decryption key = (b,n).
• Encrypt => E = M^a mod n.
• Decrypt => M = E^b mod n.
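Both steps rest on modular exponentiation; a square-and-multiply sketch follows. The names are illustrative, and real RSA uses big-integer arithmetic — the 64-bit words shown here overflow for realistic key sizes.

typedef unsigned long long u64;

// Compute (base^exp) mod m by repeated squaring, one exponent bit at a time.
u64 powMod(u64 base, u64 exp, u64 m) {
    u64 result = 1 % m;
    base %= m;
    while (exp > 0) {
        if (exp & 1) result = (result * base) % m;  // this bit of exp is 1
        base = (base * base) % m;                   // square for the next bit
        exp >>= 1;
    }
    return result;
}

With it, E = powMod(M, a, n) and M = powMod(E, b, n).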
Breaking RSA

• Factor n and determine p and q, n = pq.


• Now determine m = (p-1)(q-1).
• Now use Euclid’s extended gcd algorithm to
compute gcd(m,a). b is obtained as a byproduct.
• The decryption key (b,n) has been determined!
• No polynomial-time method for factoring large integers
on a classical computer has yet been found, but it has not
been proven that none exists. *
Security Of RSA

• Relies on the fact that prime factorization is


computationally very hard.
• Let q be the number of bits in the binary
representation of n.
• No algorithm, polynomial in q, is known to find
the prime factors of n.
• Try to find the factors of a 100 bit number.
Elliptic Curve Cryptography
(ECC)
• Asymmetric Encryption Method
– Encryption and decryption keys are different; one is not
easily computed from the other.
• Relies on difficulty of computing the discrete
logarithm problem for the group of an elliptic
curve over some finite field.
– Galois field of size a power of 2.
– Integers modulo a prime.
• 1024-bit RSA ~ 200-bit ECC (cracking difficulty).
• Faster to compute than RSA?
Data Encryption Standard (DES)

• Used for password encryption.


• Encryption and decryption keys are the same,
and are secret. (Symmetric encryption)
• Relies on the computational difficulty of the
satisfiability problem.
• The satisfiability problem is NP-hard.
Satisfiability Problem

• F = (x1 + x2 + x3)(x4 + x7 + x8)(x2 + x5)

• F is true when x1, x2, and x4 (for example) are true.


Other Problems

• Partition
 Partition n positive integers s1, s2, s3, …, sn into
two groups A and B such that the sum of the
numbers in each group is the same.
 [9, 4, 6, 3, 5, 1,8]
 A = [9, 4, 5] and B = [6, 3, 1, 8]
• NP-hard.
Subset Sum Problem

• Does any subset of n positive integers s1, s2,


s3, …, sn have a sum exactly equal to c?
• [9, 4, 6, 3, 5, 1, 8] and c = 18
• A = [9, 4, 5]
• NP-hard.
Traveling Salesperson Problem (TSP)
• Let G be a weighted directed graph.
• A tour in G is a cycle that includes every vertex
of the graph.
• TSP => Find a tour of shortest length.
• Problem is NP-hard.
Applications Of TSP

Home city
Visit city
Applications Of TSP

Robot Station
Applications Of TSP
• Manufacturing.
• A robot arm is used to drill n holes in a metal
sheet.
Robot Station

n+1 vertex TSP.


n-Queens Problem
A queen that is placed on an n x n chessboard,
may attack any piece placed in the same
column, row, or diagonal.

8x8 Chessboard
n-Queens Problem
Can n queens be placed on an n x n
chessboard so that no queen may attack
another queen?

4x4
n-Queens Problem

8x8
Difficult Problems

• Many require you to find either a subset or


permutation that satisfies some constraints
and (possibly also) optimizes some
objective function.
• May be solved by organizing the solution
space into a tree and systematically
searching this tree for the answer.
Subset Problems
• Solution requires you to find a subset of n
elements.
• The subset must satisfy some constraints and
possibly optimize some objective function.
• Examples.
 Partition.
 Subset sum.
 0/1 Knapsack.
 Satisfiability (find subset of variables to be set to true
so that formula evaluates to true).
 Scheduling 2 machines.
 Packing 2 bins.
Permutation Problems
• Solution requires you to find a permutation of n
elements.
• The permutation must satisfy some constraints and
possibly optimize some objective function.
• Examples.
 TSP.
 n-queens.
Each queen must be placed in a different row and different
column.
Let queen i be the queen that is going to be placed in row i.
Let ci be the column in which queen i is placed.
c1, c2, c3, …, cn is a permutation of [1,2,3, …, n] such that no
two queens attack.
Solution Space
• Set that includes at least one solution to the
problem.
• Subset problem.
 n = 2, {00, 01, 10, 11}
 n = 3, {000, 001, 010, 100, 011, 101, 110, 111}
• Solution space for subset problem has 2^n members.
• Nonsystematic search of the space for the answer
takes O(p 2^n) time, where p is the time needed to
evaluate each member of the solution space.
Solution Space
• Permutation problem.
 n = 2, {12, 21}
 n = 3, {123, 132, 213, 231, 312, 321}
• Solution space for a permutation problem has n!
members.
• Nonsystematic search of the space for the answer
takes O(p n!) time, where p is the time needed to
evaluate a member of the solution space.
Backtracking And Branch And Bound
Subset & Permutation Problems
• Subset problem of size n.
 Nonsystematic search of the space for the answer takes
O(p 2^n) time, where p is the time needed to evaluate
each member of the solution space.
• Permutation problem of size n.
 Nonsystematic search of the space for the answer takes
O(p n!) time, where p is the time needed to evaluate
each member of the solution space.
• Backtracking and branch and bound perform a
systematic search; often taking much less time
than taken by a nonsystematic search.
Reminder: Complexity
Tree Organization Of Solution Space
• Set up a tree structure such that the leaves
represent members of the solution space.
• For a size n subset problem, this tree structure has
2^n leaves.
• For a size n permutation problem, this tree
structure has n! leaves.
• The tree structure is too big to store in memory; it
also takes too much time to create the tree
structure.
• Portions of the tree structure are created by the
backtracking and branch and bound algorithms as
needed.
Subset Problem

• Use a full binary tree that has 2^n leaves.


• At level i the members of the solution space
are partitioned by their xi values.
• Members with xi = 1 are in the left subtree.
• Members with xi = 0 are in the right subtree.
• Could exchange roles of left and right
subtree.
Subset Tree For n = 4

x1=1 x1= 0

x2=1 x2= 0 x2=1 x2= 0

x3=1 x3= 0

x4=1 x4=0

1110 1011 0111 0001


Permutation Problem

• Use a tree that has n! leaves.


• At level i the members of the solution space
are partitioned by their xi values.
• Members (if any) with xi = 1 are in the first
subtree.
• Members (if any) with xi = 2 are in the next
subtree.
• And so on.
Permutation Tree For n = 3

x1=1 x1= 3
x1=2

x2= 2 x2= 3 x2= 1 x2= 3 x2= 1 x 2= 2

x3=3 x3=2 x3=3 x3=1 x3=2 x3=1

123 132 213 231 312 321


Backtracking

• Search the solution space tree in a depth-


first manner.
• May be done recursively or use a stack to
retain the path from the root to the current
node in the tree.
• The solution space tree exists only in your
mind, not in the computer.
Backtracking
• The backtracking algorithm enumerates a
set of partial candidates that, in principle,
could be completed in various ways to give
all the possible solutions to the given
problem.
• Backtracking is a Swiss army knife
– Most problems where you can't find another
solution for, are solved by backtracking
Backtracking Depth-First Search

x1=1 x1= 0

x2=1 x2= 0 x2=1 x2= 0


O(2^n) Subset Sum & Bounding Functions
{10, 5, 2, 1}, c = 14

x1=1 x1= 0

x2=1 x2= 0 x2=1 x2= 0

Each forward and backward move takes O(1) time.


Bounding Functions
• When a node that represents a subset whose sum
equals the desired sum c, terminate.
• When a node that represents a subset whose sum
exceeds the desired sum c, backtrack. I.e., do not
enter its subtrees, go back to parent node.
• Keep a variable r that gives you the sum of the
numbers not yet considered. When you move to a
right child, check if current subset sum + r >= c.
If not, backtrack.
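A sketch of subset-sum backtracking with both bounding functions; the names are illustrative, s[0..n-1] holds the numbers, and the first call is subsetSum(s, n, 0, 0, totalSum, c).

// Returns true iff some subset of s[i..n-1], added to curSum, equals c.
// r is the sum of s[i..n-1] (the numbers not yet considered).
bool subsetSum(int s[], int n, int i, int curSum, int r, int c) {
    if (curSum == c) return true;          // answer node reached
    if (i == n) return false;              // no elements left
    r -= s[i];                             // s[i] is decided at this level
    if (curSum + s[i] <= c                 // left child (xi = 1): don't exceed c
        && subsetSum(s, n, i + 1, curSum + s[i], r, c)) return true;
    if (curSum + r >= c                    // right child (xi = 0): only if the
        && subsetSum(s, n, i + 1, curSum, r, c))  // remaining numbers can reach c
        return true;
    return false;
}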
Backtracking
• Space required is O(tree height).
• With effective bounding functions, large instances
can often be solved.
• For some problems (e.g., 0/1 knapsack), the
answer (or a very good solution) may be found
quickly but a lot of additional time is needed to
complete the search of the tree.
• Run backtracking for as much time as is feasible
and use best solution found up to that time.
Branch And Bound
• Search the tree using a breadth-first search (FIFO
branch and bound).
• Search the tree as in a bfs, but replace the FIFO
queue with a stack (LIFO branch and bound).
• Replace the FIFO queue with a priority queue
(least-cost (or max priority) branch and bound).
The priority of a node p in the queue is based on
an estimate of the likelihood that the answer node
is in the subtree whose root is p.
Branch And Bound
• Space required is O(number of leaves).
• For some problems, solutions are at different
levels of the tree (e.g., 16 puzzle).

4 14 1 1 2 3 4
13 2 3 12 5 6 7 8
6 11 5 10 9 10 11 12
9 8 7 15 13 14 15
Branch And Bound
 FIFO branch and bound finds solution closest to root.
 Backtracking may never find a solution because tree
depth is infinite (unless repeating configurations are
eliminated).
• Least-cost branch and bound directs the search to
parts of the space most likely to contain the
answer. So it could perform better than
backtracking.
Branch And Bound
• maintain provable lower and upper bounds on
global objective value
– terminate with a certificate proving epsilon-suboptimality
• rely on two subroutines that (efficiently) compute
a lower and an upper bound on the optimal value
over a given region of search space
• often slow; exponential worst case performance