Parallel and Distributed Storage: Practice Exercises
Parallel and Distributed Storage: Practice Exercises
21
Parallel and Distributed Storage
Practice Exercises
a. Suppose you have a histogram where values are between 1 and 100, and
are partitioned into 10 ranges, 1 – 10, 11 – 20, … , 91 – 100, with frequen-
cies 15, 5, 20, 10, 10, 5, 5, 20, 5, and 5, respectively. Give a load-balanced
range partitioning function to divide the values into five partitions.
b. Write an algorithm for computing a balanced range partition with p par-
titions, given a histogram of frequency distributions containing n ranges.
tified node, the mastership has changed, and a different node is now the
master. How can such a situation be dealt with?
b. While the master replica could be chosen on a per-partition basis, some
systems support a per-record master replica, where the records of a par-
tition (or tablet) are replicated at some set of nodes, but each record’s
master replica can be on any of the nodes from within this set of nodes,
independent of the master replica of other records. List two benefits of
keeping track of master on a per-record basis.
c. Suggest how to keep track of the master replica for each record, when
there are a large number of records.