Details
- Type: Improvement
- Status: Resolved
- Priority: Major
- Resolution: Fixed
Description
The Spark scheduler is very deterministic, which causes problems for the following workload (in serial order on a cluster with a small number of nodes):
cache rdd 1 with 1 partition
cache rdd 2 with 1 partition
cache rdd 3 with 1 partition
....
After a while, only executor 1 holds any cached data, which eventually forces it to evict in-memory blocks to disk while all other executors remain empty.
We can solve this by adding some randomization to cluster scheduling, or by adding memory-aware scheduling (which is much harder to do).
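The effect of the randomization proposal can be illustrated with a small simulation (a sketch with hypothetical names, not Spark's actual scheduler code): a deterministic scheduler that always picks the first candidate executor places every 1-partition RDD on the same node, while shuffling the candidate list first spreads the cached blocks out.

```python
import random

def pick_executor(executors, deterministic=True):
    # Deterministic: always take the first executor in iteration order,
    # mirroring a fixed-order scheduler that places every 1-partition
    # RDD on the same node. Randomized: shuffle the candidates first.
    candidates = list(executors)
    if not deterministic:
        random.shuffle(candidates)
    return candidates[0]

def simulate(num_rdds, num_executors, deterministic):
    # Count how many cached blocks each executor ends up holding.
    blocks = {e: 0 for e in range(num_executors)}
    for _ in range(num_rdds):
        blocks[pick_executor(blocks, deterministic)] += 1
    return blocks

# Deterministic placement: every cached partition lands on executor 0,
# so it fills up and evicts while the others stay empty.
print(simulate(100, 4, deterministic=True))
# Randomized placement: blocks are spread roughly evenly.
print(simulate(100, 4, deterministic=False))
```

This only models the skew itself; a real fix inside the scheduler would also have to respect locality preferences before falling back to a randomized choice.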