SHS-NG M4.2: Port executors page to new backend. #45

vanzin · 2017-08-10T01:33:42Z

The executors page is built on top of the REST API, so the page itself
was easy to hook up to the new code.

Some other pages depend on the ExecutorListener class that is being
removed, though, so they needed to be modified to use data from the
new store. Fortunately, all they seemed to need is the map of executor
logs, so that was somewhat easy too.

The executor timeline graph required adding some properties to the
ExecutorSummary API type. Instead of following the previous code,
which stored all the listener events in memory, the timeline is
now created based on the data available from the API.
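
As a rough sketch of the idea (not the code in this patch), timeline events can be derived purely from executor summaries once they carry add and remove times. `ExecutorInfo` and `TimelineEvent` below are hypothetical stand-ins; only `id` and `addTime` are mentioned in this conversation, while `removeTime` and `removeReason` are assumptions made for illustration.

```scala
import java.util.Date

// Hypothetical, trimmed-down stand-in for the API type. Only `id` and `addTime`
// appear in this thread; `removeTime` and `removeReason` are assumptions.
final case class ExecutorInfo(
    id: String,
    addTime: Date,
    removeTime: Option[Date],
    removeReason: Option[String])

// Hypothetical event record consumed by the timeline renderer.
final case class TimelineEvent(executorId: String, kind: String, time: Long, detail: String)

object ExecutorTimeline {
  // Derive "added"/"removed" events from the summaries alone, ordered by time,
  // instead of replaying listener events kept in memory.
  def events(executors: Seq[ExecutorInfo]): Seq[TimelineEvent] = {
    executors.flatMap { e =>
      val added = TimelineEvent(e.id, "added", e.addTime.getTime, s"Executor ${e.id} added")
      val removed = e.removeTime.map { t =>
        val reason = e.removeReason.map(r => s" ($r)").getOrElse("")
        TimelineEvent(e.id, "removed", t.getTime, s"Executor ${e.id} removed$reason")
      }
      Seq(added) ++ removed
    }.sortBy(_.time)
  }
}
```

An executor that is still alive simply has no `removeTime`, so it contributes a single "added" event.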

I had to change some of the test golden files because the old code would
return executors in "random" order (since it used a mutable Map instead
of something that returns a sorted list), and the new code returns executors
in id order.
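
To illustrate why the golden files were unstable, here is a minimal sketch (not the actual code on either side of this change). It assumes ids are compared as plain strings; whether the new store orders them that way or numerically is not stated here.

```scala
import scala.collection.mutable

object ExecutorOrdering {
  // Iteration order of a mutable.HashMap depends on hashing, not on key order,
  // so output built directly from it is not stable enough for golden files.
  def unorderedIds(executors: mutable.HashMap[String, String]): Seq[String] =
    executors.keys.toSeq

  // Sorting by id makes the result deterministic. Plain string ordering is an
  // assumption made for this sketch ("10" sorts before "2").
  def idsInOrder(executors: mutable.HashMap[String, String]): Seq[String] =
    executors.keys.toSeq.sorted
}
```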

squito

just one small bug I think

again it would be good to mention where this is tested -- in this case I think existing tests actually cover this.

squito · 2017-10-25T19:19:38Z

core/src/main/scala/org/apache/spark/status/AppStatusListener.scala

shouldn't active tasks be updated even if event.reason == Resubmitted?

looks like that is what happens in the current ExecutorsTab

Is it? This is the code right below the copy & pasted comment on top of this block:

if (taskEnd.reason == Resubmitted) { return }

That being said, if a resubmitted task generates an onTaskStart call, then the accounting here will be off (and will also be off in the current code in master, as far as I can tell).

oh you're right, sorry I missed the return -- it's the same in master.

But yeah, master looks wrong too.
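
For reference, a stripped-down sketch of the accounting problem discussed in this thread. `ActiveTaskCounter` and the `EndReason` types are hypothetical and much simpler than the real listener; whether a resubmitted task actually triggers a start event is the open question above, so this only shows what happens if it does.

```scala
sealed trait EndReason
case object Succeeded extends EndReason
case object Resubmitted extends EndReason

// Hypothetical, simplified stand-in for the per-executor accounting.
final class ActiveTaskCounter {
  private var active = 0

  def onTaskStart(): Unit = active += 1

  def onTaskEnd(reason: EndReason): Unit = {
    // Same effect as the early return quoted above: Resubmitted ends are ignored,
    // so the counter is only decremented for other end reasons.
    if (reason != Resubmitted) {
      active -= 1
    }
  }

  def activeTasks: Int = active
}
```

If a start event is followed by a `Resubmitted` end event, the counter stays at 1 even though nothing is running.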

squito · 2017-10-25T20:01:07Z

core/src/main/scala/org/apache/spark/ui/jobs/AllJobsPage.scala

what do you think about adding a helper method for these json snippets? obviously not the fault of your change, but I find these so hard to read, e.g. figuring out where data-title ends (worse in some of the other cases, especially in StagePage). something like

UIUtils.timelineJsonEvent(
  className = "executor added",
  group = "executors",
  start = e.addTime.getTime(),
  end = None,
  contentClass = "executor-event-content",
  dataTitle = s"Executor ${e.id}<br> Added at ${UIUtils.formatDate(new Date(e.addTime.getTime()))}",
  contentBody = s"Executor ${e.id} added")

I'm not against it but I'd rather do that separately.
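
For whoever picks this up separately, a minimal sketch of what such a helper could look like. Nothing here exists in `UIUtils`; the object name, the parameter types, and the exact shape of the emitted snippet are assumptions based only on the call suggested above. It also does no escaping of quotes in its arguments, which a real helper would need.

```scala
object TimelineJson {
  // Hypothetical helper in the spirit of the suggestion above. Neither the name
  // nor the exact output format is taken from Spark itself.
  def timelineJsonEvent(
      className: String,
      group: String,
      start: Long,
      end: Option[Long],
      contentClass: String,
      dataTitle: String,
      contentBody: String): String = {
    val fields = Seq(
      s"'className': '$className'",
      s"'group': '$group'",
      s"'start': new Date($start)") ++
      // The 'end' field is only emitted when an end time is known.
      end.map(t => s"'end': new Date($t)") ++
      Seq(s"""'content': '<div class="$contentClass" data-title="$dataTitle">$contentBody</div>'""")
    fields.mkString("{\n  ", ",\n  ", "\n}")
  }
}
```

Keeping each field on its own line makes it easier to see where `data-title` ends, which was the readability complaint.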

## What changes were proposed in this pull request?

This PR aims at improving the way physical plans are explained in Spark.

Currently, the explain output for a physical plan may look very cluttered, and each operator's string representation can be very wide and wrap around in the display, making it a little hard to follow. This especially happens when explaining a query that
1) operates on wide tables
2) has complex expressions, etc.

This PR attempts to split the output into two sections. In the header section, we display the basic operator tree with a number associated with each operator. In this section, we strictly control what we output for each operator. In the footer section, each operator is verbosely displayed. Based on the feedback from Maryann, the uncorrelated subqueries (SubqueryExecs) are not included in the main plan. They are printed separately after the main plan and can be correlated by the originating expression id from its parent plan.

To illustrate, here is a simple plan displayed in the old vs. the new way.

Example query1 :
```
EXPLAIN SELECT key, Max(val) FROM explain_temp1 WHERE key > 0 GROUP BY key HAVING max(val) > 0
```

Old :
```
*(2) Project [key#2, max(val)#15]
+- *(2) Filter (isnotnull(max(val#3)#18) AND (max(val#3)#18 > 0))
   +- *(2) HashAggregate(keys=[key#2], functions=[max(val#3)], output=[key#2, max(val)#15, max(val#3)#18])
      +- Exchange hashpartitioning(key#2, 200)
         +- *(1) HashAggregate(keys=[key#2], functions=[partial_max(val#3)], output=[key#2, max#21])
            +- *(1) Project [key#2, val#3]
               +- *(1) Filter (isnotnull(key#2) AND (key#2 > 0))
                  +- *(1) FileScan parquet default.explain_temp1[key#2,val#3] Batched: true, DataFilters: [isnotnull(key#2), (key#2 > 0)], Format: Parquet, Location: InMemoryFileIndex[file:/user/hive/warehouse/explain_temp1], PartitionFilters: [], PushedFilters: [IsNotNull(key), GreaterThan(key,0)], ReadSchema: struct<key:int,val:int>
```

New :
```
Project (8)
+- Filter (7)
   +- HashAggregate (6)
      +- Exchange (5)
         +- HashAggregate (4)
            +- Project (3)
               +- Filter (2)
                  +- Scan parquet default.explain_temp1 (1)

(1) Scan parquet default.explain_temp1 [codegen id : 1]
Output: [key#2, val#3]

(2) Filter [codegen id : 1]
Input : [key#2, val#3]
Condition : (isnotnull(key#2) AND (key#2 > 0))

(3) Project [codegen id : 1]
Output : [key#2, val#3]
Input : [key#2, val#3]

(4) HashAggregate [codegen id : 1]
Input: [key#2, val#3]

(5) Exchange
Input: [key#2, max#11]

(6) HashAggregate [codegen id : 2]
Input: [key#2, max#11]

(7) Filter [codegen id : 2]
Input : [key#2, max(val)#5, max(val#3)#8]
Condition : (isnotnull(max(val#3)#8) AND (max(val#3)#8 > 0))

(8) Project [codegen id : 2]
Output : [key#2, max(val)#5]
Input : [key#2, max(val)#5, max(val#3)#8]
```

Example Query2 (subquery):
```
SELECT *
FROM explain_temp1
WHERE KEY = (SELECT Max(KEY)
             FROM explain_temp2
             WHERE KEY = (SELECT Max(KEY)
                          FROM explain_temp3
                          WHERE val > 0)
               AND val = 2)
  AND val > 3
```

Old:
```
*(1) Project [key#2, val#3]
+- *(1) Filter (((isnotnull(KEY#2) AND isnotnull(val#3)) AND (KEY#2 = Subquery scalar-subquery#39)) AND (val#3 > 3))
   :  +- Subquery scalar-subquery#39
   :     +- *(2) HashAggregate(keys=[], functions=[max(KEY#26)], output=[max(KEY)#45])
   :        +- Exchange SinglePartition
   :           +- *(1) HashAggregate(keys=[], functions=[partial_max(KEY#26)], output=[max#47])
   :              +- *(1) Project [key#26]
   :                 +- *(1) Filter (((isnotnull(KEY#26) AND isnotnull(val#27)) AND (KEY#26 = Subquery scalar-subquery#38)) AND (val#27 = 2))
   :                    :  +- Subquery scalar-subquery#38
   :                    :     +- *(2) HashAggregate(keys=[], functions=[max(KEY#28)], output=[max(KEY)#43])
   :                    :        +- Exchange SinglePartition
   :                    :           +- *(1) HashAggregate(keys=[], functions=[partial_max(KEY#28)], output=[max#49])
   :                    :              +- *(1) Project [key#28]
   :                    :                 +- *(1) Filter (isnotnull(val#29) AND (val#29 > 0))
   :                    :                    +- *(1) FileScan parquet default.explain_temp3[key#28,val#29] Batched: true, DataFilters: [isnotnull(val#29), (val#29 > 0)], Format: Parquet, Location: InMemoryFileIndex[file:/user/hive/warehouse/explain_temp3], PartitionFilters: [], PushedFilters: [IsNotNull(val), GreaterThan(val,0)], ReadSchema: struct<key:int,val:int>
   :                    +- *(1) FileScan parquet default.explain_temp2[key#26,val#27] Batched: true, DataFilters: [isnotnull(key#26), isnotnull(val#27), (val#27 = 2)], Format: Parquet, Location: InMemoryFileIndex[file:/user/hive/warehouse/explain_temp2], PartitionFilters: [], PushedFilters: [IsNotNull(key), IsNotNull(val), EqualTo(val,2)], ReadSchema: struct<key:int,val:int>
   +- *(1) FileScan parquet default.explain_temp1[key#2,val#3] Batched: true, DataFilters: [isnotnull(key#2), isnotnull(val#3), (val#3 > 3)], Format: Parquet, Location: InMemoryFileIndex[file:/user/hive/warehouse/explain_temp1], PartitionFilters: [], PushedFilters: [IsNotNull(key), IsNotNull(val), GreaterThan(val,3)], ReadSchema: struct<key:int,val:int>
```

New:
```
Project (3)
+- Filter (2)
   +- Scan parquet default.explain_temp1 (1)

(1) Scan parquet default.explain_temp1 [codegen id : 1]
Output: [key#2, val#3]

(2) Filter [codegen id : 1]
Input : [key#2, val#3]
Condition : (((isnotnull(KEY#2) AND isnotnull(val#3)) AND (KEY#2 = Subquery scalar-subquery#23)) AND (val#3 > 3))

(3) Project [codegen id : 1]
Output : [key#2, val#3]
Input : [key#2, val#3]

===== Subqueries =====

Subquery:1 Hosting operator id = 2 Hosting Expression = Subquery scalar-subquery#23
HashAggregate (9)
+- Exchange (8)
   +- HashAggregate (7)
      +- Project (6)
         +- Filter (5)
            +- Scan parquet default.explain_temp2 (4)

(4) Scan parquet default.explain_temp2 [codegen id : 1]
Output: [key#26, val#27]

(5) Filter [codegen id : 1]
Input : [key#26, val#27]
Condition : (((isnotnull(KEY#26) AND isnotnull(val#27)) AND (KEY#26 = Subquery scalar-subquery#22)) AND (val#27 = 2))

(6) Project [codegen id : 1]
Output : [key#26]
Input : [key#26, val#27]

(7) HashAggregate [codegen id : 1]
Input: [key#26]

(8) Exchange
Input: [max#35]

(9) HashAggregate [codegen id : 2]
Input: [max#35]

Subquery:2 Hosting operator id = 5 Hosting Expression = Subquery scalar-subquery#22
HashAggregate (15)
+- Exchange (14)
   +- HashAggregate (13)
      +- Project (12)
         +- Filter (11)
            +- Scan parquet default.explain_temp3 (10)

(10) Scan parquet default.explain_temp3 [codegen id : 1]
Output: [key#28, val#29]

(11) Filter [codegen id : 1]
Input : [key#28, val#29]
Condition : (isnotnull(val#29) AND (val#29 > 0))

(12) Project [codegen id : 1]
Output : [key#28]
Input : [key#28, val#29]

(13) HashAggregate [codegen id : 1]
Input: [key#28]

(14) Exchange
Input: [max#37]

(15) HashAggregate [codegen id : 2]
Input: [max#37]
```

Note: I opened this PR as a WIP to start getting feedback. I will be on vacation starting tomorrow and will not be able to immediately incorporate the feedback. I will start to work on it as soon as I can. Also, currently this PR provides a basic infrastructure for explain enhancement. The details about individual operators will be implemented in follow-up PRs.

## How was this patch tested?

Added a new test `explain.sql` that tests basic scenarios. Need to add more tests.

Closes apache#24759 from dilipbiswal/explain_feature.

Authored-by: Dilip Biswal <[email protected]>
Signed-off-by: Wenchen Fan <[email protected]>
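
The two-section layout described in that commit message can be sketched roughly as follows. This is an illustration only, not Spark's implementation: `Op`, `Numbered`, and `FormattedExplain` are hypothetical names, ids are assumed to be assigned bottom-up so that the leaf scan gets 1 as in the examples, and the header rendering only handles the simple indentation seen in single-child chains.

```scala
// Hypothetical, minimal operator tree; not Spark's SparkPlan.
final case class Op(name: String, details: Seq[String], children: Seq[Op] = Nil)

// An operator together with the id assigned to it.
final case class Numbered(id: Int, op: Op, children: Seq[Numbered])

object FormattedExplain {
  // Assign ids in post-order, threading the next free id through the traversal.
  private def number(op: Op, next: Int): (Numbered, Int) = {
    val (kids, afterKids) = op.children.foldLeft((Vector.empty[Numbered], next)) {
      case ((acc, n), child) =>
        val (numbered, n2) = number(child, n)
        (acc :+ numbered, n2)
    }
    (Numbered(afterKids, op, kids), afterKids + 1)
  }

  // Header section: operator name and id only, one line per operator.
  private def header(node: Numbered, depth: Int): Seq[String] = {
    val prefix = if (depth == 0) "" else ("   " * (depth - 1)) + "+- "
    s"$prefix${node.op.name} (${node.id})" +: node.children.flatMap(header(_, depth + 1))
  }

  private def flatten(node: Numbered): Seq[Numbered] =
    node +: node.children.flatMap(flatten)

  // Footer section: each operator's details, listed in id order.
  def explain(root: Op): String = {
    val (numbered, _) = number(root, 1)
    val footer = flatten(numbered).sortBy(_.id).map { n =>
      (s"(${n.id}) ${n.op.name}" +: n.op.details).mkString("\n")
    }
    (header(numbered, 0).mkString("\n") +: footer).mkString("\n\n")
  }
}
```

Because the header prints only names and ids, its width no longer depends on how wide the table or how complex the expressions are; the verbose details move to the footer.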

vanzin force-pushed the shs-ng/M4.2 branch from 1f58877 to f70c9bf Compare August 10, 2017 01:36

vanzin force-pushed the shs-ng/M4.1 branch from b541970 to 6d652c4 Compare September 28, 2017 17:54

vanzin force-pushed the shs-ng/M4.2 branch from f70c9bf to 193c0e0 Compare September 28, 2017 17:54

squito reviewed Oct 25, 2017

vanzin force-pushed the shs-ng/M4.1 branch from 6d652c4 to 01a9783 Compare October 26, 2017 18:28

vanzin force-pushed the shs-ng/M4.2 branch from 193c0e0 to a1f1b55 Compare October 26, 2017 18:28

vanzin force-pushed the shs-ng/M4.1 branch from 01a9783 to 0a34327 Compare October 26, 2017 21:13

vanzin force-pushed the shs-ng/M4.2 branch from a1f1b55 to 4687aed Compare October 26, 2017 21:13

vanzin force-pushed the shs-ng/M4.1 branch from 0a34327 to 22c7967 Compare November 6, 2017 19:34

vanzin force-pushed the shs-ng/M4.2 branch from 4687aed to 0dd7b54 Compare November 6, 2017 19:34

vanzin closed this Nov 6, 2017

vanzin mentioned this pull request Nov 6, 2017

[SPARK-20646][core] Port executors page to new UI backend. apache/spark#19678

Closed

vanzin deleted the shs-ng/M4.2 branch April 25, 2019 16:54
