Skip to content

Conversation

@suo
Copy link
Member

@suo suo commented Jul 9, 2022

Stack from ghstack (oldest at bottom):

print_test_stats.py had a somewhat misleading name, as it was also
responsible for uploading test statistics to both S3 and Scuba.

We are using Rockset for most of this now, so removing the redundant
code.

The following are things we do not have replacements for:

  • The actual display of regressed or slow tests in the
    print_test_stats job step. I don't think people really looked at these
    so not a big loss.
  • Scuba uploads. This is sad, but fundamentally I think we probably
    don't want to maintain two data pipelines.

print_test_stats.py had a somewhat misleading name, as it was also
responsible for uploading test statistics to both S3 and Scuba.

We are using Rockset for most of this now, so removing the redundant
code.

The following are things we do *not* have replacements for:
- The actual display of regressed or slow tests in the
`print_test_stats` job step. I don't think people really looked at these
so not a big loss.
- Scuba uploads. This is sad, but fundamentally I think we probably
don't want to maintain two data pipelines.

[ghstack-poisoned]
@suo suo requested a review from a team as a code owner July 9, 2022 06:57
@facebook-github-bot
Copy link
Contributor

facebook-github-bot commented Jul 9, 2022

🔗 Helpful links

❌ 1 New Failures

As of commit ae990c1 (more details on the Dr. CI page):

Expand to see more
  • 1/1 failures introduced in this PR

🕵️ 1 new failure recognized by patterns

The following CI failures do not appear to be due to upstream breakages

See GitHub Actions build pull / linux-focal-py3.7-gcc7 / test (backwards_compat, 1, 1, linux.2xlarge) (1/1)

Step: "Test" (full log | diagnosis details | 🔁 rerun)

2022-07-09T07:23:29.4321191Z The PR is introduc...m to confirm whether this change is wanted or not.
2022-07-09T07:23:29.4304782Z processing existing schema:  duration_ns(__torch__.torch.classes.profiling.InstructionStats _0) -> int _0
2022-07-09T07:23:29.4306881Z processing existing schema:  source(__torch__.torch.classes.profiling.SourceStats _0) -> __torch__.torch.classes.profiling.SourceRef _0
2022-07-09T07:23:29.4308938Z processing existing schema:  line_map(__torch__.torch.classes.profiling.SourceStats _0) -> Dict(int, __torch__.torch.classes.profiling.InstructionStats) _0
2022-07-09T07:23:29.4310238Z processing existing schema:  __init__(__torch__.torch.classes.profiling._ScriptProfile _0) -> NoneType _0
2022-07-09T07:23:29.4311994Z processing existing schema:  enable(__torch__.torch.classes.profiling._ScriptProfile _0) -> NoneType _0
2022-07-09T07:23:29.4313483Z processing existing schema:  disable(__torch__.torch.classes.profiling._ScriptProfile _0) -> NoneType _0
2022-07-09T07:23:29.4315758Z processing existing schema:  _dump_stats(__torch__.torch.classes.profiling._ScriptProfile _0) -> __torch__.torch.classes.profiling.SourceStats[] _0
2022-07-09T07:23:29.4317488Z processing existing schema:  __init__(__torch__.torch.classes.c10d.ProcessGroup _0, int _1, int _2) -> NoneType _0
2022-07-09T07:23:29.4318664Z processing existing schema:  __init__(__torch__.torch.classes.c10d.Work _0) -> NoneType _0
2022-07-09T07:23:29.4320448Z processing existing schema:  __init__(__torch__.torch.classes.dist_rpc.WorkerInfo _0, str _1, int _2) -> NoneType _0
2022-07-09T07:23:29.4321191Z The PR is introducing backward incompatible changes to the operator library. Please contact PyTorch team to confirm whether this change is wanted or not. 
2022-07-09T07:23:29.4321714Z 
2022-07-09T07:23:29.4321800Z Broken ops: [
2022-07-09T07:23:29.4322195Z 	__getstate__(__torch__.torch.classes.sparse.LinearPackedParamsBase _0) -> ((Tensor, Tensor?, int[]) _0)
2022-07-09T07:23:29.4322708Z 	__setstate__(__torch__.torch.classes.sparse.LinearPackedParamsBase _0, (Tensor, Tensor?, int[]) _1) -> NoneType _0
2022-07-09T07:23:29.4322982Z ]
2022-07-09T07:23:29.5543393Z ##[error]Process completed with exit code 1.
2022-07-09T07:23:29.5572937Z Prepare all required actions
2022-07-09T07:23:29.5573227Z Getting action download info
2022-07-09T07:23:29.6847578Z ##[group]Run ./.github/actions/get-workflow-job-id
2022-07-09T07:23:29.6847802Z with:

This comment was automatically generated by Dr. CI (expand for details).

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

print_test_stats.py had a somewhat misleading name, as it was also
responsible for uploading test statistics to both S3 and Scuba.

We are using Rockset for most of this now, so removing the redundant
code.

The following are things we do *not* have replacements for:
- The actual display of regressed or slow tests in the
`print_test_stats` job step. I don't think people really looked at these
so not a big loss.
- Scuba uploads. This is sad, but fundamentally I think we probably
don't want to maintain two data pipelines.

[ghstack-poisoned]
Copy link
Contributor

@janeyx99 janeyx99 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A blocker to this is the test_history.py script that the distributed team uses for test case info. Those stats read from S3 to generate a historical view across jobs--maybe an observable view would be sufficient?

cc. @pritamdamania87 (or @pritamdamania?) last time we talked, you were still using this script which was why I didn't merge #73098. Do you still use the script?

@github-actions
Copy link
Contributor

github-actions bot commented Sep 9, 2022

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

@github-actions github-actions bot added the Stale label Sep 9, 2022
@facebook-github-bot
Copy link
Contributor

/easycla

As part of the transition to the PyTorch Foundation, this project now requires contributions be covered under the new CLA. See #85559 for additional details.

This comment will trigger a new check of this PR. If you are already covered, you will simply see a new "EasyCLA" check that passes. If you are not covered, a bot will leave a new comment with a link to sign.

@linux-foundation-easycla
Copy link

CLA Not Signed

@github-actions github-actions bot closed this Nov 3, 2022
@facebook-github-bot facebook-github-bot deleted the gh/suo/589/head branch June 8, 2023 18:50
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cla signed module: rocm AMD GPU support for Pytorch Stale

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants