
Conversation

@alykhantejani
Contributor

Seed all cuda devices when torch.manual_seed is called (issue #1615)
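
For readers landing here later, a minimal sketch of what this change means in practice (written against today's API; the seed value and tensor shapes are arbitrary):

```python
import torch

# After this PR, one call seeds the CPU generator and every visible CUDA device.
torch.manual_seed(42)

# Previously, the CUDA generators had to be seeded separately as well:
# torch.cuda.manual_seed_all(42)

cpu_draw = torch.rand(3)                     # reproducible across runs
if torch.cuda.is_available():
    gpu_draw = torch.rand(3, device="cuda")  # now also reproducible
```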

@alykhantejani
Contributor Author

Hmm, a stray Conv.cpp change seems to have snuck into this push - let me fix that.

@alykhantejani
Contributor Author

OK, it's fixed - sorry about that.

@apaszke merged commit 5f1a16a into pytorch:master Jun 10, 2017
@Kaixhin
Contributor

Kaixhin commented Jun 20, 2017

What was the rationale behind this? I can't set random seeds in my CPU multiprocessing code because it's now trying to set CUDA seeds. Seeding everything at once might be more convenient than controlling CUDA seeds manually, but this behaviour seems like a bug to me.

  File "/home/ka709/ferry/test.py", line 13, in test
    torch.manual_seed(args.seed + rank)
  File "/home/ka709/miniconda3/lib/python3.6/site-packages/torch/__init__.py", line 133, in manual_seed
    torch.cuda.manual_seed_all(seed)
  File "/home/ka709/miniconda3/lib/python3.6/site-packages/torch/cuda/random.py", line 37, in manual_seed_all
    _lazy_init()
  File "/home/ka709/miniconda3/lib/python3.6/site-packages/torch/cuda/__init__.py", line 83, in _lazy_init
    "Cannot re-initialize CUDA in forked subprocess. " + msg)
RuntimeError: Cannot re-initialize CUDA in forked subprocess. To use CUDA with multiprocessing, you must use the 'spawn' start method
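
For anyone else who hits this: the error message itself names the workaround. A minimal sketch (hypothetical worker function and seed, mirroring the traceback above) that uses the 'spawn' start method so each child process is allowed to initialize CUDA on its own:

```python
import torch
import torch.multiprocessing as mp

def test(rank, seed):
    # Under 'spawn' the child starts fresh, so torch.manual_seed may lazily
    # initialize CUDA here without the re-initialization error above.
    torch.manual_seed(seed + rank)
    print(rank, torch.rand(1))

if __name__ == "__main__":
    mp.set_start_method("spawn")  # the default on Linux is 'fork'
    processes = [mp.Process(target=test, args=(rank, 123)) for rank in range(4)]
    for p in processes:
        p.start()
    for p in processes:
        p.join()
```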

@alykhantejani
Contributor Author

Bug report filed here: #1856

jjsjann123 pushed a commit to jjsjann123/pytorch that referenced this pull request Jun 22, 2022
pytorchmergebot pushed a commit that referenced this pull request Jul 13, 2022
Syncing nvfuser devel branch to upstream master. https://github.com/csarofeen/pytorch/

Code changes include:

- TransformPropagator refactor: switched to Dijkstra instead of exhaustive enumeration of all possible paths to reduce compilation time on transform propagation (see the sketch after this list);
- Indexing refactor: remove reference tensor creation in all tensor indexing logic (#1690)
- (more) generic grouped grid reduction kernel;
- Minor parser/fuser patches:
  1. zero-dim tensor reduction support
  2. no-op binary removal within fused graph
  3. expand supported in fusion
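
As an aside, a hedged sketch of the algorithmic idea behind the TransformPropagator change (generic Dijkstra over a toy graph, not nvfuser's actual code): instead of enumerating every path through the tensor DAG, propagation reaches each tensor once, along its cheapest path from the starting tensor.

```python
import heapq

def cheapest_propagation_paths(graph, source):
    """Generic Dijkstra: minimal total edge cost from `source` to every node.

    `graph` maps a node to a list of (neighbor, cost) pairs; cost stands in
    for how expensive propagating a transform along that edge would be.
    """
    dist = {source: 0}
    heap = [(0, source)]
    while heap:
        d, node = heapq.heappop(heap)
        if d > dist.get(node, float("inf")):
            continue  # stale heap entry
        for neighbor, cost in graph.get(node, []):
            nd = d + cost
            if nd < dist.get(neighbor, float("inf")):
                dist[neighbor] = nd
                heapq.heappush(heap, (nd, neighbor))
    return dist

# Toy DAG: t0 -> t1 -> t3 and t0 -> t2 -> t3, with unequal edge costs.
print(cheapest_propagation_paths(
    {"t0": [("t1", 1), ("t2", 4)], "t1": [("t3", 1)], "t2": [("t3", 1)]},
    "t0",
))  # {'t0': 0, 't1': 1, 't2': 4, 't3': 2}
```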

Squashed commits to work around the GitHub API.
Commits that are actually in this PR from the devel branch:

```
a054b3e Refactor TransormPropagator to allow specifying a position and propagating to part of the DAG (#1775)
d67e1cd Indexing refactor stage 1: remove reference tensor creation in all tensor indexing logic (#1690)
1b65299 Issue 1770 (#1774)
35b0427 Avoid compilation errors like below: (#1773)
452c773 Ignore reductions of zero-dim tensors per PyTorch conventions (#1771)
31d6c56 TransformPropagator refactor (#1769)
570c5a8 Merge pull request #1767 from csarofeen/upstream_merge_0621
9d6c3d8 merging upstream 61305cd
0ed815f New TransformPropagator algorithm (#1763)
6c19520 no-op binary removal (#1764)
ec7fa41 Proper propagation of IterType (#1762)
b263562 Fix dimensionality check (#1759)
2d6343f More generic grouped grid reduction kernel (#1740)
64e2b56 [nvfuser] prevent spamming warning message (#77777) (#1758)
0c43162 [nvFuser] Improving bitwise ops support (#77158) (#1757)
b93a147 Parser expand (#1754)
```

RUN_TORCHBENCH: nvfuser
Pull Request resolved: #80355
Approved by: https://github.com/davidberard98
facebook-github-bot pushed a commit that referenced this pull request Jul 13, 2022

Pull Request resolved: #80355

Reviewed By: qihqi

Differential Revision: D37573400

Pulled By: davidberard98

fbshipit-source-id: 52ab68d89ec01ef61f69f5abeb18c9d3a312aa64
