Conversation

@dmitriy-serdyuk
Contributor

Fixes #8652, fixes #8957

Contributor

@facebook-github-bot left a comment

@ezyang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@ezyang
Contributor

ezyang commented Jun 27, 2018

@pytorchbot retest this please

@alsrgv

alsrgv commented Aug 3, 2018

This change makes DistributedSampler very slow. To reproduce, take the MNIST example and add a DistributedSampler:

    train_dataset = \
        datasets.MNIST('../data', train=True, download=True,
                       transform=transforms.Compose([
                           transforms.ToTensor(),
                           transforms.Normalize((0.1307,), (0.3081,))
                       ]))
    train_sampler = torch.utils.data.distributed.DistributedSampler(
        train_dataset, num_replicas=1, rank=0)
    train_loader = torch.utils.data.DataLoader(
        train_dataset, batch_size=args.batch_size, sampler=train_sampler,
        **kwargs)

The reason seems to be that it creates a separate 0d tensor for each index element. I was able to make it fast again by adding a cast to int.
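The slowdown can be reproduced in isolation: iterating over (or calling `list()` on) a 1d tensor now yields 0-dimensional tensors, while `.tolist()` yields plain Python ints. A minimal sketch of the difference (variable names are illustrative):

```python
import torch

g = torch.Generator()
g.manual_seed(0)
perm = torch.randperm(10, generator=g)

# list(tensor) produces one 0d tensor per element -- every dataset index
# becomes a tensor object, which makes downstream indexing slow.
as_tensors = list(perm)
print(type(as_tensors[0]))  # <class 'torch.Tensor'>

# tensor.tolist() produces plain Python ints, restoring the old behavior.
as_ints = perm.tolist()
print(type(as_ints[0]))  # <class 'int'>
```

Both lists contain the same permutation of 0..9; only the element type differs.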

cc @dmitriy-serdyuk, @apaszke

@fmassa
Member

fmassa commented Aug 3, 2018

Thanks for the report @alsrgv!
The better fix is to replace this line with something like

indices = torch.randperm(len(self.dataset), generator=g).tolist()

since list(tensor) now returns a list of 0d tensors, whereas what we want is a list of Python numbers (obtained via tensor.tolist()).

Can you send a PR fixing this?
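Applied to the sampler, the fix is a one-line change in `__iter__`. A simplified sketch of just the index computation (epoch handling and padding are elided; this is not the full upstream class):

```python
import torch

class SimplifiedDistributedSampler:
    """Illustrative sketch of the shuffled-index computation only."""

    def __init__(self, dataset_len, num_replicas=1, rank=0, seed=0):
        self.dataset_len = dataset_len
        self.num_replicas = num_replicas
        self.rank = rank
        self.seed = seed

    def __iter__(self):
        g = torch.Generator()
        g.manual_seed(self.seed)
        # .tolist() yields Python ints; list(tensor) would yield 0d tensors
        # and trigger the slowdown described above.
        indices = torch.randperm(self.dataset_len, generator=g).tolist()
        # Each replica takes a strided slice of the shuffled indices.
        return iter(indices[self.rank::self.num_replicas])
```

Iterating this sampler now produces plain ints, so the DataLoader's indexing path stays fast.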

@bearpelican

Fixes it for me!

alsrgv pushed a commit to alsrgv/pytorch that referenced this pull request Aug 8, 2018
@alsrgv

alsrgv commented Aug 8, 2018

@fmassa, thanks for the suggestion. Submitted as #10361

facebook-github-bot pushed a commit that referenced this pull request Aug 9, 2018
Summary: Pull Request resolved: #10361

Differential Revision: D9240798

Pulled By: ezyang

fbshipit-source-id: dc4cfe79612f711bbcff34a147877df6a5f7b89f
PenghuiCheng pushed a commit to PenghuiCheng/pytorch that referenced this pull request Aug 10, 2018
goodlux pushed a commit to goodlux/pytorch that referenced this pull request Aug 15, 2018
facebook-github-bot pushed a commit that referenced this pull request Aug 22, 2018
Summary:
Since #8958 was merged, BatchSampler samples 0d tensors from WeightedRandomSampler instead of integers, which significantly reduces performance. This PR fixes it the same way #10361 fixed DistributedSampler.
Pull Request resolved: #10636

Differential Revision: D9423869

Pulled By: zou3519

fbshipit-source-id: f94da2d4cccf70e63beea6cfc3d1230b5610ae44
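The WeightedRandomSampler fix follows the same pattern: append `.tolist()` to the sampled index tensor before iterating, so BatchSampler receives Python ints. A hedged sketch of the sampling step (simplified class, not the exact upstream code):

```python
import torch

class SimplifiedWeightedSampler:
    """Sketch of weighted sampling via torch.multinomial only."""

    def __init__(self, weights, num_samples, replacement=True):
        self.weights = torch.as_tensor(weights, dtype=torch.double)
        self.num_samples = num_samples
        self.replacement = replacement

    def __iter__(self):
        samples = torch.multinomial(self.weights, self.num_samples,
                                    self.replacement)
        # .tolist() converts the 1d index tensor into Python ints,
        # avoiding the per-element 0d tensors that slowed BatchSampler down.
        return iter(samples.tolist())
```

Iterating yields ints drawn in proportion to the given weights.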
PenghuiCheng pushed a commit to PenghuiCheng/pytorch that referenced this pull request Sep 11, 2018

Successfully merging this pull request may close these issues: "BatchSampler docstring is outdated" and "Batch Sampler and Dataset indexing too restricted".
