@ssnl
Collaborator

@ssnl ssnl commented Jul 20, 2018

second trial of #7140

cc @csarofeen Let's see if this works. It passes everything locally.

Contributor

@facebook-github-bot facebook-github-bot left a comment

@ssnl has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@ssnl
Collaborator Author

ssnl commented Jul 20, 2018

@pytorchbot retest this please

Contributor

@facebook-github-bot facebook-github-bot left a comment

@ssnl has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@ssnl ssnl mentioned this pull request Jul 21, 2018
Contributor

@facebook-github-bot facebook-github-bot left a comment

@ssnl has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@csarofeen
Contributor

This is something I was thinking of too. I don't know why, but it still hangs very occasionally. I totally agree it should work, and I will try to figure out why it still hangs on occasion.

@ssnl
Collaborator Author

ssnl commented Jul 21, 2018

@pytorchbot retest this please

@csarofeen
Contributor

I've been running a script over and over today to check. The hang only happened twice out of many hundred runs, so who knows. I think it should be fine.

@ssnl
Collaborator Author

ssnl commented Jul 22, 2018 via email

@ssnl
Collaborator Author

ssnl commented Jul 22, 2018 via email

Contributor

@apaszke apaszke left a comment

Generally looks good, but I'd rather get rid of the done_event unless it's necessary.

torch.manual_seed(seed)

# Do not wait for putting thread to join when this worker exits. Otherwise,
# this worker may always be waiting to put and doesn't check index_queue

- if r is None:
+ # use done_event so that we can get faster exiting signal even if there
+ # are still indices in index_queue
+ if r is None or done_event.is_set():
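
The check in this hunk can be illustrated with a minimal sketch of a worker loop. This is not the PR's actual code: `worker_loop`, `load_fn`, and the `(idx, batch_indices)` tuple layout are hypothetical names chosen for illustration, and the sketch uses thread-level `queue.Queue` and `threading.Event` so it runs in a single process.

```python
import queue
import threading


def worker_loop(index_queue, data_queue, done_event, load_fn):
    """Hypothetical worker loop: process index batches until told to stop.

    Returns the number of batches processed.
    """
    processed = 0
    while True:
        r = index_queue.get()
        # Stop on the None sentinel, or as soon as done_event is set,
        # even if more indices are still waiting in index_queue.
        if r is None or done_event.is_set():
            break
        idx, batch_indices = r
        data_queue.put((idx, load_fn(batch_indices)))
        processed += 1
    return processed
```

With `done_event` already set, the loop exits after the first `get()` and leaves the remaining indices unprocessed, which is the "faster exiting signal" the code comment describes.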

  self.index_queues = [multiprocessing.Queue() for _ in range(self.num_workers)]
  self.worker_queue_idx = 0
- self.worker_result_queue = multiprocessing.SimpleQueue()
+ self.worker_result_queue = multiprocessing.Queue()
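
One practical difference between the two queue types: `multiprocessing.Queue.get()` accepts a `timeout` and raises `queue.Empty` when it expires, while `multiprocessing.SimpleQueue.get()` takes no timeout and blocks until an item arrives. The thread does not state the motivation for the switch, so treating this as the reason is an assumption. A minimal sketch of a timed read, with `try_get` as a hypothetical helper name:

```python
import queue


def try_get(q, timeout):
    """Return (True, item) if an item arrives within `timeout` seconds,
    otherwise (False, None).

    Works with any queue whose get() accepts a timeout and raises
    queue.Empty, such as queue.Queue or multiprocessing.Queue.
    """
    try:
        return True, q.get(timeout=timeout)
    except queue.Empty:
        return False, None
```

A caller can use the `False` result to check worker status or a shutdown flag instead of blocking forever on an empty queue.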

if self.pin_memory or self.timeout > 0:
if self.pin_memory:
self.data_queue = queue.Queue()
if self.pin_memory:

# removes pids no matter what
if not self.shutdown:
    self.shutdown = True
    self.done_event.set()
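
The guarded shutdown in this hunk can be sketched as an idempotent method. The class below is hypothetical (`LoaderShutdownSketch` and `shutdown_workers` are illustrative names, not the PR's code), and sending a `None` sentinel to each index queue is an assumption based on the `r is None` check in the worker loop. It uses thread-level primitives so it runs in one process.

```python
import queue
import threading


class LoaderShutdownSketch:
    """Hypothetical stand-in for the loader's shutdown bookkeeping."""

    def __init__(self, num_workers):
        self.shutdown = False
        self.done_event = threading.Event()
        self.index_queues = [queue.Queue() for _ in range(num_workers)]

    def shutdown_workers(self):
        # The flag makes repeated calls harmless: the event is set and
        # the sentinels are sent only on the first call.
        if not self.shutdown:
            self.shutdown = True
            self.done_event.set()
            for q in self.index_queues:
                q.put(None)  # assumed sentinel that wakes a blocked worker
```

Setting the event before sending the sentinels means a worker that wakes up on any queue item sees the exit signal immediately.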

- time.sleep(self.sleep_sec)
+ if not self.sleeped:
+     time.sleep(self.sleep_sec)
+     self.sleeped = True
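
The sleep-once pattern in this hunk can be sketched as a small test dataset. `SleepOnceDataset` is a hypothetical name and the class is an illustration under assumptions, not the test code from the PR. It keeps the `sleeped` attribute name from the hunk.

```python
import time


class SleepOnceDataset:
    """Hypothetical dataset that delays only its first item access."""

    def __init__(self, size, sleep_sec):
        self.size = size
        self.sleep_sec = sleep_sec
        self.sleeped = False

    def __getitem__(self, idx):
        # Sleep on the first access only, so later reads return promptly.
        if not self.sleeped:
            time.sleep(self.sleep_sec)
            self.sleeped = True
        return idx

    def __len__(self):
        return self.size
```

When such a dataset is used with multiple worker processes, each worker holds its own copy of the object, so each worker would sleep once on its own first access.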

Contributor

@facebook-github-bot facebook-github-bot left a comment

@ssnl has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

ssnl added a commit that referenced this pull request Jul 23, 2018
Summary:
second trial of #7140

cc csarofeen Let's see if this works. It passes everything locally.
Pull Request resolved: #9655

Differential Revision: D8940177

Pulled By: SsnL

fbshipit-source-id: 8d6340fc9f7355c71e1e26b262da166402faa158
@ssnl ssnl deleted the dl branch July 23, 2018 03:55
@ssnl ssnl restored the dl branch July 25, 2018 02:48
ssnl added a commit to ssnl/pytorch that referenced this pull request Jul 25, 2018
ssnl added a commit to ssnl/pytorch that referenced this pull request Jul 25, 2018
ssnl added a commit that referenced this pull request Jul 25, 2018
facebook-github-bot pushed a commit that referenced this pull request Jul 25, 2018
…" (#9804)

Summary:
This reverts commit 9ee5133.
Pull Request resolved: #9804

Reviewed By: ezyang

Differential Revision: D8987780

Pulled By: SsnL

fbshipit-source-id: 75ad70b0b8d672d0b35235fa248b187be64b68e5
facebook-github-bot pushed a commit that referenced this pull request Aug 9, 2018
)

Summary:
#9655
Pull Request resolved: #10366

Differential Revision: D9237393

Pulled By: SsnL

fbshipit-source-id: fabfad7f371ba33300098f6b885c0e3f26c3e14a