
More efficient SmartCacheLoader#2181

Merged
wyli merged 8 commits into Project-MONAI:dev from cgrain:CoenGruijt_Patch_efficient_loader on May 12, 2021

Conversation

@cgrain
Contributor

@cgrain cgrain commented May 12, 2021

Description

On this website it is advised to delete tensors explicitly (with del) when the program is done with them. Applying that advice, I have made a small change in SmartCacheLoader that reflects it. I have also added a small unit test that ensures the updated cache is indeed what it needs to be.
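To illustrate, the core of the change amounts to something like the following (a sketch of the idea only; `replace_cache_in_place` is a hypothetical name, not MONAI's actual function):

```python
def replace_cache_in_place(cache, replacements):
    # Sketch, not MONAI's exact code: evict the oldest items with `del`
    # so their references are released immediately, then append the new
    # items without building a full-size copy of the cache list.
    del cache[: len(replacements)]
    cache.extend(replacements)
    return cache

print(replace_cache_in_place(list(range(5)), ["a", "b"]))  # [2, 3, 4, 'a', 'b']
```

Because the list is mutated in place, no second full-size list of references ever exists alongside the old one.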

Status

Ready

Types of changes

  • Non-breaking change (fix or new feature that would not break existing functionality).
  • Breaking change (fix or new feature that would cause existing functionality to change).
  • New tests added to cover the changes.
  • Integration tests passed locally by running ./runtests.sh -f -u --net --coverage.
  • Quick tests passed locally by running ./runtests.sh --quick --unittests.
  • In-line docstrings updated.
  • Documentation updated, tested make html command in the docs/ folder.

@rijobro
Contributor

rijobro commented May 12, 2021

Hi, I read through the blog you linked, and guess you're referring to this paragraph:

Free up memory using del
This is a common pitfall for new PyTorch users, and we think it isn’t documented enough.
After you’re done with some PyTorch tensor or variable, delete it using the python del operator to free up memory.

In the blog post there is no data to back up this claim. As such, I think we'd need to see some numbers that your implementation uses less memory than the current one.

@rijobro
Contributor

rijobro commented May 12, 2021

Also, although I agree that the current implementation is a little ugly, it seems to have unnecessary for loops. How about:

self._cache = self._cache[self._replace_num:self._replace_num + remain_num] + self._replacements[:self._replace_num]


@cgrain
Contributor Author

cgrain commented May 12, 2021

Thank you for your quick reply!

Your request for some data is very reasonable, but unfortunately I only have anecdotal evidence. I implemented my method in my own project, and instead of SLURM killing my job for running out of memory, the job ran without problems. Of course, I have not implemented or tested your method. I will see if I can create a test that shows the difference in memory use. The difficulty in checking that thoroughly is that Python's garbage collector (which is not completely deterministic) can either be quick, which will result in a small difference, or slow, which will result in higher memory spikes.
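One way to put rough numbers on this with only the standard library is `tracemalloc` (a sketch; the function names here are illustrative, not MONAI's, and with lists of small integers it only measures the list-of-references overhead, not tensor payloads):

```python
import tracemalloc

def update_by_slicing(cache, new_items):
    # builds a full-size temporary list before the old one can be collected
    return cache[len(new_items):] + new_items

def update_in_place(cache, new_items):
    # evicted entries are deleted first, so no full-size copy is created
    del cache[: len(new_items)]
    cache.extend(new_items)
    return cache

def peak_bytes(fn, cache_num=100_000, replace_num=20_000):
    # peak allocation (in bytes) while running one cache update
    cache = list(range(cache_num))
    new = list(range(replace_num))
    tracemalloc.start()
    fn(cache, new)
    _, peak = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    return peak

print("slicing :", peak_bytes(update_by_slicing))
print("in place:", peak_bytes(update_in_place))
```

With real tensors as elements, the slicing variant additionally keeps every evicted tensor alive until the old list is collected, which is where the spikes would come from.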

@rijobro
Contributor

rijobro commented May 12, 2021

Sounds good, I'm curious to see what your benchmarking gives. My worry is that if we can't trust the garbage collector here, then we can't trust it anywhere in the code base and will have to delete everything everywhere. My default position is therefore to trust the garbage collector!

@wyli
Contributor

wyli commented May 12, 2021

Did a quick test; it seems test2 is good:

import random

def test1(cache_num=100, replace_num=20):
    # element-by-element shift, then overwrite the tail
    _cache = list(range(cache_num))
    replace = [random.random() for _ in range(replace_num)]
    remain_num = cache_num - replace_num
    for i in range(remain_num):
        _cache[i] = _cache[i + replace_num]
    for i in range(replace_num):
        _cache[remain_num + i] = replace[i]
    return _cache

def test2(cache_num=100, replace_num=20):
    # delete the evicted items in place, then append the replacements
    _cache = list(range(cache_num))
    replace = [random.random() for _ in range(replace_num)]

    del _cache[:replace_num]
    _cache.extend(replace)
    return _cache

def test3(cache_num=100, replace_num=20):
    # build a new list by slicing and concatenating
    _cache = list(range(cache_num))
    replace = [random.random() for _ in range(replace_num)]
    remain_num = cache_num - replace_num
    _cache = _cache[replace_num : replace_num + remain_num] + replace[:replace_num]
    return _cache

[Screenshot: benchmark comparison of test1/test2/test3]
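The comparison in the screenshot can be reproduced along these lines with `timeit` (a sketch, not the original benchmark; the function names and iteration counts are arbitrary):

```python
import random
import timeit

CACHE_NUM, REPLACE_NUM = 100, 20
REMAIN_NUM = CACHE_NUM - REPLACE_NUM

def shift_loop(cache, replace):
    # test1-style: element-by-element shift and overwrite
    for i in range(REMAIN_NUM):
        cache[i] = cache[i + REPLACE_NUM]
    for i in range(REPLACE_NUM):
        cache[REMAIN_NUM + i] = replace[i]
    return cache

def del_extend(cache, replace):
    # test2-style: in-place delete + extend
    del cache[:REPLACE_NUM]
    cache.extend(replace)
    return cache

def slice_concat(cache, replace):
    # test3-style: slice + concatenate into a new list
    return cache[REPLACE_NUM:] + replace

for name, fn in [("shift_loop", shift_loop),
                 ("del_extend", del_extend),
                 ("slice_concat", slice_concat)]:
    t = timeit.timeit(
        lambda: fn(list(range(CACHE_NUM)),
                   [random.random() for _ in range(REPLACE_NUM)]),
        number=10_000,
    )
    print(f"{name}: {t:.3f}s")
```

All three variants produce the same cache contents for the same inputs; only the amount of copying and temporary allocation differs.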

@wyli
Contributor

wyli commented May 12, 2021

/black
thanks @cgrain, could you address this DCO issue by signing the commits? https://github.com/Project-MONAI/MONAI/pull/2181/checks?check_run_id=2565463758

@cgrain cgrain force-pushed the CoenGruijt_Patch_efficient_loader branch from 1c2507d to 19c6c55 Compare May 12, 2021 14:07
@cgrain
Contributor Author

cgrain commented May 12, 2021

Thanks @wyli, DCO should be signed now!

@wyli wyli enabled auto-merge (squash) May 12, 2021 17:15
@wyli wyli merged commit 4ef7d22 into Project-MONAI:dev May 12, 2021
yanielc pushed a commit to yanielc/MONAI that referenced this pull request May 13, 2021
* better way of managing Cache

Signed-off-by: Coen <[email protected]>

* Update test_smartcachedataset.py

Signed-off-by: Coen <[email protected]>
Signed-off-by: Yaniel Cabrera <[email protected]>
wyli pushed a commit that referenced this pull request May 26, 2021
* better way of managing Cache

Signed-off-by: Coen <[email protected]>

* Update test_smartcachedataset.py

Signed-off-by: Coen <[email protected]>
wyli pushed a commit that referenced this pull request May 27, 2021
* better way of managing Cache

Signed-off-by: Coen <[email protected]>

* Update test_smartcachedataset.py

Signed-off-by: Coen <[email protected]>