Less memory: Blob sharing across nets by name #1985
This PR implements a blob-sharing feature that can reduce memory consumption in the solver. In the previous implementation, the blobs storing intermediate results are used independently by each net, which usually causes redundant memory consumption for blob data. With this PR, the nets held by the solver, i.e. `net` and `test_nets`, share their blobs by name to save memory.

However, sharing breaks if the same blob has different shapes across nets, e.g. different batch sizes. Layers other than data layers are fine, since their tops are reshaped before every `Forward` call. I made data layers always reshape their top blobs in `Forward` as well, which changes the default behavior of `DummyDataLayer`. Another issue is that sharing breaks if the nets are used simultaneously, e.g. from multiple threads, but I believe nobody does that so far.

I have only thought about standard classification and segmentation nets, so I am not sure this will work for every possible network. I am also not sure it is the best solution (another idea is a global `Blob` pool or memory pool), but it is definitely helpful for anyone concerned about memory usage. In my experiment, it reduces memory usage by roughly 1 GB in VGG16 training (batch_size=16).

To summarize, this PR:
- adds a `share_blobs` option to `SolverParameter` (default=false)
- adds a `refill_constant` option to `DummyDataLayer` (`refill_constant=false` keeps the old behavior but might break `share_blobs=true`)
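To make the idea concrete, here is a minimal sketch of name-based blob sharing, not Caffe's actual C++ implementation: the `Blob`, `Net`, and pool names below are illustrative stand-ins. Two nets draw their intermediate buffers from one name-keyed pool, and a blob reallocates only when its requested shape changes, which is why same-shaped nets share storage but different batch sizes would force a conflicting reshape.

```python
import numpy as np

class Blob:
    """Illustrative stand-in for a Caffe blob: a reshapeable buffer."""
    def __init__(self, shape):
        self.data = np.zeros(shape, dtype=np.float32)

    def reshape(self, shape):
        # Reallocate only when the size actually changes; a net requesting
        # a different batch size here would clobber the other net's view.
        if self.data.shape != tuple(shape):
            self.data = np.zeros(shape, dtype=np.float32)

class Net:
    """A net that borrows its blobs from a shared, name-keyed pool."""
    def __init__(self, specs, shared_pool):
        self.blobs = {}
        for name, shape in specs:
            if name not in shared_pool:
                shared_pool[name] = Blob(shape)
            else:
                # Reuse the existing blob; data layers must re-request
                # their shape before every forward pass.
                shared_pool[name].reshape(shape)
            self.blobs[name] = shared_pool[name]

pool = {}
specs = [("data", (16, 3, 224, 224)), ("conv1", (16, 64, 224, 224))]
train_net = Net(specs, pool)
test_net = Net(specs, pool)

# Both nets now reference the same underlying storage for each name:
assert train_net.blobs["conv1"] is test_net.blobs["conv1"]
```

The same-shape case is the one the PR targets; with mismatched shapes the `reshape` above would reallocate on every alternation between nets, losing the benefit or breaking layers that cached the old pointer.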