Improve / Fix Weight Sharing #1211
Open
Weight sharing as-is relies on a designated weight owner whose parameter blobs the other sharing layers alias. This poses a few problems relating to loss, loading and saving parameters, and weight initialization, listed here for addressing.
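For context, sharing is declared in the net prototxt by giving parameters the same `name`, as in the siamese example; the layer and param names below are illustrative:

```
layer {
  name: "ip1"
  type: "InnerProduct"
  bottom: "data"
  top: "ip1"
  param { name: "ip1_w" }   # params with the same name are shared
  param { name: "ip1_b" }
  inner_product_param { num_output: 500 }
}
layer {
  name: "ip1_p"
  type: "InnerProduct"
  bottom: "data_p"
  top: "ip1_p"
  param { name: "ip1_w" }   # aliases ip1's weights; ip1 is the owner
  param { name: "ip1_b" }
  inner_product_param { num_output: 500 }
}
```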
- Fix incorrect momentum and history due to the separation of shared weights; fixed in "Fix weight sharing" #2866.
- Fix the resuming / fine-tuning issue for shared weights; see "Contrastive loss layer for training siamese nets" #959 (comment). As it turns out, this was done in "On-the-fly net resizing, without reallocation (where possible)" #594.
- Determine if there is actually a loss / weight ownership issue as asked at https://github.com/BVLC/caffe/pull/546/files#r16817721 by @ashafaei. [No, there is not – shelhamer]
- Save memory by accumulating gradients ("Decouple the computational batch size and minibatch size by accumulating gradients" #1977) through shared diffs; done in "Fix weight sharing" #2866.
- Load and save only the owned weights, not shared duplicates; done for HDF5 in "Snapshot model weights/solver state to HDF5 files" #2836.
- Figure out how snapshot / restore should resolve weights by layer or param name, with fallback as needed.
- Only the owner should initialize weights. Currently unnecessary work and memory are expended filling all weights, which are then discarded in favor of the owner's blobs.
- Die if weight fillers are defined in layers that don't own their parameters (the weights end up properly initialized in this case, but only because the incorrect specification is silently ignored).