Support HDF5 #224

I'm breaking the discussion about HDF5 out from #197 ...

https://github.com/BVLC/caffe/issues/2840 is a helpful summary. This is a more detailed comparison.

Are the benefits worth the effort?
## HDF5 vs. LMDB backends

|  | HDF5 | LMDB |
| --- | --- | --- |
| Compression | Whole dataset | Each datum |
| Compression options | [GZIP, LZF](http://docs.h5py.org/en/latest/high/dataset.html#lossless-compression-filters) | [Most image formats](http://docs.opencv.org/modules/highgui/doc/reading_and_writing_images_and_video.html#Mat%20imread%28const%20string&%20filename,%20int%20flags%29) |
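
To make the HDF5 column of the compression rows concrete, here is a minimal sketch using h5py and NumPy. The helper name `write_compressed`, the dataset name `data`, and the file names are placeholders chosen for illustration, not anything this project defines.

```python
import os
import tempfile

import h5py
import numpy as np


def write_compressed(path, images, compression="gzip"):
    """Write `images` to one HDF5 dataset with a lossless filter; return file size."""
    with h5py.File(path, "w") as f:
        # The filter is configured once for the whole dataset (h5py applies it
        # to the dataset's chunks), rather than encoding each image separately.
        f.create_dataset("data", data=images, compression=compression)
    return os.path.getsize(path)


# Dummy batch in N x C x H x W order; all zeros, so it compresses very well.
images = np.zeros((64, 3, 32, 32), dtype=np.float32)
tmpdir = tempfile.mkdtemp()
gzip_size = write_compressed(os.path.join(tmpdir, "gzip.h5"), images, "gzip")
lzf_size = write_compressed(os.path.join(tmpdir, "lzf.h5"), images, "lzf")
```

With LMDB, by contrast, each datum would typically be encoded on its own (for example as a PNG or JPEG) before being stored.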
## HDF5Data vs. Data layers

|  | HDF5Data | Data |
| --- | --- | --- |
| Data types | [N-D tensors](http://docs.h5py.org/en/latest/quick.html#core-concepts) | ["N<4"-D tensors](https://github.com/NVIDIA/caffe/blob/v0.12.2/src/caffe/proto/caffe.proto#L29-L31) |
| Transformations | [shuffle](https://github.com/NVIDIA/caffe/blob/v0.12.2/src/caffe/proto/caffe.proto#L531-L536) | [scale, mirror, crop, mean](https://github.com/NVIDIA/caffe/blob/v0.12.2/src/caffe/proto/caffe.proto#L345-L360) |
## Other HDF5 pros/cons
- Pros
  - Not tied to a Caffe-specific format, so datasets would be easier to share with other frameworks
  - Can have many labels/blobs all in one file
  - Can use multiple files in one HDF5Data layer
- Cons
  - Doesn't have the memory-mapping optimizations that LMDB does
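
The "many labels/blobs in one file" and "multiple files in one layer" points can be sketched as follows. This assumes the usual Caffe convention that an HDF5Data layer's `source` is a text file listing one HDF5 path per line and that dataset names match the layer's `top` names. The helper `write_shards`, the extra `bbox` blob, and all file names are hypothetical.

```python
import os
import tempfile

import h5py
import numpy as np


def write_shards(out_dir, num_shards=2, items_per_shard=4):
    """Write several HDF5 files, each holding multiple blobs, plus a list file."""
    rng = np.random.default_rng(0)
    paths = []
    for i in range(num_shards):
        path = os.path.join(out_dir, "shard_%d.h5" % i)
        with h5py.File(path, "w") as f:
            # Several named blobs live side by side in the same file.
            f.create_dataset(
                "data", data=rng.random((items_per_shard, 3, 8, 8), dtype=np.float32)
            )
            f.create_dataset(
                "label",
                data=rng.integers(0, 10, size=(items_per_shard, 1)).astype(np.float32),
            )
            # "bbox" is a made-up second label blob, for illustration only.
            f.create_dataset(
                "bbox", data=rng.random((items_per_shard, 4), dtype=np.float32)
            )
        paths.append(path)

    # Assumed list-file format: one HDF5 path per line.
    list_path = os.path.join(out_dir, "train_h5_list.txt")
    with open(list_path, "w") as f:
        f.write("\n".join(paths) + "\n")
    return list_path, paths


out_dir = tempfile.mkdtemp()
list_path, shard_paths = write_shards(out_dir)
```

Under that assumption, the generated `train_h5_list.txt` is what the layer's `source` would point to, and each of `data`, `label`, and `bbox` could be exposed as a separate `top`.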
