Add external library data layer #3955

sasagalic-MSFT · 2016-04-06T18:37:14Z

Currently there are a number of data layers available in Caffe supporting different input formats. However, there is always one more format which is optimal for the given training.
One cannot expect Caffe to support all the formats, but fairly close solution would be to implement extensible architecture which enables easy integration of external formats with Caffe.
In this change external library data layer is implemented. This data layer expects external library (Shared Object on Linux or DLL on Windows) to provide data for Caffe network.
Using this data layer has several benefits:

Decouples data source from Caffe. New formats are easily consumable and do not require changes in Caffe codebase
Enables in-memory synthetic data generation
Enables multiple top blobs (current prefetching data layers only support 2 tops, which is a limitation for some trainings)
Opens possibility to implement some of the existing data layers (LMDB, LevelDB etc.) as external libraries to reduce number of dependencies Caffe has

This change is implemented with compatibility in mind. All existing data layers maintain the same behaviour. External library adapters are provided for Windows and Linux to localize platform specific code.

In this change external library data layer is implemented. This data layer is data layer that pulls actual input data from external library which is compliant with interface declared in external_lib_data_source.hpp. External library data layer is employed by specifying "ExternalLibData" as layer type in network proto file as well as library data parameters. Path to external library, external library data source factory method name and external library parameters are parameters that need to be specified. Since external library implementation is platform dependent, appropriate interface is added to avoid spreading platform dependence throughout Caffe codebase. Interface implementation is added for Windows (dynamic link library) and Linux (shared object library).

willyd · 2016-05-19T15:58:08Z

Could this be made even more general? Allowing not only plugin-like data layers, but also compute layers?

hgaiser

I like the idea of loading layers from external libraries. I would also like to extend this to compute layers (as proposed in #5243). That would solve the large number of incompatible caffe forks that currently exist for different networks.

See the inline comments for some feedback on the implementation.

hgaiser · 2017-02-01T15:37:42Z

include/caffe/util/external_lib_data_source.hpp

+#ifndef EXTARNAL_LIB_DATA_SOURCE_H_
+#define EXTARNAL_LIB_DATA_SOURCE_H_
+
+class IExternalLibDataSource;


Caffe doesn't use prefixes for interface classes right now. For example: Layer is an interface and it's simply called Layer. I think it makes sense to maintain the same naming style, so not include the prefix. Same goes for the other interface classes.

hgaiser · 2017-02-01T15:47:38Z

include/caffe/util/external_lib_data_source_util.hpp

+* @brief   Returns library object that abstracts away library management
+*          dependent on platform.
+*/
+boost::shared_ptr<IExternalLib> GetDataSourceLibraryWrapper(


If C++11 is allowed, a std::unique_ptr would be nicer here. There is still a single owner at this point, so it models the ownership more accurately, and it can be implicitly converted to a shared_ptr (boost and std). It could even be released into any lifetime management scheme you can think of.

hgaiser · 2017-02-01T15:52:20Z

include/caffe/layers/external_lib_data_layer.hpp

+class ExternalLibDataLayer : public BasePrefetchingDataLayer<Dtype> {
+ public:
+  explicit ExternalLibDataLayer(const LayerParameter& param);
+  virtual ~ExternalLibDataLayer();


Since this is a class template, shouldn't the function definitions be included here? It seems like this will result in linking errors for some values for Dtype.

hgaiser · 2017-02-01T16:06:14Z

include/caffe/util/external_lib_data_source.hpp

+/**
+* @brief    Defines interface for data source.
+*/
+class IExternalLibDataSource {


Why did you create additional interfaces? Caffe already has an interface for data layers: BaseDataLayer<Dtype>. Why not let the loaded module create instances of that class instead of the new interfaces? I'd expect that would save a lot of code overhead. On top of that I think it would also simplify the process for supporting compute layers.

hgaiser · 2017-02-01T16:09:48Z

include/caffe/util/external_lib_data_source_util.hpp

+class IExternalLib {
+ public:
+  virtual ~IExternalLib() {}
+  virtual IExternalLibDataSource* GetDataSource() = 0;


It would be nice if the class for loading symbols/functions from a library could be generalized (possibly through templating) to support loading functions with any signature. Then it could be reused for loading compute layers as well.

shelhamer · 2017-04-14T07:35:05Z

Closing in favor of the more general approach in #5294. Thanks @sasagalic-MSFT for your concern about how to handle all the different varieties of data out there.

zer0n deleted the bvlc_external_lib_data_layer branch July 22, 2016 04:50

willyd mentioned this pull request Feb 1, 2017

Cpp modular layers #5243

Closed

hgaiser reviewed Feb 1, 2017

View reviewed changes

shelhamer closed this Apr 14, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add external library data layer #3955

Add external library data layer #3955

sasagalic-MSFT commented Apr 6, 2016

willyd commented May 19, 2016

hgaiser left a comment

hgaiser Feb 1, 2017

hgaiser Feb 1, 2017

hgaiser Feb 1, 2017

hgaiser Feb 1, 2017

hgaiser Feb 1, 2017

shelhamer commented Apr 14, 2017

Add external library data layer #3955

Add external library data layer #3955

Conversation

sasagalic-MSFT commented Apr 6, 2016

willyd commented May 19, 2016

hgaiser left a comment

Choose a reason for hiding this comment

hgaiser Feb 1, 2017

Choose a reason for hiding this comment

hgaiser Feb 1, 2017

Choose a reason for hiding this comment

hgaiser Feb 1, 2017

Choose a reason for hiding this comment

hgaiser Feb 1, 2017

Choose a reason for hiding this comment

hgaiser Feb 1, 2017

Choose a reason for hiding this comment

shelhamer commented Apr 14, 2017