e.g. extract the .tar archive, read the .json definition, and provide a standardized way of constructing input data pipelines
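
A minimal sketch of what such a helper could look like, using only the Python standard library. The note does not specify the definition format, so the file name `definition.json`, its keys `files` and `batch_size`, and the function names `extract_archive`, `load_definition`, and `build_pipeline` are all assumptions made for illustration.

```python
import json
import tarfile
from pathlib import Path
from typing import Iterator, List


def extract_archive(archive_path: Path, dest_dir: Path) -> Path:
    """Extract a .tar archive into dest_dir and return dest_dir."""
    dest_dir.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive_path) as tar:
        # Use the safer "data" extraction filter where this Python supports it.
        if hasattr(tarfile, "data_filter"):
            tar.extractall(dest_dir, filter="data")
        else:
            tar.extractall(dest_dir)
    return dest_dir


def load_definition(root: Path, name: str = "definition.json") -> dict:
    """Read the JSON definition (hypothetical file name) from the extracted root."""
    with open(root / name, encoding="utf-8") as f:
        return json.load(f)


def build_pipeline(root: Path, definition: dict) -> Iterator[List[str]]:
    """Yield batches of lines from the files listed in the definition.

    Assumed (hypothetical) schema:
        {"files": ["a.txt", ...], "batch_size": 2}
    """
    batch_size = definition["batch_size"]
    batch: List[str] = []
    for rel_path in definition["files"]:
        with open(root / rel_path, encoding="utf-8") as f:
            for line in f:
                batch.append(line.rstrip("\n"))
                if len(batch) == batch_size:
                    yield batch
                    batch = []
    if batch:
        # Emit the final, possibly smaller, batch.
        yield batch
```

With these three steps behind a common interface, every dataset would be consumed the same way: extract, load the definition, then iterate over `build_pipeline(root, definition)`. Swapping the line reader for a framework-specific loader would not change the calling code.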