The following represents a high-level overview of our 2023 plan. You should be aware that this roadmap may change at any time and the order below does not reflect any type of priority.
We strongly encourage you to comment on our roadmap and provide us with feedback in this issue.
Some of the items mentioned below are a continuation of the 2022 effort (#3774).
Improving Usability:
- eager mode - extending support for using DALI operators as standalone entities and improving their interoperability with other libraries like VPF, CV-CUDA or MONAI
- conditional execution - providing a convenient API to conditionally apply operations based on a predicate, enabling AutoAugment-style capabilities (see the sketch after this list):
- conditional execution itself (Add tutorial for using conditionals in DALI #4569, Add experimental support for if statements in DALI #4561, Make conditionals work in debug mode #4738, Fix classification of argument input-only operators in AutoGraph #4618, Track DataNodes produced by .gpu() in conditionals #4602, Add DALI Conditionals documentation #4589, Support inferring batch size from tensor argument inputs #4617, Add lazy `and` and `or`, and not lazy `not` support #4629, Fix the logical expression tests to avoid short-cutting them #4676)
- automatic augmentation module with AutoAugment, RandAugment, and TrivialAugment ([AA] Add auto augmentation wrapper #4694, Add augmentations used by AA #4699, [AA] Add select operator/wrapper #4696, Add AutoAugment and ImageNet policy #4702, Add RandAugment and TrivialAugment to auto_aug module #4704, Do not use numpy.typing when not available #4706, Rename as_param to mag_to_param #4710, Add more AutoAugment policies #4753, Add EfficientNet example using automatic augmentations with DALI #4678)
- support for NVIDIA Grace Hopper Superchip - this includes a flexible execution model utilizing fast CPU<->GPU memory transfers, where data can go from the CPU to the GPU and back to the CPU within a single pipeline
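For illustration, here is a minimal sketch of how conditional execution is meant to be used from Python. The dataset path, batch size, and augmentation choices are placeholders, and the exact API may still evolve:

```python
from nvidia.dali import pipeline_def, fn, types


@pipeline_def(batch_size=8, num_threads=4, device_id=0, enable_conditionals=True)
def conditional_pipeline():
    jpegs, labels = fn.readers.file(file_root="images")  # placeholder dataset path
    images = fn.decoders.image(jpegs, device="mixed")
    # The predicate is a per-sample scalar DataNode; DALI rewrites the `if`
    # so that each sample in the batch takes only one of the branches.
    should_flip = fn.random.coin_flip(probability=0.25, dtype=types.BOOL)
    if should_flip:
        images = fn.flip(images, horizontal=1)
    else:
        images = fn.rotate(images, angle=fn.random.uniform(range=(-10, 10)), fill_value=0)
    return images, labels


pipe = conditional_pipeline()
pipe.build()
images, labels = pipe.run()
```

The automatic augmentation module (AutoAugment, RandAugment, TrivialAugment) is built on top of this mechanism, so enabling conditionals in the pipeline definition is a prerequisite for using it.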
Extending input format support:
- Extending support for video formats and containers, including variable frame rate videos
- decoding raw H264 and H265 streams from memory (Extend decoding support #4480)
- Support for higher dynamic range data (int32, float) throughout the whole data processing pipeline
- Adding GPU acceleration for more image formats, like TIFF, or new profiles of the existing ones
- lossless JPEG decoding on CPU and GPU with fn.experimental.decoders.image (Optimize CPU time of JPEG lossless decoder #4625, Skip JPEG lossless tests for compute capability < SM60 #4600, Improve error message when trying to decode JPEG lossless images on the CPU backend #4587, Support for JPEG lossless images in GPU fn.experimental.decoders.image #4572, Add nvjpeg calls used for lossless jpeg decoding to the stub generator #4592, Add axes_utils.h #4548)
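As a rough sketch of where this is heading, decoding high-bit-depth (e.g. lossless JPEG) images with the experimental decoder could look roughly like the snippet below; the dataset path is a placeholder and the exact set of supported dtype/output_type values is still subject to change:

```python
from nvidia.dali import pipeline_def, fn, types


@pipeline_def(batch_size=4, num_threads=2, device_id=0)
def lossless_decode_pipeline():
    encoded, _ = fn.readers.file(file_root="jpeg_lossless")  # placeholder dataset path
    # device="mixed" decodes on the GPU; dtype=UINT16 preserves the full bit depth,
    # and output_type=ANY_DATA keeps the original number of channels
    # (the argument values shown here are illustrative).
    images = fn.experimental.decoders.image(
        encoded, device="mixed", dtype=types.UINT16, output_type=types.ANY_DATA
    )
    return images
```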
Performance:
- optimizing memory consumption
- cudaMallocAsync support (Add a memory resource based on cudaMallocAsync #4900, Add alignment to cuda_malloc_async_memory_resource. #4923, and Fix number of devices for JAX multigpu test #4921)
- API for preallocating and releasing memory pools (Add functions to preallocate pools and release unused pool memory #4563, Add `release_unused` function to memory pools. #4556) - see the sketch after this list
- operator performance optimizations
- O_DIRECT mode support for fn.readers.tfrecord (Add O_DIRECT support to the TFRecord reader #4820)
- O_DIRECT mode support for fn.readers.numpy (Add O_DIRECT support in numpy_reader #4796, Fix race condition in the CPU numpy reader #4848)
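A sketch of the pool preallocation/release API, with function names as they appear in the linked PRs (exact signatures may differ in the released package):

```python
from nvidia.dali import backend

# Grow the device memory pool up front so the first iterations do not pay
# the allocation cost (here: 256 MiB on GPU 0).
backend.PreallocateDeviceMemory(256 * 1024 * 1024, 0)
# Pinned (page-locked) host memory can be preallocated in a similar way.
backend.PreallocatePinnedMemory(64 * 1024 * 1024)

# ... build and run pipelines ...

# Return memory that is held by the pools, but not currently used by any
# pipeline, back to the driver / operating system.
backend.ReleaseUnusedMemory()
```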
New transformations:
We are constantly extending the set of operations supported by DALI. Currently, this section lists the most notable additions in our areas of interest that we plan to deliver this year. The list is not exhaustive, and we plan on expanding the set of operators as needs or requests arise.
- new transformations for general data processing
- fn.experimental.tensor_resize operator (Add experimental.tensor_resize operator #4492)
- new transformations for image processing
- median blur (Add median blur operator #4950, Exclude median_blur test from xavier tests #4975)
- histogram equalization operator (fn.experimental.equalize) (Add equalize CPU variant #4742, Equalization operator #4575, Equalize kernel #4565).
- 2-D convolution (fn.experimental.filter) (Add CPU filter operator #4764, Add GPU filter kernel #4298, Add GPU filter operator (2D, 3D) #4525) - see the sketch after this list.
- new transformations for video processing
- the above image transformations are applicable to video as well
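For reference, a hedged sketch of how the new image operators listed above might be combined in a pipeline; window sizes, filter values, and some argument names (e.g. window_size) are illustrative:

```python
import numpy as np
from nvidia.dali import pipeline_def, fn


@pipeline_def(batch_size=4, num_threads=2, device_id=0)
def new_ops_pipeline():
    encoded, _ = fn.readers.file(file_root="images")  # placeholder dataset path
    images = fn.decoders.image(encoded, device="mixed")
    # Histogram equalization
    equalized = fn.experimental.equalize(images)
    # Median blur; the window_size argument name is an assumption
    blurred = fn.experimental.median_blur(images, window_size=5)
    # 2-D convolution with a user-provided kernel (here a 3x3 box filter)
    box = np.full((3, 3), 1.0 / 9.0, dtype=np.float32)
    filtered = fn.experimental.filter(images, box)
    return equalized, blurred, filtered
```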