Allow defining output metadata in the OpSchema by rostan-t · Pull Request #6244 · NVIDIA/DALI

rostan-t · 2026-03-02T16:15:56Z

Category:

New feature (non-breaking change which adds functionality)

Description:

It is currently necessary to run operators in order to know the output ndim, dtype and layouts. This PR adds the possibility to specify those metadata when defining an OpSchema.

This is especially relevant for dynamic mode because accessing those metadata currently causes immediate evaluation of tensors, even when EvalMode.eager or EvalMode.defer is used.

Additional information:

Affected modules and functionalities:

Key points relevant for the review:

Tests:

Checklist

Documentation

DALI team only

Requirements

Implements new requirements
Affects existing requirements
N/A

REQ IDs: N/A

JIRA TASK: DALI-2778

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

mzient · 2026-03-03T09:35:46Z

dali/pipeline/operator/op_schema.h

+   * @param index Index of the output to set the function for.
+   * @param fn Function that returns the data type for the given output.
+   */
+  OpSchema &OutputDType(int index, OutputDTypeFunc fn);


I think that some kind of policy would be nice - and a sane default, too. Most operators will follow the pattern:

use the value of dtype argument, if not None (and it's already a special argument)

use the data type of the 1st input if dtype is None.
I say "is not None" as opposed to "is present", because some operators have a default dtype and adhere to it, even if the user doesn't provide one.

The operators that do NOT follow this pattern are:

readers (typically uint8, unpredictable type for NumPy reader, something else for video)

arithmetic/math (use type promotion rules)

sum (promotes all integers to int64 of the same signeddness; keeps floating point type)

normalize, mean, rms, etc (promotes integers to float, keeps floating point type).

mzient · 2026-03-03T11:52:25Z

dali/pipeline/operator/op_schema.h


+  /** Try calculating the data type of a given output */
+  std::optional<DALIDataType> CalculateOutputDType(int index, const OpSpec &spec,
+                                                   span<const DALIDataType> input_dtypes) const;


I think that the metadata should be a part of what is now called OpSpec::InOutDeviceDesc

Even if it's not, input_dtypes should all be optional - in many cases we only need the data type of the 1st input. Inability to compute all of them will ruin the static evaluation for pipeline mode.

mzient · 2026-03-03T11:52:37Z

dali/pipeline/operator/op_schema.h

+
+  /** Try calculating the ndim of a given output */
+  std::optional<int> CalculateOutputNdim(int index, const OpSpec &spec,
+                                         span<const int> input_ndims) const;


mzient · 2026-03-03T11:52:44Z

dali/pipeline/operator/op_schema.h

+
+  /** Try calculating the layout of a given output */
+  std::optional<TensorLayout> CalculateOutputLayout(int index, const OpSpec &spec,
+                                                    span<const TensorLayout> input_layouts) const;


rostan-t added the Dynamic Mode label Mar 2, 2026

rostan-t added 3 commits March 2, 2026 17:14

Support specifying output dtype, ndim and layout in OpSchema

3c7baf0

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Infer output ndim, dtype, and layout without full evaluation

346d293

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Start adding output metadata inference to operator schemas

98706e8

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

rostan-t force-pushed the opschema-metadata branch from 6f6cb95 to 98706e8 Compare March 2, 2026 17:16

mzient reviewed Mar 3, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow defining output metadata in the OpSchema#6244

Allow defining output metadata in the OpSchema#6244
rostan-t wants to merge 3 commits intoNVIDIA:mainfrom
rostan-t:opschema-metadata

rostan-t commented Mar 2, 2026 •

edited

Loading

Uh oh!

mzient Mar 3, 2026

Uh oh!

mzient Mar 3, 2026

Uh oh!

mzient Mar 3, 2026

Uh oh!

mzient Mar 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

rostan-t commented Mar 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Category:

Description:

Additional information:

Affected modules and functionalities:

Key points relevant for the review:

Tests:

Checklist

Documentation

DALI team only

Requirements

Uh oh!

mzient Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

mzient Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

mzient Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

mzient Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

rostan-t commented Mar 2, 2026 •

edited

Loading