Skip to content

Parquet Statistics null_count does not distinguish between 0 and not specified #6215

@jp0317

Description

@jp0317

Describe the bug

Please close if this is intended: currently the null_count in Statistics defaults to 0 and the has_nulls() is based on null_count > 0, leading to ambiguity when null_count equals 0: either all values are non-null, or just that the null count stat is missing.

To Reproduce
n/a

Expected behavior

it feels reasonable to distinguish no null and null_count not set.

Additional context
n/a

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugparquetChanges to the parquet crate

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions