Skip to content

Support ColumnMetaData encoding_stats in Parquet Writing #7341

@JigaoLuo

Description

@JigaoLuo

I often use the parquet-rewrite CLI to create new Parquet files. However, I've noticed that I can't find the encoding_stats information when using commands like parquet-tools inspect --detail <parquetfile>. Additionally, I've been unable to locate any implementation related to encoding_stats in our Parquet writer.

pub encoding_stats: Option<Vec<PageEncodingStats>>,

I'm curious to know if writing encoding_stats is currently supported. I would really appreciate it if support for this feature could be added to the Parquet writer. Moreover, it would be great if this could be provided as an argument in the parquet-rewrite CLI.

For example, if I'm not mistaken, parquet-cpp-arrow generates Parquet files with encoding_stats enabled by default.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementAny new improvement worthy of a entry in the changelogparquetChanges to the parquet crate

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions