Support Empty Column Projection in ParquetRecordBatchReader #1537

@tustvold

Is your feature request related to a problem or challenge? Please describe what you are trying to do.

Passing an empty array of column indices to ArrowReader::get_record_reader_by_columns currently returns an error.

Describe the solution you'd like

ParquetRecordBatchReader should support an empty projection, using the file metadata to return batches with the correct number of rows instead of reading any column data. This depends on the ability to create a RecordBatch with no columns and a non-zero row count, tracked in #1536.
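
To illustrate the intended behaviour, here is a minimal sketch that uses only the standard library. `EmptyBatch` and `EmptyProjectionReader` are hypothetical stand-ins, not types from the arrow or parquet crates, and the slice of row counts stands in for the per-row-group row counts stored in the Parquet footer. It assumes batches may span row group boundaries.

```rust
/// Hypothetical stand-in for a RecordBatch that has no columns
/// but still carries a row count.
#[derive(Debug, PartialEq, Eq)]
pub struct EmptyBatch {
    pub num_rows: usize,
}

/// Hypothetical reader for an empty projection: it never decodes column
/// data and only splits the total row count into batches.
pub struct EmptyProjectionReader {
    remaining_rows: usize,
    batch_size: usize,
}

impl EmptyProjectionReader {
    /// `row_group_rows` stands in for the row counts recorded in the
    /// file metadata, one entry per row group.
    pub fn new(row_group_rows: &[usize], batch_size: usize) -> Self {
        assert!(batch_size > 0, "batch_size must be non-zero");
        Self {
            remaining_rows: row_group_rows.iter().sum(),
            batch_size,
        }
    }
}

impl Iterator for EmptyProjectionReader {
    type Item = EmptyBatch;

    fn next(&mut self) -> Option<EmptyBatch> {
        if self.remaining_rows == 0 {
            return None;
        }
        // Emit at most `batch_size` rows per batch until the count is exhausted.
        let num_rows = self.remaining_rows.min(self.batch_size);
        self.remaining_rows -= num_rows;
        Some(EmptyBatch { num_rows })
    }
}
```

With row groups of 5 and 3 rows and a batch size of 4, this yields two column-less batches of 4 rows each. A real implementation would return RecordBatch values and so needs #1536 first.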

Describe alternatives you've considered

We could choose not to add this functionality.

Additional context

This would simplify the implementation of Hive-based partitioning, amongst other things. See apache/datafusion#2000.

Labels

enhancement (any new improvement worthy of an entry in the changelog), parquet (changes to the parquet crate)
