BasicStats: TOTAL_RECORDS_PROP does not update after deletes #6709

@simhadri-g

Apache Iceberg version

1.1.0 (latest release)

Query engine

Hive

Please describe the bug 🐞

Issue:

My questions are:

  • Should we subtract TOTAL_POS_DELETES_PROP from TOTAL_RECORDS_PROP, as below, to get the total row count (assuming there are no equality deletes)?
    long totalRecords = Long.parseLong(summary.get(SnapshotSummary.TOTAL_RECORDS_PROP)) - Long.parseLong(summary.get(SnapshotSummary.TOTAL_POS_DELETES_PROP));

  • Is there a way to know the total number of rows removed by equality deletes, so that the correct row count can be obtained?
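
To make the first question concrete, here is a minimal sketch of the subtraction with some guards around it. It works on a plain Map<String, String> standing in for the snapshot summary, and the key strings are my assumption of the values behind SnapshotSummary.TOTAL_RECORDS_PROP, TOTAL_POS_DELETES_PROP and TOTAL_EQ_DELETES_PROP. The class and method names are hypothetical, not part of Iceberg. It also assumes every position delete removes a distinct live row, which may not hold, and it gives up when equality deletes are present, because the summary does not say how many rows they matched.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.OptionalLong;

public class RowCountEstimate {
    // Assumed summary keys (believed to match the SnapshotSummary constants).
    static final String TOTAL_RECORDS = "total-records";
    static final String TOTAL_POS_DELETES = "total-position-deletes";
    static final String TOTAL_EQ_DELETES = "total-equality-deletes";

    // Reads a numeric summary entry, treating a missing key as 0.
    public static long parseOrZero(Map<String, String> summary, String key) {
        String value = summary.get(key);
        return value == null ? 0L : Long.parseLong(value);
    }

    // Returns total-records minus total-position-deletes, or empty when the
    // summary cannot support an estimate (no record count, or equality deletes).
    public static OptionalLong estimateLiveRows(Map<String, String> summary) {
        if (!summary.containsKey(TOTAL_RECORDS)) {
            return OptionalLong.empty();
        }
        if (parseOrZero(summary, TOTAL_EQ_DELETES) > 0) {
            // Equality deletes match rows by value; the matched-row count is unknown here.
            return OptionalLong.empty();
        }
        long records = Long.parseLong(summary.get(TOTAL_RECORDS));
        long positionDeletes = parseOrZero(summary, TOTAL_POS_DELETES);
        // Clamp at zero in case delete records outnumber data records.
        return OptionalLong.of(Math.max(0L, records - positionDeletes));
    }

    public static void main(String[] args) {
        Map<String, String> summary = new HashMap<>();
        summary.put(TOTAL_RECORDS, "100");
        summary.put(TOTAL_POS_DELETES, "10");
        System.out.println(estimateLiveRows(summary));
    }
}
```

If the estimate comes back empty, the fallback I can think of is to scan the table and count rows, since that applies the delete files at read time.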

I would be most grateful if someone could answer my questions.

I apologize in advance if my understanding is lacking.

Thanks!
