Support compressed metadata in Iceberg #81451

Merged
alesapin merged 4 commits into master from support_compressed_metadata
Jun 19, 2025

Conversation

alesapin (Member) commented Jun 6, 2025

Changelog category (leave one):

  • Improvement

Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):

ClickHouse now supports compressed metadata.json files for Iceberg tables. Fixes #70874.

Documentation entry for user-facing changes

  • Documentation is written (mandatory for new features)
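To illustrate the changelog entry, here is a minimal sketch of how a reader might tell a compressed Iceberg metadata file from a plain one by its name. It is not the code from this PR. The enum `MetadataCompression` and the function `detectMetadataCompression` are hypothetical names, and the two gzip suffixes checked here (`.gz.metadata.json` and the older `.metadata.json.gz`) are an assumption based on how Iceberg writers commonly name gzip-compressed metadata.

```cpp
#include <string_view>

// Hypothetical enum for illustration; not ClickHouse's actual type.
enum class MetadataCompression { None, Gzip };

// Returns true if `name` ends with `suffix`.
static bool endsWith(std::string_view name, std::string_view suffix)
{
    return name.size() >= suffix.size()
        && name.compare(name.size() - suffix.size(), suffix.size(), suffix) == 0;
}

// Assumption: gzip-compressed metadata is named either
// "<name>.gz.metadata.json" or, in the older layout, "<name>.metadata.json.gz".
MetadataCompression detectMetadataCompression(std::string_view file_name)
{
    if (endsWith(file_name, ".gz.metadata.json") || endsWith(file_name, ".metadata.json.gz"))
        return MetadataCompression::Gzip;
    return MetadataCompression::None;
}
```

With this sketch, `detectMetadataCompression("v2.gz.metadata.json")` yields `MetadataCompression::Gzip`, while a plain `v2.metadata.json` yields `MetadataCompression::None`.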

clickhouse-gh bot (Contributor) commented Jun 6, 2025

Workflow [PR], commit [e506aec]

@clickhouse-gh clickhouse-gh bot added the pr-improvement Pull request with some product improvements label Jun 6, 2025
@kssenii kssenii self-assigned this Jun 7, 2025
clickhouse-gh bot (Contributor) commented Jun 17, 2025

Workflow [PR], commit [30d43f0]

@alesapin alesapin marked this pull request as ready for review June 18, 2025 13:57

auto pos = data_path.find(common_path);
/// Valid situation when data and metadata files are stored in different directories.
if (pos == std::string::npos)
alesapin (Member, Author):

This change is unrelated to the rest of the PR, but some Iceberg implementations store data and metadata in different directories.
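The situation described in this comment can be sketched as follows. The diff only shows the `find` call and the `npos` check, so what happens in each branch is an assumption here: the hypothetical helper `resolveDataPath` returns the path unchanged when `common_path` does not occur in it, and otherwise trims everything before the match.

```cpp
#include <string>

// Hypothetical helper for illustration; the real function in the PR is not shown.
std::string resolveDataPath(const std::string & data_path, const std::string & common_path)
{
    auto pos = data_path.find(common_path);
    // Valid situation: data and metadata files are stored in different
    // directories, so the common path does not occur in the data path.
    if (pos == std::string::npos)
        return data_path; // assumption: use the path exactly as written

    // Assumption: otherwise keep the part starting at the common path.
    return data_path.substr(pos);
}
```

The point of the sketch is that a miss from `find` is treated as a normal case rather than as an error.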

struct ShortMetadataFileInfo
{
-    UInt32 version;
+    Int32 version;
Member:

Can it be negative?

alesapin (Member, Author):

All of this metadata comes from the Java world, and Java has no unsigned integer types. So in practice the answer is "no", but technically "yes".

Member:

Let's add a comment?
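One way to act on this thread is sketched below: the field stays signed to match the Java side, a comment records why, and negative values are rejected at parse time. The names `ShortMetadataFileInfoSketch` and `parseMetadataVersion` are hypothetical, and rejecting negative versions is an assumption rather than something the PR states.

```cpp
#include <charconv>
#include <cstdint>
#include <optional>
#include <string_view>
#include <system_error>

// Hypothetical stand-in for the struct under review.
struct ShortMetadataFileInfoSketch
{
    /// Signed because Iceberg metadata is produced by Java, which has no
    /// unsigned integer types. Negative versions are not expected in practice.
    std::int32_t version;
};

// Parses a decimal version such as "00003". Returns std::nullopt for empty
// input, trailing garbage, overflow, or a negative value.
std::optional<std::int32_t> parseMetadataVersion(std::string_view text)
{
    std::int32_t value = 0;
    const char * end = text.data() + text.size();
    auto [ptr, ec] = std::from_chars(text.data(), end, value);
    if (ec != std::errc{} || ptr != end || value < 0)
        return std::nullopt;
    return value;
}
```

Keeping the signed type avoids a silent wrap-around if a writer ever emits a negative number, while the explicit check documents that such input is treated as invalid.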

@alesapin alesapin added this pull request to the merge queue Jun 19, 2025
Merged via the queue into master with commit e0b6e78 Jun 19, 2025
121 checks passed
@alesapin alesapin deleted the support_compressed_metadata branch June 19, 2025 09:57
@robot-ch-test-poll4 robot-ch-test-poll4 added the pr-synced-to-cloud The PR is synced to the cloud repo label Jun 19, 2025

Labels

  • pr-improvement: Pull request with some product improvements
  • pr-synced-to-cloud: The PR is synced to the cloud repo

Development

Successfully merging this pull request may close these issues.

Iceberg Integration: Support compressed manifest files

4 participants