Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: apache/orc
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: v2.1.3
Choose a base ref
...
head repository: apache/orc
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: v2.1.4
Choose a head ref
  • 12 commits
  • 24 files changed
  • 6 contributors

Commits on Jul 5, 2025

  1. Preparing branch for 2.1.4 development

    Signed-off-by: Dongjoon Hyun <[email protected]>
    dongjoon-hyun committed Jul 5, 2025
    Configuration menu
    Copy the full SHA
    cfb7313 View commit details
    Browse the repository at this point in the history

Commits on Jul 15, 2025

  1. ORC-1952: [C++] Fix the issue where the value of headerThirdByte exce…

    …eds the valid byte range
    
    ### What changes were proposed in this pull request?
    
    Ensure DIRECT encoding is employed when the input value exceeds the valid byte range.
    
    ### Why are the changes needed?
    
    In Patched Base Encoding, the value of headerThirdByte exceeds the valid byte range.
    
    ### How was this patch tested?
    
    Add the RleV2_value_limit_test to the RleTest suite.
    
    ### Was this patch authored or co-authored using generative AI tooling?
    
    No.
    
    Closes #2324 from luffy-zh/ORC-1952.
    
    Lead-authored-by: luffy-zh <[email protected]>
    Co-authored-by: ffacs <[email protected]>
    Signed-off-by: Dongjoon Hyun <[email protected]>
    (cherry picked from commit 3d61594)
    Signed-off-by: Dongjoon Hyun <[email protected]>
    2 people authored and dongjoon-hyun committed Jul 15, 2025
    Configuration menu
    Copy the full SHA
    ddedb96 View commit details
    Browse the repository at this point in the history

Commits on Aug 8, 2025

  1. ORC-1970: [CI] Fix cpp-linter-action to use hash tag

    ### What changes were proposed in this pull request?
    
    Bump cpp-linter-action version to f91c446a32ae3eb9f98fef8c9ed4c7cb613a4f8a.
    
    ### Why are the changes needed?
    
    Fix CI.
    
    ### How was this patch tested?
    
    The CI job could test it.
    
    ### Was this patch authored or co-authored using generative AI tooling?
    
    No.
    
    Closes #2354 from luffy-zh/ORC-1970.
    
    Authored-by: luffy-zh <[email protected]>
    Signed-off-by: Gang Wu <[email protected]>
    (cherry picked from commit 7dc7178)
    Signed-off-by: Gang Wu <[email protected]>
    luffy-zh authored and wgtmac committed Aug 8, 2025
    Configuration menu
    Copy the full SHA
    c94b11d View commit details
    Browse the repository at this point in the history

Commits on Sep 12, 2025

  1. ORC-1973: [C++] Use int64_t instead of google::protobuf::int64 fo…

    …r `Protobuf` v22+
    
    ### What changes were proposed in this pull request?
    
    This PR aims to use `int64_t` instead of `google::protobuf::int64` to follow ProtocolBuff v22.0 changes.
    
    - https://github.com/protocolbuffers/protobuf/releases/tag/v22.0
    
    > Fixed C++ code generation for protos that use int32_t, uint32_t, int64_t, uint64_t, size_t as field names.
    
    ### Why are the changes needed?
    
    Currently, `branch-2.1` branch CIs are broken like the following.
    - #2384
      - https://github.com/apache/orc/actions/runs/17682608443/job/50260045180?pr=2384
    
    ```
    In file included from /Users/runner/work/orc/orc/c++/src/io/InputStream.cc:19:
    /Users/runner/work/orc/orc/c++/src/io/InputStream.hh:75:31: error: no type named 'int64' in namespace 'google::protobuf'
        virtual google::protobuf::int64 ByteCount() const override;
                ~~~~~~~~~~~~~~~~~~^
    /Users/runner/work/orc/orc/c++/src/io/InputStream.cc:115:21: error: no type named 'int64' in namespace 'google::protobuf'
      google::protobuf::int64 SeekableArrayInputStream::ByteCount() const {
      ~~~~~~~~~~~~~~~~~~^
    /Users/runner/work/orc/orc/c++/src/io/InputStream.cc:116:42: error: no type named 'int64' in namespace 'google::protobuf'
        return static_cast<google::protobuf::int64>(position_);
                           ~~~~~~~~~~~~~~~~~~^
    3 errors generated.
    ```
    
    ### How was this patch tested?
    
    Pass the CIs.
    
    ### Was this patch authored or co-authored using generative AI tooling?
    
    No.
    
    Closes #2386 from dongjoon-hyun/ORC-1973-2.1.
    
    Authored-by: Dongjoon Hyun <[email protected]>
    Signed-off-by: Dongjoon Hyun <[email protected]>
    dongjoon-hyun committed Sep 12, 2025
    Configuration menu
    Copy the full SHA
    6f5638f View commit details
    Browse the repository at this point in the history
  2. ORC-1996: Remove MacOS 13 from GitHub Action CI and docs

    ### What changes were proposed in this pull request?
    
    This PR aims to remove `MacOS 13` from GitHub Action CI and docs.
    
    ### Why are the changes needed?
    
    https://github.blog/changelog/2025-07-11-upcoming-changes-to-macos-hosted-runners-macos-latest-migration-and-xcode-support-policy-updates/
    
    > The macOS 13 hosted runner image is closing down, following our [N-1 OS support policy](https://github.com/actions/runner-images?tab=readme-ov-file#software-and-image-support). This process will begin October 1, 2025, and the image will be fully retired on December 4, 2025. We recommend updating workflows to use
    
    ### How was this patch tested?
    
    Check the CIs triggered on this PR.
    
    ### Was this patch authored or co-authored using generative AI tooling?
    
    No.
    
    Closes #2384 from dongjoon-hyun/ORC-1996-2.1.
    
    Authored-by: Dongjoon Hyun <[email protected]>
    Signed-off-by: Dongjoon Hyun <[email protected]>
    dongjoon-hyun committed Sep 12, 2025
    Configuration menu
    Copy the full SHA
    dcb577c View commit details
    Browse the repository at this point in the history
  3. ORC-1974: [C++] Use google::protobuf::TextFormat instead of `DebugS…

    …tring` for `Protobuf` v30+
    
    ### What changes were proposed in this pull request?
    
    This PR aims to use `google::protobuf::TextFormat` instead of `DebugString` for Protobuf v30+.
    
    - https://github.com/protocolbuffers/protobuf/releases/tag/v30.0
    
        > Make DebugString print debug output, enable debug markers for debug output
    
    ### Why are the changes needed?
    
    Otherwise, our C++ `orc-metadata` tool will have a regression to expose the debug marker string, `goo.gle/debugstr`.
    
    Currently, `branch-2.1` CI is broken on `MacOS` due to this.
    - https://github.com/apache/orc/tree/branch-2.1
      - https://github.com/apache/orc/actions/runs/17683046777/job/50261415209
    
    ### How was this patch tested?
    
    Pass the CIs.
    
    ### Was this patch authored or co-authored using generative AI tooling?
    
    No.
    
    Closes #2387 from dongjoon-hyun/ORC-1974-2.1.
    
    Authored-by: Dongjoon Hyun <[email protected]>
    Signed-off-by: Dongjoon Hyun <[email protected]>
    dongjoon-hyun committed Sep 12, 2025
    Configuration menu
    Copy the full SHA
    6129e4f View commit details
    Browse the repository at this point in the history
  4. ORC-1892: [C++] Upgrade snappy to 1.2.2

    ### What changes were proposed in this pull request?
    
    This PR aims to upgrade `snappy` to 1.2.2 in `cmake_modules`.
    
    ### Why are the changes needed?
    
    To bring the latest bug fixes.
    - https://github.com/google/snappy/releases/tag/1.2.2
    
    > We fixed a very old issue of data corruption when compressed size
    exceeds 4GB. This can happen when you compress data close to 4GB
    and it's incompressible, for example, random data.
    
    ### How was this patch tested?
    
    Pass the CIs.
    
    ### Was this patch authored or co-authored using generative AI tooling?
    
    No.
    
    Closes #2391 from dongjoon-hyun/ORC-1892-2.1.
    
    Authored-by: Dongjoon Hyun <[email protected]>
    Signed-off-by: Dongjoon Hyun <[email protected]>
    dongjoon-hyun committed Sep 12, 2025
    Configuration menu
    Copy the full SHA
    89d37ea View commit details
    Browse the repository at this point in the history
  5. ORC-1893: [C++] Upgrade zstd to 1.5.7

    ### What changes were proposed in this pull request?
    
    This PR aims to upgrade `zstd` to 1.5.7 in `cmake_modules` module.
    
    `GitHub Action MacOS 15` is updated to ZSTD 1.5.7.
    
    - https://github.com/actions/runner-images/blob/macos-15-arm64/20250911.2324/images/macos/macos-15-arm64-Readme.md
    
    ### Why are the changes needed?
    
    To bring the latest bug fixes,
    - https://github.com/facebook/zstd/releases/tag/v1.5.6
    - https://github.com/facebook/zstd/releases/tag/v1.5.7
    
    ### How was this patch tested?
    
    Pass the CIs.
    
    ### Was this patch authored or co-authored using generative AI tooling?
    
    No.
    
    Closes #2395 from dongjoon-hyun/ORC-1893-2.1.
    
    Authored-by: Dongjoon Hyun <[email protected]>
    Signed-off-by: Dongjoon Hyun <[email protected]>
    dongjoon-hyun committed Sep 12, 2025
    Configuration menu
    Copy the full SHA
    8e238cc View commit details
    Browse the repository at this point in the history

Commits on Sep 26, 2025

  1. ORC-2010: Use IANA Identifier America/Los_Angeles instead of `US/…

    …Pacific` in Java
    
    ### What changes were proposed in this pull request?
    
    This PR aims to use `IANA` Identifier `America/Los_Angeles` instead of `US/Pacific` which is removed from `TZDB`.
    
    ### Why are the changes needed?
    
    To make Apache ORC code up-to-date with the current `IANA` zone pattern and be robust from the underlying OS changes.
    
    ### How was this patch tested?
    
    Pass the CIs.
    
    ### Was this patch authored or co-authored using generative AI tooling?
    
    No.
    
    Closes #2418 from dongjoon-hyun/ORC-2010.
    
    Authored-by: Dongjoon Hyun <[email protected]>
    Signed-off-by: Dongjoon Hyun <[email protected]>
    (cherry picked from commit 2c023ef)
    Signed-off-by: Dongjoon Hyun <[email protected]>
    dongjoon-hyun committed Sep 26, 2025
    Configuration menu
    Copy the full SHA
    c4420c2 View commit details
    Browse the repository at this point in the history

Commits on Oct 16, 2025

  1. ORC-2027: [C++] Fix undefined behavior in DoubleColumnReader::readFlo…

    …at()
    
    ### What changes were proposed in this pull request?
    
    Unaligned reads are UB in C++. We can not guarantee that the `bufferPointer_` pointer is aligned by `alignof(int32_t)`.
    
    ### How was this patch tested?
    Use UBsan to test in private repo.
    
    ### Was this patch authored or co-authored using generative AI tooling?
    No.
    
    Closes #2444 from HuaHuaY/fix_issue_2027.
    
    Lead-authored-by: Zehua Zou <[email protected]>
    Co-authored-by: Zehua Zou <[email protected]>
    Signed-off-by: Gang Wu <[email protected]>
    (cherry picked from commit c4fa9fa)
    Signed-off-by: Gang Wu <[email protected]>
    2 people authored and wgtmac committed Oct 16, 2025
    Configuration menu
    Copy the full SHA
    2be451b View commit details
    Browse the repository at this point in the history

Commits on Jan 3, 2026

  1. ORC-2041: Upgrade cpp-linter-action hash to match ASF infra

    ### What changes were proposed in this pull request?
    
    This PR aims to. update `cpp-linter-action` hash to match ASF infra.
    
    ### Why are the changes needed?
    
    Currently, the AS-IS `branch-2.1` CI is broken due to the ASF INFRA change.
    
    > The action cpp-linter/cpp-linter-actionf91c446a32ae3eb9f98fef8c9ed4c7cb613a4f8a is not allowed in apache/orc because all actions must be from a repository owned by your enterprise, created by GitHub, verified in the GitHub Marketplace, or match one of the patterns: ...
    
    As of now, the value is the following.
    
    - https://github.com/apache/infrastructure-actions/blob/main/actions.yml#L278
    
    ```
    cpp-linter/cpp-linter-action:
      0f6d1b8d7e38b584cbee606eb23d850c217d54f8:
        tag: v2.15.1
    ```
    
    ### How was this patch tested?
    
    Pass the CIs on this PR.
    
    ### Was this patch authored or co-authored using generative AI tooling?
    
    No.
    
    Closes #2467 from dongjoon-hyun/ORC-2041-2.1.
    
    Authored-by: Dongjoon Hyun <[email protected]>
    Signed-off-by: William Hyun <[email protected]>
    dongjoon-hyun authored and williamhyun committed Jan 3, 2026
    Configuration menu
    Copy the full SHA
    365c605 View commit details
    Browse the repository at this point in the history

Commits on Jan 6, 2026

  1. Preparing for release 2.1.4

    Signed-off-by: William Hyun <[email protected]>
    williamhyun committed Jan 6, 2026
    Configuration menu
    Copy the full SHA
    2efceb3 View commit details
    Browse the repository at this point in the history
Loading