CI: Use Java 1.9.0-SNAPSHOT for testing#1899
Conversation
With apache/iceberg#12593 Row lineage is required and Spark uses Java 1.8.0 that does not write the required `start-row-id` field in the snapshot. Therefore, I think it would be good to just use the SNAPSHOT for now until it gets released.
|
Thanks @nastra |
|
|
||
| # Download AWS bundle | ||
| RUN curl --retry 5 -s https://repo1.maven.org/maven2/org/apache/iceberg/iceberg-aws-bundle/${ICEBERG_VERSION}/iceberg-aws-bundle-${ICEBERG_VERSION}.jar \ | ||
| RUN curl --retry 5 -s https://repository.apache.org/content/groups/snapshots/org/apache/iceberg/iceberg-aws-bundle/1.9.0-SNAPSHOT/iceberg-aws-bundle-1.9.0-20250408.002722-86.jar \ |
There was a problem hiding this comment.
ah this doesnt exist anymore, https://repository.apache.org/content/groups/snapshots/org/apache/iceberg/iceberg-aws-bundle/1.9.0-SNAPSHOT/
which causes CI to fail with
IllegalArgumentException: Cannot initialize FileIO implementation org.apache.iceberg.aws.s3.S3FileIO: Cannot find constructor for interface org.apache.iceberg.io.FileIO
Missing org.apache.iceberg.aws.s3.S3FileIO [java.lang.NoClassDefFoundError: software/amazon/awssdk/services/s3/model/S3Exception]
make: *** [Makefile:61: test-integration] Error 1
Error: Process completed with exit code 2.
cc @Fokko
There was a problem hiding this comment.
interestingly, iceberg-spark-runtime contains both 20250408 and 20250409 but iceberg-aws-bundle only has 20250409
There was a problem hiding this comment.
interestingly, iceberg-spark-runtime contains both 20250408 and 20250409 but iceberg-aws-bundle only has 20250409
Not sure what's going on there 🤔
The JARs are cleaned up weekly. Since the 0.9.0 SNAPSHOT is being voted on, I suggest bumping it to an available version. I've created a PR here: #1907
This reverts commit aeb4493.
With apache/iceberg#12593 Row lineage is required and Spark uses Java 1.8.0 that does not write the required `first-row-id` field in the snapshot. Therefore, I think it would be good to just use the SNAPSHOT for now until it gets released. We can easily revert the PR once that's done. Closes apache#1898 <!-- Thanks for opening a pull request! --> <!-- In the case this PR will resolve an issue, please replace ${GITHUB_ISSUE_ID} below with the actual Github issue id. --> <!-- Closes #${GITHUB_ISSUE_ID} --> # Rationale for this change # Are these changes tested? # Are there any user-facing changes? <!-- In the case of user-facing changes, please add the changelog label. -->
With apache/iceberg#12593 Row lineage is required and Spark uses Java 1.8.0 that does not write the required
first-row-idfield in the snapshot.Therefore, I think it would be good to just use the SNAPSHOT for now until it gets released. We can easily revert the PR once that's done.
Closes #1898
Rationale for this change
Are these changes tested?
Are there any user-facing changes?