-
Notifications
You must be signed in to change notification settings - Fork 70
add duration.parquet #80
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Thanks for creating the PR! I think we need to include the code snippet to create this file at least in the PR description and use parquet cli to print its metadata so people can know what's in it. BTW, I'm not familiar with Polars, is it possible to use the Parquet Java writer from parquet-java or Parquet C++ writer from Apache Arrow to create such files? |
|
@wgtmac I have update the PR description.
Can I use pyarrow or pandas? Otherwise I have to set up a java environment on my machine. |
|
Yes, pyarrow sounds good to me. |
done |
|
Ah sorry, I didn't check it carefully. Parquet does not have an official |
I agree -- I will clarify / figure out a plan on what to do here: |
|
I think we have figured out the issue and @Liyixin95 has provided a fix here: Quoting from myself on apache/arrow-rs#5626 (comment) In order for the arrow-rs parquet reader to read the data as a duration it needs to interpret the contents of that metadata hint. So I suggest we close this PR and go with the fix in arrow-rs |
Sure, I will close this pr. |
Thanks agian for your help |
add test file for this issue
code:
metadata: