Extract parquet statistics for StructArray #6090
Lordworms wants to merge 3 commits into apache:main
|
closing/reopening to rerun ci |
cbd6eac to 3d285d9
|
I am depressed about the large review backlog in this crate. We are looking for more help from the community reviewing PRs -- see #6418 for more |
|
I'm interested in this topic, but not too acquainted with the source code of this crate yet. Give me more time and I can share some of the review load! |
]),
};
// Due to https://github.com/apache/datafusion/issues/8334,
// statistics for struct arrays are not supported
Maybe we can remove this comment once this is resolved.
|
|
||
expected_max: struct_array(vec![
    (Some(true), Some(3)),
    (Some(true), Some(0)),
The original values were
(Some(true), Some(0)),
(Some(false), Some(5)),
(None, None),
Then shouldn't min be (Some(false), Some(5)) and max be (Some(true), Some(0))?
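To make the question concrete, here is a minimal sketch of one possible reading of "min" and "max" for struct rows. It assumes rows compare lexicographically, field by field, and that rows whose fields are all null are skipped. Neither assumption is confirmed by this PR or by the parquet format. `Row` and `lexicographic_min_max` are hypothetical names used only for illustration, not part of the parquet crate.

```rust
// Hypothetical sketch, NOT the PR's implementation.
// A struct row with a nullable bool field and a nullable i32 field.
type Row = (Option<bool>, Option<i32>);

/// Returns `(min, max)` over `rows` using Rust's built-in lexicographic
/// tuple ordering (first field decides, second field breaks ties).
/// Assumption: rows where every field is null are ignored.
/// Caveat: a partially null row such as `(Some(true), None)` is kept, and
/// Rust orders `None` before any `Some(_)`, which may not match what a
/// parquet writer would do.
fn lexicographic_min_max(rows: &[Row]) -> Option<(Row, Row)> {
    let mut non_null = rows
        .iter()
        .copied()
        .filter(|(a, b)| a.is_some() || b.is_some());
    // No non-null rows means there are no statistics to report.
    let first = non_null.next()?;
    Some(non_null.fold((first, first), |(lo, hi), row| {
        (lo.min(row), hi.max(row))
    }))
}
```

Called on the three rows quoted above, this returns `(Some(false), Some(5))` as the min and `(Some(true), Some(0))` as the max, because `false < true` decides the comparison on the first field.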
|
This PR looks like it has some comments that are still waiting to be addressed. I am sorry I don't have time to push this along -- if anyone cares about this particular feature, I think the first thing would be to do some research and confirm what the behavior of other parquet implementations is.
It is not clear to me if there is much value / precedent for extracting min/max statistics for
I am sorry this has dragged out so long @Lordworms -- I think we should close this PR unless someone else would like to push it along. |
|
Closing this PR as stale. Feel free to reopen if it becomes active again. |
Which issue does this PR close?
Closes #.
Rationale for this change
What changes are included in this PR?
Are there any user-facing changes?