Skip to content

[python] return Arrow tables with dictionary of large_string, rather than string#4220

Merged
bkmartinjr merged 7 commits intomainfrom
bkm/soma-440-part-deux
Sep 8, 2025
Merged

[python] return Arrow tables with dictionary of large_string, rather than string#4220
bkmartinjr merged 7 commits intomainfrom
bkm/soma-440-part-deux

Conversation

@bkmartinjr
Copy link
Copy Markdown
Member

@bkmartinjr bkmartinjr commented Sep 4, 2025

Follow-on fix for SOMA-440, correctly setting the Arrow schema in tables returned by DataFrame.read.

For Python: the first fix, PR #4209, resolved the type as reported by the schema property. This PR finishes the job by returning Arrow tables with the correct schema.

For R: revert the previous change, reverting to "small" string/binary types.

Note to reviewers: addition change in Python tests to further reduce memory on MacOS CI runners.

@codecov
Copy link
Copy Markdown

codecov Bot commented Sep 4, 2025

Codecov Report

❌ Patch coverage is 17.94872% with 32 lines in your changes missing coverage. Please review.
✅ Project coverage is 75.39%. Comparing base (a2f7a7f) to head (cb6a83e).
⚠️ Report is 68 commits behind head on main.

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #4220      +/-   ##
==========================================
+ Coverage   69.58%   75.39%   +5.80%     
==========================================
  Files         169      229      +60     
  Lines       19072    30738   +11666     
  Branches     1233     1236       +3     
==========================================
+ Hits        13272    23176    +9904     
- Misses       5366     7140    +1774     
+ Partials      434      422      -12     
Flag Coverage Δ
libtiledbsoma 57.06% <5.55%> (-0.06%) ⬇️
python 89.72% <ø> (-0.03%) ⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Components Coverage Δ
python_api 89.72% <ø> (-0.03%) ⬇️
libtiledbsoma 45.99% <11.11%> (+0.72%) ⬆️
🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@bkmartinjr bkmartinjr marked this pull request as ready for review September 8, 2025 01:03
Copy link
Copy Markdown
Member

@mojaveazure mojaveazure left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM; can you bump the R development version before merging?

@bkmartinjr bkmartinjr merged commit c3cb4fd into main Sep 8, 2025
24 checks passed
@bkmartinjr bkmartinjr deleted the bkm/soma-440-part-deux branch September 8, 2025 21:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants