Goal of having a single copy of tiledb core via libtiledbsoma, eliminating all use of tiledb-py and tiledb-r:
- Currently present in
SOMAReader (as of 2023-02-13):
- Reads:
- Writes:
Query API:
Feature completion:
Read performance, and in particular all of the zero-copy or ecosystem format paths (optmized CSC/CSR, etc)
Large out-of-core / distributed processing:
cc @gspowley @nguyenv @Shelnutt2 @eddelbuettel
Goal of having a single copy of tiledb core via
libtiledbsoma, eliminating all use oftiledb-pyandtiledb-r:SOMAReader(as of 2023-02-13):readfor arraysset_dim_points,submit,results_complete, etc.nnzstatsaccessSOMAReadertoSOMAArrayReadershape[c++] Add shape-getter toSOMAReaderAPI #927schemareads (as Arrow) [c++] Implement Arrow-schema-getter in C++ #1121metadatareads [c++] Add metadata functions toSOMAArray#926open(for read) andcloseSOMAGroupReader[c++] Implement group member-names-to-URIs map in C++ #1122open(for read), andcloseSOMAArrayWriteropen(for write), andclosewritebulk dataSOMAGroupWriter[c++]tiledb.group_createviaSOMAGroupWriter#1182create[c++]tiledb.group_createviaSOMAGroupWriter#1182open(for write), andcloseQuery API:
somacore) uses only the API, with no direct implementation-level accesses.SOMAArrayReaderandSOMAGroupReaderfeature-complete (as building blocks) before we can implement this in C++SOMAArrayWriterandSOMAGroupWriterdevelopment can proceed in parallel with this.Feature completion:
SOMAReaderuse all supported types (not justintandstringas currently) to indexSOMADataFrameRead performance, and in particular all of the zero-copy or ecosystem format paths (optmized CSC/CSR, etc)
Large out-of-core / distributed processing:
cc @gspowley @nguyenv @Shelnutt2 @eddelbuettel