Open data framework for biology
Query, trace & validate datasets & models at scale. One API: lakehouse, lineage, feature store, ontologies, LIMS, and ELN.

Lineage
Trace data, code & reports
Know where data came from and what it's used for. Track data lineage with a single line of code.
Lakehouse
Query datasets at scale
Query and batch-load datasets with lakehouse support for a wide range of table & array formats. Manage their features & schemas as metadata in a database.

LIMS
Lakehouse-native sheets
Build and query sheets with the same features and schemas that index your datasets in storage. A single Python/R class with built-in ontologies, project & change management.

Integrity
Validate & annotate datasets
Use schemas to enforce consistency across your data assets. Annotate datasets with a single line of code.
Zero lock-in
Administrate with ease while staying in control
Manage fine-grained access for humans & agents with SaaS-like simplicity. Enjoy full admin control at the database and storage level.

Context
Build your organization's long-term memory
As your team and agents work, data, models & reports are automatically linked — building context & training data that compounds over time.
