feat(csharp/src/Drivers/Databricks): Primary Key and Foreign Key Metadata Optimization #2886

eric-wang-1990 · 2025-05-28T00:37:04Z

Arrow ADBC: Primary Key and Foreign Key Metadata Optimization

Description

This PR adds support for optimizing Primary Key and Foreign Key metadata queries in the C# Databricks ADBC driver. It introduces a new connection parameter adbc.databricks.enable_pk_fk that allows users to control whether the driver should make PK/FK metadata calls to the server or return empty results for improved performance.
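As a rough illustration of how the new parameter might be read, the sketch below parses adbc.databricks.enable_pk_fk from a string dictionary and falls back to the default of true. Only the key name and its default come from this PR; the class PkFkOptions, the method ParseEnablePkFk, and the fallback for unparsable values are assumptions made for the example, not the driver's actual code.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical helper, not part of the driver: reads the
// adbc.databricks.enable_pk_fk parameter with a default of true.
public static class PkFkOptions
{
    // Key name taken from the PR description.
    public const string EnablePkFk = "adbc.databricks.enable_pk_fk";

    public static bool ParseEnablePkFk(IReadOnlyDictionary<string, string> parameters)
    {
        // Use the supplied value only when it is present and parses as a boolean.
        if (parameters.TryGetValue(EnablePkFk, out var raw) &&
            bool.TryParse(raw, out bool enabled))
        {
            return enabled;
        }

        // Assumption of this sketch: missing or unparsable values
        // fall back to the documented default (true).
        return true;
    }
}
```

Passing a dictionary that maps the key to "false" yields false, which in this PR's terms means the driver would skip the PK/FK server calls and return empty results.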

Background

Primary Key and Foreign Key metadata queries can be expensive operations, particularly in Databricks environments where they may not be fully supported in certain catalogs. This implementation provides a way to optimize these operations by:

- Allowing users to disable PK/FK metadata calls entirely via configuration
- Automatically returning empty results for legacy catalogs (SPARK, hive_metastore) where PK/FK metadata is not supported
- Ensuring that empty results maintain schema compatibility with real metadata responses

Proposed Changes

- Add new connection parameter adbc.databricks.enable_pk_fk to control PK/FK metadata behavior (default: true)
- Implement special handling for legacy catalogs (SPARK, hive_metastore) to return empty results without server calls
- Modify method visibility in base classes to allow proper overriding in derived classes
- Add comprehensive test coverage for the new functionality
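The decision described in the changes above can be sketched roughly as follows. The method name ShouldReturnEmptyPkFkResult and the two legacy catalog names come from this PR; the class PkFkDecision, the parameter list, the case-insensitive comparison, and the treatment of a null catalog as non-legacy are assumptions of this sketch and may differ from the real implementation.

```csharp
using System;

// Hypothetical stand-in for the driver's decision logic; the real
// ShouldReturnEmptyPkFkResult method may take different inputs.
public static class PkFkDecision
{
    // Legacy catalog names listed in the PR description.
    private static readonly string[] LegacyCatalogs = { "SPARK", "hive_metastore" };

    public static bool ShouldReturnEmptyPkFkResult(bool enablePkFk, string catalogName)
    {
        // The user disabled PK/FK metadata calls: always return an empty result.
        if (!enablePkFk)
        {
            return true;
        }

        // Legacy catalogs do not support PK/FK metadata, so skip the server call.
        // Assumption: names are compared case-insensitively, and a null
        // catalog name is treated as non-legacy.
        foreach (string legacy in LegacyCatalogs)
        {
            if (string.Equals(catalogName, legacy, StringComparison.OrdinalIgnoreCase))
            {
                return true;
            }
        }

        return false;
    }
}
```

With this shape, the server is only contacted when the feature is enabled and the catalog is not one of the legacy names.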

How is this tested?

Added unit tests that verify:

- The correct behavior of the ShouldReturnEmptyPkFkResult method with various combinations of settings
- Schema compatibility between empty results and real metadata responses
- Proper handling of different catalog scenarios

These tests ensure that the optimization works correctly while maintaining compatibility with client applications that expect consistent schema structures.

CurtHagenlocher

Thanks! Please resolve the whitespace issue identified by the linter and see the additional comment.

CurtHagenlocher · 2025-05-28T14:20:41Z

csharp/src/Drivers/Databricks/DatabricksStatement.cs

+                new Field("TABLE_SCHEM", StringType.Default, true),
+                new Field("TABLE_NAME", StringType.Default, true),
+                new Field("COLUMN_NAME", StringType.Default, true),
+                new Field("KEQ_SEQ", Int32Type.Default, true),


CurtHagenlocher: Should this be "KEY_SEQ"? If so, it also needs to be changed on line 448.

eric-wang-1990: Nope, this should be KEQ.

CurtHagenlocher: So the primary key column is named KEQ_SEQ and the foreign key column is named KEY_SEQ? Weird.

eric-wang-1990: Yes, that is weird, but that is what is returned from Thrift.
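Because the two result sets spell the sequence column differently, client code that looks the column up by name may want to accept both spellings. The sketch below is one way to do that over a plain list of field names; the class KeySequenceColumn and the method FindIndex are hypothetical and are not part of the driver.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical client-side helper: locates the key sequence column
// whether it is spelled KEQ_SEQ (primary keys) or KEY_SEQ (foreign keys).
public static class KeySequenceColumn
{
    // Returns the zero-based index of the sequence column, or -1 if absent.
    public static int FindIndex(IReadOnlyList<string> fieldNames)
    {
        for (int i = 0; i < fieldNames.Count; i++)
        {
            if (fieldNames[i] == "KEQ_SEQ" || fieldNames[i] == "KEY_SEQ")
            {
                return i;
            }
        }

        return -1;
    }
}
```

Looking the column up by position instead of by name would avoid the issue entirely, at the cost of depending on column order.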

feat(csharp/Drivers/Databricks): Add support for PK/FK metadata optimization

91feacf

eric-wang-1990 requested a review from CurtHagenlocher as a code owner May 28, 2025 00:37

github-actions bot added this to the ADBC Libraries 19 milestone May 28, 2025

update

9e76ea9

eric-wang-1990 changed the title ~~feat(csharp/Drivers/Databricks): Primary Key and Foreign Key Metadata Optimization~~ feat(csharp/Drivers/Databricks):Primary Key and Foreign Key Metadata Optimization May 28, 2025

CurtHagenlocher changed the title ~~feat(csharp/Drivers/Databricks):Primary Key and Foreign Key Metadata Optimization~~ feat(csharp/src/Drivers/Databricks): Primary Key and Foreign Key Metadata Optimization May 28, 2025

CurtHagenlocher requested changes May 28, 2025

linter

918e944

eric-wang-1990 requested a review from CurtHagenlocher May 28, 2025 17:03

CurtHagenlocher approved these changes May 28, 2025

CurtHagenlocher merged commit aae84d2 into apache:main May 28, 2025
7 checks passed

serramatutu mentioned this pull request Aug 12, 2025

Sync upstream dbt-labs/arrow-adbc#52

Closed
