feat(csharp/src/Drivers/BigQuery): support evaluation kind and statement type setting #2698

qifanzhang-ms · 2025-04-11T07:53:58Z

Support evaluation kind and statement type setting for GBQ driver, and improve code at BigQueryStatement.cs.

csharp/test/Drivers/BigQuery/BigQueryTestConfiguration.cs

davidhcoe · 2025-04-11T16:19:58Z

Is there a specific multi-statement test that should be added (presumably from both the DriverTests and the ClientTests)?

qifanzhang-ms · 2025-04-14T08:49:03Z

Is there a specific multi-statement test that should be added (presumably from both the DriverTests and the ClientTests)?
I tried to add a test. Since our current test framework requires developers to write query statements in json files themselves, the added test would be a bit abrupt.

davidhcoe · 2025-04-14T14:05:27Z

Is there a specific multi-statement test that should be added (presumably from both the DriverTests and the ClientTests)?
I tried to add a test. Since our current test framework requires developers to write query statements in json files themselves, the added test would be a bit abrupt.

The way I would do something like this is for any environment that can execute a query. For example, something like:

 [SkippableFact, Order(6)]
 public void CanExecuteMultiStatementQuery()
 {
     foreach (BigQueryTestEnvironment environment in _environments)
     {
         AdbcConnection adbcConnection = GetAdbcConnection(environment.Name);
         AdbcStatement statement = adbcConnection.CreateStatement();
         
          string query1 = "SELECT " +
                  "CAST(1 as INT64) as id, " +
                  "CAST(1.23 as FLOAT64) as number, " +
                  "PARSE_NUMERIC(\"4.56\") as decimal, " +
                  "PARSE_BIGNUMERIC(\"7.89000000000000000000000000000000000000\") as big_decimal, " +
                  "CAST(True as BOOL) as is_active, " +
                  "'John Doe' as name, " +
                  "FROM_BASE64('YWJjMTIz') as data, " +
                  "CAST('2023-09-08' as DATE) as date, " +
                  "CAST('12:34:56' as TIME) as time, " +
                  "CAST('2023-09-08 12:34:56' as DATETIME) as datetime, " +
                  "CAST('2023-09-08 12:34:56+00:00' as TIMESTAMP) as timestamp, " +
                  "ST_GEOGPOINT(1, 2) as point, " +
                  "ARRAY[1, 2, 3] as numbers, " +
                  "STRUCT('John Doe' as name, 30 as age) as person," +
                  "PARSE_JSON('{\"name\":\"Jane Doe\",\"age\":29}') as json";

         string query2 = "SELECT " +
                   "CAST(1.7976931348623157e+308 as FLOAT64) as number, " +
                   "PARSE_NUMERIC(\"9.99999999999999999999999999999999E+28\") as decimal, " +
                 
       "PARSE_BIGNUMERIC(\"5.7896044618658097711785492504343953926634992332820282019728792003956564819968E+37\")

         string combinedQuery = query1 + ";" + query2 + ";"
         statement.SqlQuery = combinedQuery;

         QueryResult queryResult = statement.ExecuteQuery();

       // TODO: Assert the expected results from the two queries
     }
 }

This will work without any tables being created and you can validate the result(s) of the multi-statement evaluation.

An alternative way could be to use the BigQuery public datasets to retrieve data as well.

qifanzhang-ms · 2025-04-15T09:03:45Z

Is there a specific multi-statement test that should be added (presumably from both the DriverTests and the ClientTests)?
I tried to add a test. Since our current test framework requires developers to write query statements in json files themselves, the added test would be a bit abrupt.

The way I would do something like this is for any environment that can execute a query. For example, something like:

 [SkippableFact, Order(6)]
 public void CanExecuteMultiStatementQuery()
 {
     foreach (BigQueryTestEnvironment environment in _environments)
     {
         AdbcConnection adbcConnection = GetAdbcConnection(environment.Name);
         AdbcStatement statement = adbcConnection.CreateStatement();
         
          string query1 = "SELECT " +
                  "CAST(1 as INT64) as id, " +
                  "CAST(1.23 as FLOAT64) as number, " +
                  "PARSE_NUMERIC(\"4.56\") as decimal, " +
                  "PARSE_BIGNUMERIC(\"7.89000000000000000000000000000000000000\") as big_decimal, " +
                  "CAST(True as BOOL) as is_active, " +
                  "'John Doe' as name, " +
                  "FROM_BASE64('YWJjMTIz') as data, " +
                  "CAST('2023-09-08' as DATE) as date, " +
                  "CAST('12:34:56' as TIME) as time, " +
                  "CAST('2023-09-08 12:34:56' as DATETIME) as datetime, " +
                  "CAST('2023-09-08 12:34:56+00:00' as TIMESTAMP) as timestamp, " +
                  "ST_GEOGPOINT(1, 2) as point, " +
                  "ARRAY[1, 2, 3] as numbers, " +
                  "STRUCT('John Doe' as name, 30 as age) as person," +
                  "PARSE_JSON('{\"name\":\"Jane Doe\",\"age\":29}') as json";

         string query2 = "SELECT " +
                   "CAST(1.7976931348623157e+308 as FLOAT64) as number, " +
                   "PARSE_NUMERIC(\"9.99999999999999999999999999999999E+28\") as decimal, " +
                 
       "PARSE_BIGNUMERIC(\"5.7896044618658097711785492504343953926634992332820282019728792003956564819968E+37\")

         string combinedQuery = query1 + ";" + query2 + ";"
         statement.SqlQuery = combinedQuery;

         QueryResult queryResult = statement.ExecuteQuery();

       // TODO: Assert the expected results from the two queries
     }
 }

This will work without any tables being created and you can validate the result(s) of the multi-statement evaluation.

An alternative way could be to use the BigQuery public datasets to retrieve data as well.

Thanks, have added.

CurtHagenlocher · 2025-04-16T14:32:29Z

I feel like I don't entirely understand this change, so it would be nice to get a little more explanation.

Today, there's a limitation in ADBC which prevents a single execution batch from returning multiple results. Now obviously, multiple statements doesn't have to mean multiple results because a statement could be e.g. DDL or BEGIN TRAN or some other thing which impacts session state. But accepting multiple statements implies that they could each be returning results, and that's where things feel a little sketchy. It looks like the two added parameters are intended to filter down the set of results in order to pick just one. But they do so in a way that doesn't (to me) obviously ensure that only one result will match the criteria. What if there are two results with the same statement type and evaluation kind? Picking the first one feels a little arbitrary.

What I'd naively expect if I passed multiple statements and needed to indicate which one's results I wanted would be to supply the index of the statement. So for "statement1; statement2; statement3", if I wanted the results from statement2 I might pass either 1 or 2 depending on how I feel about zero-indexing vs one-indexing.

Is the current design something that users of the e.g. BigQuery ODBC driver would already be familiar with?
Do certain statement types and evaluation kinds never come with real result data and can therefore be omitted by default if e.g. an index wasn't specified?

CurtHagenlocher · 2025-04-16T14:34:18Z

(Also, it would be nice if we tried to maintain some alignment between the C# BigQuery driver and the Go BigQuery driver -- though it may already be a little late for that :(. CC: @lidavidm for possible additional feedback.)

CurtHagenlocher

The change looks mechanically fine; I just have questions around the API.

csharp/src/Drivers/BigQuery/BigQueryStatement.cs

csharp/test/Drivers/BigQuery/DriverTests.cs

davidhcoe · 2025-04-16T16:38:15Z

(Also, it would be nice if we tried to maintain some alignment between the C# BigQuery driver and the Go BigQuery driver -- though it may already be a little late for that :(. CC: @lidavidm for possible additional feedback.)

We started the C# one before the Go one was available and I haven’t followed the Go one too closely.

qifanzhang-ms · 2025-04-18T06:28:54Z

I feel like I don't entirely understand this change, so it would be nice to get a little more explanation.

Today, there's a limitation in ADBC which prevents a single execution batch from returning multiple results. Now obviously, multiple statements doesn't have to mean multiple results because a statement could be e.g. DDL or BEGIN TRAN or some other thing which impacts session state. But accepting multiple statements implies that they could each be returning results, and that's where things feel a little sketchy. It looks like the two added parameters are intended to filter down the set of results in order to pick just one. But they do so in a way that doesn't (to me) obviously ensure that only one result will match the criteria. What if there are two results with the same statement type and evaluation kind? Picking the first one feels a little arbitrary.

What I'd naively expect if I passed multiple statements and needed to indicate which one's results I wanted would be to supply the index of the statement. So for "statement1; statement2; statement3", if I wanted the results from statement2 I might pass either 1 or 2 depending on how I feel about zero-indexing vs one-indexing.

Is the current design something that users of the e.g. BigQuery ODBC driver would already be familiar with? Do certain statement types and evaluation kinds never come with real result data and can therefore be omitted by default if e.g. an index wasn't specified?

Your understanding is very accurate. The behavior of picking the first one statement is designed because the Connector implemented by the previous BigQuery ODBC driver has such behavior, which is also the behavior that customers want.

Supplying the index of the statement is indeed a reasonable feature, but there is no specific customer demand at present. I can add it, you can take a look first.

CurtHagenlocher

Thanks!

…ent type setting (apache#2698) Support evaluation kind and statement type setting for GBQ driver, and improve code at BigQueryStatement.cs.

qifanzhang-ms added 2 commits March 20, 2025 16:46

core code

6dd13a3

filter by EvaluationKind

a6b47a6

qifanzhang-ms requested a review from CurtHagenlocher as a code owner April 11, 2025 07:53

github-actions bot modified the milestone: ADBC Libraries 18 Apr 11, 2025

davidhcoe reviewed Apr 11, 2025

View reviewed changes

csharp/test/Drivers/BigQuery/BigQueryTestConfiguration.cs Show resolved Hide resolved

add test

f18dbb7

fix the test

8cac9e6

davidhcoe mentioned this pull request Apr 14, 2025

feat(csharp/src/Drivers/BigQuery): Add support for AAD/Entra authentication #2655

Merged

add test

2684472

CurtHagenlocher reviewed Apr 16, 2025

View reviewed changes

csharp/src/Drivers/BigQuery/BigQueryStatement.cs Show resolved Hide resolved

csharp/src/Drivers/BigQuery/BigQueryStatement.cs Show resolved Hide resolved

CurtHagenlocher reviewed Apr 16, 2025

View reviewed changes

csharp/test/Drivers/BigQuery/DriverTests.cs Outdated Show resolved Hide resolved

qifanzhang-ms added 3 commits April 18, 2025 16:27

Add statementIndex

1bdcafc

change the test

56af647

fix

d6bc961

CurtHagenlocher approved these changes Apr 18, 2025

View reviewed changes

CurtHagenlocher merged commit 6027c11 into apache:main Apr 18, 2025
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(csharp/src/Drivers/BigQuery): support evaluation kind and statement type setting #2698

feat(csharp/src/Drivers/BigQuery): support evaluation kind and statement type setting #2698

Uh oh!

qifanzhang-ms commented Apr 11, 2025

Uh oh!

Uh oh!

davidhcoe commented Apr 11, 2025

Uh oh!

qifanzhang-ms commented Apr 14, 2025

Uh oh!

davidhcoe commented Apr 14, 2025 •

edited

Loading

Uh oh!

qifanzhang-ms commented Apr 15, 2025

Uh oh!

CurtHagenlocher commented Apr 16, 2025

Uh oh!

CurtHagenlocher commented Apr 16, 2025

Uh oh!

CurtHagenlocher left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

davidhcoe commented Apr 16, 2025

Uh oh!

qifanzhang-ms commented Apr 18, 2025 •

edited

Loading

Uh oh!

CurtHagenlocher left a comment

Uh oh!

Uh oh!

Uh oh!

feat(csharp/src/Drivers/BigQuery): support evaluation kind and statement type setting #2698

feat(csharp/src/Drivers/BigQuery): support evaluation kind and statement type setting #2698

Uh oh!

Conversation

qifanzhang-ms commented Apr 11, 2025

Uh oh!

Uh oh!

davidhcoe commented Apr 11, 2025

Uh oh!

qifanzhang-ms commented Apr 14, 2025

Uh oh!

davidhcoe commented Apr 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

qifanzhang-ms commented Apr 15, 2025

Uh oh!

CurtHagenlocher commented Apr 16, 2025

Uh oh!

CurtHagenlocher commented Apr 16, 2025

Uh oh!

CurtHagenlocher left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

davidhcoe commented Apr 16, 2025

Uh oh!

qifanzhang-ms commented Apr 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

CurtHagenlocher left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

davidhcoe commented Apr 14, 2025 •

edited

Loading

qifanzhang-ms commented Apr 18, 2025 •

edited

Loading