[Data] Fixing BQ datasink to be able to handle empty blocks by alexeykudinkin · Pull Request #60797 · ray-project/ray

alexeykudinkin · 2026-02-06T02:56:54Z

Signed-off-by: Alexey Kudinkin <ak@anyscale.com>

gemini-code-assist

Code Review

This pull request correctly fixes an issue in the BigQuery datasink to handle empty blocks by filtering them out before processing. A corresponding unit test has been added to validate the fix. While the fix itself is correct, the assertion in the new test case is flawed and should be corrected to accurately reflect the expected behavior.
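As a rough illustration of what "filtering them out before processing" can look like, here is a minimal sketch. It does not reproduce the actual BigQuery datasink code: the `Block` type, `num_rows`, and `drop_empty_blocks` are hypothetical stand-ins, and a block is modeled as a plain column-to-values mapping rather than a real Ray Data block.

```python
from typing import Dict, List

# Hypothetical stand-in for a Ray Data block: column name -> list of values.
Block = Dict[str, List[int]]


def num_rows(block: Block) -> int:
    """Return the row count of a columnar block (0 if it has no columns)."""
    return len(next(iter(block.values()), []))


def drop_empty_blocks(blocks: List[Block]) -> List[Block]:
    """Keep only the blocks that contain at least one row."""
    return [b for b in blocks if num_rows(b) > 0]


# Two empty blocks (zero rows, and zero columns) around one non-empty block.
blocks = [{"id": []}, {"id": [1, 2]}, {}]
non_empty = drop_empty_blocks(blocks)
```

With a filter like this in front of task submission, a write that receives only empty blocks ends up with nothing to submit.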

gemini-code-assist · 2026-02-06T02:57:44Z

python/ray/data/tests/datasource/test_bigquery.py

+            ctx=ctx,
+        )
+
+        ray_get_mock.assert_not_called()


The assertion assert_not_called() is incorrect because ray.get() is still invoked even when the list of remote tasks is empty. To correctly verify that no tasks are submitted for an empty block, you should assert that ray.get() was called once with an empty list.

Suggested change

-        ray_get_mock.assert_not_called()
+        ray_get_mock.assert_called_once_with([])

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

cursor · 2026-02-06T03:01:29Z

python/ray/data/tests/datasource/test_bigquery.py

+            ctx=ctx,
+        )
+
+        ray_get_mock.assert_not_called()


Test assertion incorrect for empty block case

Medium Severity

The test assertion ray_get_mock.assert_not_called() is incorrect. When the write method is called with only empty blocks, the list comprehension filters them out, producing an empty list []. However, ray.get([]) is still called (the ray.get call is unconditional). The mock would be called with an empty list argument, causing assert_not_called() to fail. The assertion should verify ray.get was called with an empty list instead.
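The behavior described in this comment can be reproduced with `unittest.mock` alone. The sketch below assumes, as the review states, that the result-gathering call is unconditional. `submit` and `write` are hypothetical stand-ins for the datasink's task submission and write method, and the `get` parameter stands in for a patched `ray.get`.

```python
from unittest import mock


def submit(block):
    """Hypothetical stand-in for launching a remote write task."""
    return ("task", tuple(block))


def write(blocks, get):
    """Filter out empty blocks, then wait on whatever tasks remain."""
    tasks = [submit(b) for b in blocks if len(b) > 0]
    # This call is unconditional: it runs even when `tasks` is empty.
    return get(tasks)


get_mock = mock.Mock(return_value=[])
write([[], []], get=get_mock)  # only empty blocks are passed in

# Passes: the mock was called exactly once, with an empty list.
get_mock.assert_called_once_with([])

# Fails: the mock was called, so assert_not_called() raises AssertionError.
try:
    get_mock.assert_not_called()
    not_called_passes = True
except AssertionError:
    not_called_passes = False
```

If the implementation instead returned early when no tasks remain, `assert_not_called()` would be the right check, so the correct assertion depends on how the write method is actually structured.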

alexeykudinkin added 2 commits February 5, 2026 18:47

Fixed BQ data-sink to avoid writing empty blocks

b2f02ce

Signed-off-by: Alexey Kudinkin <ak@anyscale.com>

Added test

05759a8

Signed-off-by: Alexey Kudinkin <ak@anyscale.com>

alexeykudinkin requested a review from a team as a code owner February 6, 2026 02:56

gemini-code-assist bot reviewed Feb 6, 2026

cursor bot reviewed Feb 6, 2026

alexeykudinkin linked an issue Feb 7, 2026 that may be closed by this pull request

[data] Zero-sized blocks crashes write_bigquery #51892

Open

alexeykudinkin mentioned this pull request Feb 7, 2026

[data] Zero-sized blocks crashes write_bigquery #51892

Open

alexeykudinkin wants to merge 2 commits into master from ak/bq-empt-blk-fix
