feat: Emit warning with Diagnostic when doing = Null #15696

changsun20 · 2025-04-13T06:04:02Z

Which issue does this PR close?

Rationale for this change

This PR addresses a common SQL anti-pattern where users accidentally use = NULL instead of IS NULL. While syntactically valid, this comparison always returns NULL in SQL and often indicates a developer mistake. The changes help users identify this pitfall through rich warnings while maintaining full query execution capabilities.

What changes are included in this PR?

Added warning detection for = NULL comparisons in predicate contexts
Implemented span-based diagnostics highlighting the problematic expression
Enhanced SQL parser integration with upgraded sqlparser dependency (0.55+)
Warning collection plumbing using non-intrusive RefCell storage
Added help messages suggesting IS NULL alternative
Test coverage for single and multiple occurrences

Are these changes tested?

Yes, this PR includes:

Unit tests for single = NULL detection
Validation of multiple warnings in complex expressions
Span position verification
Help message content checks

Are there any user-facing changes?

Yes, but non-breaking. No API changes or behavior modifications - existing queries will still execute normally but may produce additional warnings.

changsun20 · 2025-04-13T06:31:42Z

Hi @eliaperantoni,

Thank you for your patience and guidance throughout this issue. I've implemented the core functionality per our discussions, but would like to confirm a few implementation details:

Predicate Context Validation
The warning detection is integrated during BinaryExpr processing, which should naturally limit it to predicate contexts. Statements like UPDATE users SET password = NULL won't trigger false warnings by default. Could you confirm this approach is acceptable?
Span Handling Strategy
In usual cases, the left operand is an Identifier. The current implementation combines the identifier's left span with NULL's right span for precise highlighting. For rare non-identifier cases (e.g., some complex expressions that I can't immediately come up with one right now), we fall back to using just the NULL span. This balances precision with robustness.
Test Coverage Request
While I've added tests for basic and multiple = NULL cases, could you suggest any edge cases or additional scenarios that should be validated?

I appreciate your expertise in reviewing these implementation choices. Please let me know if any adjustments are needed.

comphead

Thanks @changsun20 wondering if its possible to test those warnings in integration slt test files?

changsun20 · 2025-04-13T19:07:00Z

Thanks @changsun20 wondering if its possible to test those warnings in integration slt test files?

Thank you for the thoughtful question, @comphead. I appreciate your focus on validation through integration tests. The current implementation prioritizes unit testing for the diagnostic infrastructure itself, as the warnings are collected internally via SqlToRel and not yet exposed through user-facing APIs. This approach allows us to validate the core logic while minimizing disruption to existing systems.

I fully agree that end-to-end validation through sqllogictest becomes critical as we evolve toward a stable warning reporting interface. Building on the foundation from #14429, I'm committed to driving the design of a unified API surface for diagnostic propagation that would benefit both this implementation and future error reporting improvements. A follow-up ticket seems ideal to address these aspects systematically, ensuring we maintain both test coverage and architectural clarity.

If the community prefers earlier integration testing, I'm happy to explore interim solutions - perhaps a temporary hook for test validation. Let me know your preference, and I'll adapt accordingly.

comphead · 2025-04-13T21:08:46Z

Yeah, that was actually my question having the warnings without being returned to the end user, who is supposed to react on the warnings? 🤔

changsun20 · 2025-04-14T21:02:40Z

@comphead I understand your concern. If displaying warnings to end users is what you'd like to see in this PR, could you confirm if @eliaperantoni's proposed solution in #14434 of "replacing Result with DatafusionResult" aligns with what you're thinking?

My concern is that this approach might be too invasive, since we're dealing with warnings that should be passed to the end without interrupting execution, rather than immediate errors. However, as discussed in the issue, this could be more robust long-term.

Please share your thoughts and preference. If this is the direction the community chooses, I'll convert this PR to draft status and implement that approach later.

comphead · 2025-04-15T15:20:49Z

Thanks @changsun20 if I understood correctly #14434 is for emitting events for the users, the same way it is done for Errors, but without halting the query.

changsun20 · 2025-04-15T23:39:39Z

Thanks @changsun20 if I understood correctly #14434 is for emitting events for the users, the same way it is done for Errors, but without halting the query.

Thank you for the feedback. I'll implement it as soon as I have time.

eliaperantoni · 2025-04-22T13:51:11Z

The warning detection is integrated during BinaryExpr processing, which should naturally limit it to predicate contexts. Statements like UPDATE users SET password = NULL won't trigger false warnings by default. Could you confirm this approach is acceptable?

Absolutely, that sounds good. And very thoughtful of you to check that UPDATE is not affected :)

In usual cases, the left operand is an Identifier. The current implementation combines the identifier's left span with NULL's right span for precise highlighting. For rare non-identifier cases (e.g., some complex expressions that I can't immediately come up with one right now), we fall back to using just the NULL span. This balances precision with robustness.

Nice! 💯

While I've added tests for basic and multiple = NULL cases, could you suggest any edge cases or additional scenarios that should be validated?

Perhaps accessing fields of structs? eg:

SELECT get_field({'x': null}, 'x') = null;

changsun20 · 2025-04-25T23:56:52Z

Perhaps accessing fields of structs? eg:
SELECT get_field({'x': null}, 'x') = null;

Thanks for pointing that out, I'll take that into consideration. As for this PR improvement, I think I may still need to postpone it until the end of next week as there is so much going on near the end of the semester. Thank you for your patience.

github-actions bot added the sql SQL Planner label Apr 13, 2025

comphead reviewed Apr 13, 2025

View reviewed changes

changsun20 added 2 commits April 13, 2025 17:01

feat: Emit warning with when doing = Null

5dd2ac3

fix: fix clippy warnings

85ecef1

changsun20 force-pushed the feat/emit-warning-for-=-null branch from b7c80a2 to 85ecef1 Compare April 13, 2025 21:02

changsun20 marked this pull request as draft April 16, 2025 01:08

Merge branch 'main' into feat/emit-warning-for-=-null

df3ddcc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Emit warning with Diagnostic when doing = Null #15696

feat: Emit warning with Diagnostic when doing = Null #15696

Uh oh!

changsun20 commented Apr 13, 2025

Uh oh!

changsun20 commented Apr 13, 2025

Uh oh!

comphead left a comment

Uh oh!

changsun20 commented Apr 13, 2025

Uh oh!

comphead commented Apr 13, 2025

Uh oh!

changsun20 commented Apr 14, 2025

Uh oh!

comphead commented Apr 15, 2025

Uh oh!

changsun20 commented Apr 15, 2025

Uh oh!

eliaperantoni commented Apr 22, 2025

Uh oh!

changsun20 commented Apr 25, 2025

Uh oh!

Uh oh!

feat: Emit warning with Diagnostic when doing = Null #15696

Are you sure you want to change the base?

feat: Emit warning with Diagnostic when doing = Null #15696

Uh oh!

Conversation

changsun20 commented Apr 13, 2025

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

Uh oh!

changsun20 commented Apr 13, 2025

Uh oh!

comphead left a comment

Choose a reason for hiding this comment

Uh oh!

changsun20 commented Apr 13, 2025

Uh oh!

comphead commented Apr 13, 2025

Uh oh!

changsun20 commented Apr 14, 2025

Uh oh!

comphead commented Apr 15, 2025

Uh oh!

changsun20 commented Apr 15, 2025

Uh oh!

eliaperantoni commented Apr 22, 2025

Uh oh!

changsun20 commented Apr 25, 2025

Uh oh!

Uh oh!