naive statement splitter #142

psteinroe · 2024-10-04T21:05:11Z

Naive attempt at splitting statements.

Implements the splitter as a simple Pratt parser. The approach is now the opposite of what I tried before: instead of assuming the splitter works perfectly as long as we cover every possible edge case, we assume it does not work at all. We want the important cases to work (mostly DML, plus a few DDL statements) and ask users to fall back to a double newline if they see odd behavior coming from the LSP.

Make it work well for all DML statements and all CREATE (OR REPLACE) DDL statements.
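
To illustrate the double-newline fallback mentioned above, here is a minimal sketch, not the code from this PR. It assumes that a blank line always ends the current statement; the function name `split_on_blank_lines` and the use of byte ranges as the return type are hypothetical choices for the example.

```rust
use std::ops::Range;

/// Splits `sql` into chunks separated by one or more blank lines and
/// returns the byte range of each chunk (hypothetical helper).
fn split_on_blank_lines(sql: &str) -> Vec<Range<usize>> {
    let mut ranges = Vec::new();
    let mut start: Option<usize> = None; // start offset of the current chunk
    let mut end = 0; // end offset of the last non-blank line seen
    let mut offset = 0; // byte offset of the current line

    for line in sql.split_inclusive('\n') {
        if line.trim().is_empty() {
            // A blank line closes the chunk that is currently open, if any.
            if let Some(s) = start.take() {
                ranges.push(s..end);
            }
        } else {
            if start.is_none() {
                start = Some(offset);
            }
            // Exclude the trailing newline and whitespace from the range.
            end = offset + line.trim_end().len();
        }
        offset += line.len();
    }

    // Close the last chunk when the input does not end with a blank line.
    if let Some(s) = start {
        ranges.push(s..end);
    }
    ranges
}
```

With this sketch, two statements separated by an empty line always end up in separate ranges, regardless of whether a smarter splitter understands their syntax.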

juleswritescode

Wow, this is so much simpler than the previous implementation. I love it.

crates/pg_statement_splitter/src/parser/dml.rs

crates/pg_statement_splitter/src/lib.rs

crates/pg_statement_splitter/src/parser.rs

crates/pg_statement_splitter/src/parser/data.rs

juleswritescode · 2024-10-18T05:56:58Z

crates/pg_statement_splitter/src/parser/common.rs

+            // delete(p);
+        }
+        t => {
+            panic!("stmt: Unknown token {:?}", t);


Suggested change

-            panic!("stmt: Unknown token {:?}", t);
+            panic!("stmt: Unknown start token {:?}", t);

crates/pg_statement_splitter/src/parser.rs

psteinroe · 2024-10-18T09:47:21Z

@juleswritescode I refactored the parser to work with an index into the token vector. This comes with a few benefits:

no cloning
no "save last token" anymore
just a single pass over tokens since we do not filter and clone on init
no impact for the user of the parser
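
The index-based design described above could look roughly like the following sketch. It is an illustration under assumptions, not the code from this PR: the names `Token`, `TokenKind`, `Parser`, `pos`, `advance`, and `skip_trivia` are hypothetical. The point is that the token vector is never filtered, popped, or cloned; whitespace is skipped lazily while the index moves forward.

```rust
#[derive(Debug, Clone, Copy, PartialEq)]
enum TokenKind {
    Keyword,
    Ident,
    Whitespace,
    Semicolon,
    Eof,
}

#[derive(Debug, PartialEq)]
struct Token {
    kind: TokenKind,
    text: String,
}

struct Parser {
    tokens: Vec<Token>, // owned once, never filtered or cloned
    pos: usize,         // index of the current token
    eof: Token,         // returned by `peek` once the input is exhausted
}

impl Parser {
    fn new(tokens: Vec<Token>) -> Self {
        let eof = Token { kind: TokenKind::Eof, text: String::new() };
        let mut parser = Parser { tokens, pos: 0, eof };
        parser.skip_trivia();
        parser
    }

    /// Borrows the current token; no "last token" needs to be saved.
    fn peek(&self) -> &Token {
        self.tokens.get(self.pos).unwrap_or(&self.eof)
    }

    /// Moves to the next non-whitespace token.
    fn advance(&mut self) {
        if self.pos < self.tokens.len() {
            self.pos += 1;
        }
        self.skip_trivia();
    }

    /// Skips whitespace in place instead of filtering it out up front.
    fn skip_trivia(&mut self) {
        while matches!(
            self.tokens.get(self.pos),
            Some(t) if t.kind == TokenKind::Whitespace
        ) {
            self.pos += 1;
        }
    }
}
```

Because callers only ever see `peek` and `advance`, switching from popping to indexing does not change how the parser is used.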

edit: added a little test helper

crates/pg_statement_splitter/src/parser.rs

crates/pg_statement_splitter/src/lib.rs

psteinroe · 2024-10-20T15:47:30Z

@juleswritescode I think this is ready now. DML statements should work well enough.

juleswritescode

amazing work 💪

juleswritescode · 2024-10-21T05:19:29Z

crates/pg_statement_splitter/tests/skipped.txt

@@ -1,12 +0,0 @@
-brin


YEAH ⛹️

juleswritescode · 2024-10-21T16:52:23Z

crates/pg_base_db/src/change.rs

-                                r.start(),
-                                r.end() + TextSize::from(self.diff_size()),
-                            );
+                            *r = TextRange::new(r.start(), r.end() + self.diff_size());


hmm, strange that there's no method on the type for increasing the range 🤷

juleswritescode · 2024-10-21T16:55:00Z

crates/pg_base_db/src/document.rs

+            statement_ranges: text.as_ref().map_or_else(Vec::new, |f| {
+                pg_statement_splitter::split(f).ranges.to_vec()
+            }),


pretty cool changes here in the file!

juleswritescode · 2024-10-21T17:08:03Z

crates/pg_statement_splitter/src/parser.rs

+        self.errors.push(SyntaxError::new(
+            format!("Expected {:#?}", kind),
+            self.peek().span,
+        ));


very nice! much cleaner.
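
For context, the pattern in the snippet above (recording a `SyntaxError` instead of panicking) can be sketched as follows. Only `SyntaxError`, `errors`, `peek`, and `span` appear in the diff; the `Cursor`, `Tok`, `Kind`, and `Span` types are hypothetical stand-ins, and whether the real `expect` advances on a mismatch is an assumption (here it does not).

```rust
#[derive(Debug, Clone, Copy, PartialEq)]
enum Kind {
    Select,
    Ident,
    Semicolon,
    Eof,
}

#[derive(Debug, Clone, Copy, PartialEq)]
struct Span {
    start: usize,
    end: usize,
}

#[derive(Debug, Clone, Copy)]
struct Tok {
    kind: Kind,
    span: Span,
}

#[derive(Debug, PartialEq)]
struct SyntaxError {
    message: String,
    span: Span,
}

struct Cursor {
    tokens: Vec<Tok>,
    pos: usize,
    errors: Vec<SyntaxError>,
}

impl Cursor {
    /// Returns the current token, or a synthetic Eof token at the end of input.
    fn peek(&self) -> Tok {
        let end = self.tokens.last().map_or(0, |t| t.span.end);
        self.tokens
            .get(self.pos)
            .copied()
            .unwrap_or(Tok { kind: Kind::Eof, span: Span { start: end, end } })
    }

    /// Consumes the current token if it matches `kind`; otherwise records
    /// an error at the current token's span and leaves the position unchanged.
    fn expect(&mut self, kind: Kind) -> bool {
        let current = self.peek();
        if current.kind == kind {
            self.pos += 1;
            true
        } else {
            self.errors.push(SyntaxError {
                message: format!("Expected {:?}", kind),
                span: current.span,
            });
            false
        }
    }
}
```

Collecting errors this way lets the splitter keep going after unexpected input, which fits the "assume it does not work at all" approach from the PR description.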

fix: save

79e2810

psteinroe changed the title ~~fix: save~~ naive statement splitter Oct 4, 2024

psteinroe added 2 commits October 5, 2024 18:17

fix: save

9ae67e0

fix: ci

0d757f6

juleswritescode reviewed Oct 18, 2024

psteinroe added 2 commits October 18, 2024 08:58

fix: address pr feedback

58c0374

refactor: parser now uses a pointer into the token vector instead of popping and cloning

3849cf7

psteinroe requested a review from juleswritescode October 18, 2024 09:47

add test helper

4e9dc81

juleswritescode reviewed Oct 18, 2024

crates/pg_statement_splitter/src/parser.rs

crates/pg_statement_splitter/src/lib.rs

psteinroe added 4 commits October 19, 2024 11:02

feat: add remaining dml statements

345b1ec

cleanup stmts

2d130d7

fix: handle insert update delete and select within unknown

d21f261

fix: cleanup and fix some clippy warnings (sorry, unrelated)

1308770

psteinroe marked this pull request as ready for review October 20, 2024 15:43

fix: handle create rule with select

e2dba40

psteinroe requested a review from juleswritescode October 20, 2024 15:46

psteinroe added 2 commits October 20, 2024 17:48

fix: make ntest a dev dep

9f15953

fix: build error

5f08420

juleswritescode approved these changes Oct 21, 2024

psteinroe merged commit b930638 into main Oct 21, 2024
1 check passed
