(WIP) Add benchmarks comparing performance of parse functions #50

duckinator · 2025-02-14T00:23:49Z

Additional changes that were needed for benchmarking purposes, but may not be wanted:

adding pg_query_raw_parse to the bindings
exposing pg_query::bindings

…hmark flamegraphs.

duckinator · 2025-02-14T02:14:32Z

Summary

I couldn't find a way to expose pg_query_raw_parse(), so I focused on the rest of the task.

For this query, pg_query_parse_protobuf() seems to take 5x as long as pg_query_parse().

Details

As I discussed with @seanlinsley on Slack, I couldn't find a way to expose pg_query_raw_parse(), so I set that aside since this work is time-boxed.

You can run cargo bench to get benchmark output. Unfortunately, in order to accommodate per-benchmark flamegraph generation, the cargo bench output is a bit messier.

cargo bench output:

     Running benches/parse.rs (target/release/deps/parse-e7cc8bbbe183b46f)
Starting: Running benchmark(s). Stand by!

•

Method                 Mean        Samples
------------------------------------------
pg_query_parse    922.41 μs    2,463/2,500

     Running benches/parse_protobuf.rs (target/release/deps/parse_protobuf-93fcb930536d0376)
Starting: Running benchmark(s). Stand by!

•

Method                        Mean        Samples    Change
-----------------------------------------------------------
pg_query_parse_protobuf    5.03 ms    1,934/1,960    -0.71%

Flamegraphs

pg_query_parse():

pg_query_parse_protobuf():

duckinator · 2025-02-14T02:37:06Z

The initial flamegraph for pg_query_parse_protobuf() was wrong. Regenerating it fixed the problem, and I uploaded a new one + updated my summary accordingly.

lfittl · 2025-02-14T21:15:41Z

benches/parse_protobuf.rs

+brunch::benches!(
+    Bench::new("pg_query_parse_protobuf")
+        .run_seeded_with(c_seed, |query| {
+            unsafe { pg_query_parse_protobuf(query.as_ptr() as *const c_char) }


It might be interesting to include the deserialization in Rust as well, so we can get a sense for where to optimize (if we were to optimize the serialization itself).

For the protobuf parse benchmark we can use the same mechanism the crate currently uses. For JSON we could (just for testing) use the mechanism that pg_parse uses (which shares a common history with this crate, but we since diverged to focus on the Protobuf format).

vrmiguel · 2025-03-29T17:55:51Z

Probably out of scope of this PR but a cool addition would be comparisons to datafusion-sqlparser-rs

seanlinsley · 2025-06-08T16:53:47Z

Cargo.toml

@@ -30,3 +30,15 @@ glob = "0.3.1"
 easy-parallel = "3.2.0"
 pretty_assertions = "1.4.0"
 regex = "1.6.0"
+brunch = "0.8.*"


Crates should be ordered alphabetically. Also it's better to use 0.8 (now 0.10) when you don't intend to pin to a patch version.

seanlinsley · 2025-06-17T18:31:41Z

Cargo.toml

+
+[[bench]]
+name = "parse_protobuf"
+harness = false


Since the actual code in the benchmarks is so simple, I wonder if they should be merged into a single benchmark file:

use brunch::Bench; use pg_query::bindings::*; use std::ffi::{c_char, CString}; brunch::benches!( Bench::new("parse_json").run_seeded_with(seed, |query| unsafe { pg_query_parse(query.as_ptr() as *const c_char) }), Bench::new("parse_protobuf").run_seeded_with(seed, |query| unsafe { pg_query_parse_protobuf(query.as_ptr() as *const c_char) }), Bench::new("parse_summary").run_seeded_with(seed, |query| unsafe { pg_query_parse_summary(query.as_ptr() as *const c_char, 0, 0) }), ); fn seed() -> CString { CString::new(build_query(100)).unwrap() } fn build_query(table_references: i32) -> String { let mut query = "SELECT * FROM t".to_string(); for i in 0..table_references { query = format!("{query} JOIN t{i} ON t.id = t{i}.t_id AND t{i}.k IN (1, 2, 3, 4) AND t{i}.f IN (SELECT o FROM p WHERE q = 'foo')"); } query }

Add pg_query_parse and pg_query_parse_protobuf benchmarks.

49d61b1

duckinator force-pushed the benchmarks branch from 2871b73 to 49d61b1 Compare February 14, 2025 01:45

Split benchmarks into multiple files, to allow generation of per-benc…

99a414a

…hmark flamegraphs.

lfittl reviewed Feb 14, 2025

View reviewed changes

seanlinsley reviewed Jun 17, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

(WIP) Add benchmarks comparing performance of parse functions #50

(WIP) Add benchmarks comparing performance of parse functions #50

Uh oh!

duckinator commented Feb 14, 2025

Uh oh!

duckinator commented Feb 14, 2025 •

edited

Loading

Uh oh!

duckinator commented Feb 14, 2025

Uh oh!

lfittl Feb 14, 2025

Uh oh!

vrmiguel commented Mar 29, 2025

Uh oh!

seanlinsley Jun 8, 2025

Uh oh!

seanlinsley Jun 17, 2025

Uh oh!

Uh oh!

(WIP) Add benchmarks comparing performance of parse functions #50

Are you sure you want to change the base?

(WIP) Add benchmarks comparing performance of parse functions #50

Uh oh!

Conversation

duckinator commented Feb 14, 2025

Uh oh!

duckinator commented Feb 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Details

Uh oh!

duckinator commented Feb 14, 2025

Uh oh!

lfittl Feb 14, 2025

Choose a reason for hiding this comment

Uh oh!

vrmiguel commented Mar 29, 2025

Uh oh!

seanlinsley Jun 8, 2025

Choose a reason for hiding this comment

Uh oh!

seanlinsley Jun 17, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

duckinator commented Feb 14, 2025 •

edited

Loading