When executing a COPY command with a jsonb/json column, 1 core out of 16 is loaded. #947
Unanswered
tsorokin36
asked this question in
Q&A
Replies: 1 comment
-
Hi, this is expected. We rely on Postgres utility functions for JSON/JSONB (also list, varbit, bit type) conversion (from pg layout to duckdb's). Since those utility functions are not thread safe, we limit the conversion to single thread. The related PR for parallel pg scan: #762 (comment) |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I'm running the following query on an ec2 m7g.4xlarge with postgres 15 and pg_duckdb 1.0.0 installed :
I see that out of 16 cores, one is running at 100%, and there's minimal reading from EBS.
My table weighs more than 1 TB, but I don't think it matters.
When I run a query without JSON/JSONB columns, htop shows the expected 16 cores are running at 50-70%, and maximum reading from EBS.
Why are other cores being ignored when working with JSON/JSONB? Are there ways to achieve the same parallelism with these columns?
Beta Was this translation helpful? Give feedback.
All reactions