Skip to content

Commit 91cba48

Browse files
ci: Bump pyarrow version in CI to 19.0.1 (#4301)
## Changes Made Arrow 19.0.0 is unable to read parquet files written by newer versions of arrow with statistics. The issue arises from an optimization where level histograms in statistics (e.g. the level histogram used for repetition levels) is omitted when the max level is 0. For more details, see apache/arrow#45283 At the same time we can't bump to pyarrow 20.0.0 because this is incompatible with ray, see ray-project/ray#52685 This means we should use the minor release of arrow 19, i.e. arrow 19.0.1
1 parent 1e2a545 commit 91cba48

File tree

4 files changed

+7
-4
lines changed

4 files changed

+7
-4
lines changed

.github/workflows/pr-test-suite.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -22,7 +22,7 @@ jobs:
2222
matrix:
2323
python-version: ['3.9', '3.10']
2424
daft-runner: [py, ray, native]
25-
pyarrow-version: [8.0.0, 19.0.0]
25+
pyarrow-version: [8.0.0, 19.0.1]
2626
enable-aqe: [1, 0]
2727
os: [ubuntu-latest, macos-latest]
2828
exclude:
Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
pytest==7.4.0
22
pytest-benchmark==4.0.0
33
pytest-memray==1.4.1
4-
pyarrow==19.0.0
4+
pyarrow==19.0.1
55
boto3==1.28.3

requirements-dev.txt

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -40,7 +40,10 @@ duckdb==1.1.2
4040
tqdm
4141

4242
# Pyarrow
43-
pyarrow==19.0.0
43+
# Pyarrow 19.0.0 has a forwards compatibility bug so we use pyarrow 19.0.1 instead.
44+
# Pyarrow 20.0.0 is incompatible with Ray hence we can't use newer versions for now.
45+
# See: https://github.com/Eventual-Inc/Daft/pull/4301
46+
pyarrow==19.0.1
4447
# Ray
4548
ray[data, client]==2.34.0
4649

tests/integration/io/docker-compose/retry_server/retry-server-requirements.txt

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,7 @@ uvicorn==0.23.2
1717
uvloop==0.17.0
1818
watchfiles==0.19.0
1919
websockets==11.0.3
20-
pyarrow==19.0.0
20+
pyarrow==19.0.1
2121
slowapi==0.1.8
2222

2323
# Pin numpy version otherwise pyarrow doesn't work

0 commit comments

Comments
 (0)