Conversation
Force-pushed from 21be2c9 to 1f04b26
rexminnis left a comment
Thanks for putting this together — the CommandLineUtilsBridge pattern and the SparkSubmit rework are clean solutions to the cross-version API drift. A few things I noticed:
- Bug: `spark340/SparkSqlUtils.toArrowRDD` has infinite recursion (see inline comment)
- Java target: `maven.compiler.source` is still `1.8` — worth bumping to 17?
- Spark version: `spark410.version` targets 4.1.0 — consider 4.1.1 (current release)
Happy to help with testing or any of the shim work. I have a working Spark 4.1.1 setup locally and have been validating the Arrow conversion paths end-to-end.
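For concreteness, the kind of end-to-end check I've been running looks roughly like this (a minimal sketch: the local-mode session setup and the import path for the shim's `SparkSqlUtils` are placeholders, not code from this PR):

```scala
import org.apache.spark.sql.SparkSession
// Hypothetical import path for the shim under review; adjust to the real package.
import org.apache.spark.sql.raydp.SparkSqlUtils

object ArrowPathSmokeTest {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[2]")
      .appName("arrow-path-smoke-test")
      .getOrCreate()
    try {
      val df = spark.range(0, 10000).toDF("id")
      // Round the DataFrame through the shim's Arrow conversion and verify
      // that we get non-empty serialized record batches back.
      val batches = SparkSqlUtils.toArrowRDD(df, spark).collect()
      assert(batches.nonEmpty, "expected at least one Arrow batch")
      assert(batches.forall(_.nonEmpty), "expected non-empty batch payloads")
    } finally {
      spark.stop()
    }
  }
}
```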
```scala
    ArrowUtils.toArrowSchema(schema = schema, timeZoneId = timeZoneId)
  }

  def toArrowRDD(dataFrame: DataFrame, sparkSession: SparkSession): RDD[Array[Byte]] = {
```
Bug — this is infinitely recursive. `SparkSqlUtils.toArrowRDD` calls itself:

```scala
def toArrowRDD(dataFrame: DataFrame, sparkSession: SparkSession): RDD[Array[Byte]] = {
  SparkSqlUtils.toArrowRDD(dataFrame, dataFrame.sparkSession)
}
```

This will `StackOverflowError` at runtime. Should be `dataFrame.toArrowBatchRdd` like the other shims (spark322, spark330, spark350).
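For reference, a sketch of the fix mirroring the spark350 shim (assuming this shim, like the others, lives in a package that can reach Spark's `private[sql]` `toArrowBatchRdd`):

```scala
def toArrowRDD(dataFrame: DataFrame, sparkSession: SparkSession): RDD[Array[Byte]] = {
  // Delegate to Spark's built-in Arrow batch conversion instead of recursing.
  dataFrame.toArrowBatchRdd
}
```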
good catch, fixed.
```
@@ -29,9 +31,9 @@
     <project.reporting.outputEncoding>UTF-8</project.reporting.outputEncoding>
```
The Maven compiler source/target is still 1.8. Since Spark 4.x requires Java 17 at runtime and CI now uses JDK 17, should we bump the compile target to 17 as well? This would catch any bytecode-level incompatibilities at compile time rather than runtime.
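Concretely, that would be something like this in the parent pom (a sketch of the suggested change; the property names match the existing fragment, only the values move from `1.8` to `17`):

```xml
<properties>
  <!-- Bump from 1.8: Spark 4.x requires Java 17, so catch bytecode issues at compile time. -->
  <maven.compiler.source>17</maven.compiler.source>
  <maven.compiler.target>17</maven.compiler.target>
</properties>
```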
```xml
    <spark340.version>3.4.0</spark340.version>
    <spark350.version>3.5.0</spark350.version>
    <spark400.version>4.0.0</spark400.version>
    <spark410.version>4.1.0</spark410.version>
```
Minor: `spark410.version` is 4.1.0 — worth bumping to 4.1.1 (current release)? The `SparkShimProvider` already covers it at runtime, but compiling against the latest patch would catch any API changes at build time.
I would keep it 4.1.0 -- the idea is that we should support the minimum API from the initial version; otherwise the lib might introduce breaking changes between Spark's patch versions (Spark is supposed to be backward compatible across patch releases).
Force-pushed from 7acc670 to c40d89d
Force-pushed from 26d576d to ac217b9
This PR adapts raydp to Spark 4.x but leaves the following work for future improvement:
To make the tests pass, this PR is based on #458. Once #458 is merged, this PR should be rebased.