Skip to content
This repository was archived by the owner on Jan 12, 2026. It is now read-only.

Commit a7a46f0

Browse files
authored
Fix dataset conversion (#270)
Signed-off-by: Antoni Baum <antoni.baum@protonmail.com>
1 parent 3e563f9 commit a7a46f0

File tree

1 file changed

+7
-2
lines changed

1 file changed

+7
-2
lines changed

xgboost_ray/data_sources/ray_dataset.py

Lines changed: 7 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -57,8 +57,13 @@ def load_data(data: "ray.data.dataset.Dataset",
5757
else:
5858
data = [data[i] for i in indices]
5959

60-
local_df = [ds.to_pandas(limit=DATASET_TO_PANDAS_LIMIT) for ds in data]
61-
return Pandas.load_data(pd.concat(local_df, copy=False), ignore=ignore)
60+
if isinstance(data, ray.data.dataset.Dataset):
61+
local_df = data.to_pandas(limit=DATASET_TO_PANDAS_LIMIT)
62+
else:
63+
local_df = pd.concat(
64+
[ds.to_pandas(limit=DATASET_TO_PANDAS_LIMIT) for ds in data],
65+
copy=False)
66+
return Pandas.load_data(local_df, ignore=ignore)
6267

6368
@staticmethod
6469
def convert_to_series(data: Union["ray.data.dataset.Dataset", Sequence[

0 commit comments

Comments
 (0)