CPU Memory exhausted when loading data with random_grid_cell_assignment() #1578
Unanswered
trettelbach
asked this question in
Q&A
Replies: 1 comment 5 replies
-
A couple comments:
|
Beta Was this translation helpful? Give feedback.
5 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi,
I'm currently loading data with the
random_grid_cell_assignment(self.dataset_full, [0.7, 0.15, 0.15], grid_size=6)
. I hadnum_workers=4
.The dataset I was loading consisted of 3 files, each 15-16 GB in size; ca. 47 GB total.
When I ran the code, and monitored my resources with
top
, I saw that each individual process had loaded 47+ GB into its memory; cumulatively crashing my workstation as it ran out of CPU memory.Is there a way that not all datasets are loaded entirely with all processes?
Thanks!
Beta Was this translation helpful? Give feedback.
All reactions