Skip to content

Commit 7441c17

Browse files
yonromaiclaude
andcommitted
nemotron: set 4GB worker memory for CC download
Each Nemotron CC download worker decompresses a ~350MB zstd file to ~1.5-2GB in memory. The default ZephyrContext resources (1GB) caused OOMKill when workers exceeded their memory limit. Set 4GB to give sufficient headroom. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
1 parent d41887d commit 7441c17

File tree

1 file changed

+4
-1
lines changed

1 file changed

+4
-1
lines changed

lib/marin/src/marin/download/nemotron_cc/download_nemotron_cc.py

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -20,6 +20,7 @@
2020
from marin.download.nemotron_cc.utils import decompress_zstd_stream
2121
from marin.execution import THIS_OUTPUT_PATH
2222
from marin.utils import fsspec_exists
23+
from fray.cluster import ResourceConfig
2324
from zephyr import Dataset, ZephyrContext
2425
from zephyr.writers import atomic_rename
2526

@@ -102,7 +103,9 @@ def download_nemotron_cc(cfg: NemotronIngressConfig):
102103
.write_jsonl(os.path.join(cfg.output_path, ".metrics/download-{shard:05d}.jsonl"), skip_existing=True)
103104
)
104105

105-
ctx = ZephyrContext(name="download-nemotron-cc")
106+
# Each worker downloads a ~350MB zstd file and decompresses to ~1.5-2GB in memory.
107+
# Default ZephyrContext resources (1GB) causes OOMKill; 4GB gives sufficient headroom.
108+
ctx = ZephyrContext(name="download-nemotron-cc", resources=ResourceConfig(cpu=1, ram="4g"))
106109
ctx.execute(pipeline)
107110

108111
logger.info(f"Downloaded Nemotron CC files to {cfg.output_path}")

0 commit comments

Comments
 (0)