Commit e81bb3b

Merge remote-tracking branch 'origin/main' into weaver/marin-users-directory-for-output
# Conflicts:
#   lib/rigging/pyproject.toml
#   uv.lock
2 parents 7e9ab3e + 6d14d1c commit e81bb3b

78 files changed

Lines changed: 3323 additions & 584 deletions

.agents/projects/2026-06-20_rigging_connection_auth.md

Lines changed: 699 additions & 0 deletions
Large diffs are not rendered by default.

docs/explanations/executor.md

Lines changed: 2 additions & 1 deletion
@@ -44,7 +44,8 @@ from any region. The `mirrored()` wrapper marks an input path for
 marin prefix before the step runs.

 ```python
-from marin.execution.executor import mirrored, versioned
+from marin.execution.executor import mirrored
+from marin.execution.types import versioned

 step = ExecutorStep(
     name="train",

docs/references/resource-config.md

Lines changed: 1 addition & 1 deletion
@@ -15,7 +15,7 @@ tpu_multislice = ResourceConfig.with_tpu("v4-8", slice_count=2)

 # GPU configuration
 gpu_config = ResourceConfig.with_gpu("H100", count=8)
-gpu_auto = ResourceConfig.with_gpu()  # auto-detect GPU type
+single_gpu = ResourceConfig.with_gpu("H100")  # count defaults to 1

 # CPU-only configuration
 cpu_config = ResourceConfig.with_cpu()

docs/tutorials/executor-101.md

Lines changed: 1 addition & 1 deletion
@@ -46,8 +46,8 @@ from marin.execution.executor import (
     ExecutorStep,
     executor_main,
     output_path_of,
-    this_output_path
 )
+from marin.execution.types import this_output_path
 ```

 Key imports:
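
The documentation hunks in this commit move `versioned` and `this_output_path` from `marin.execution.executor` to `marin.execution.types`. For code that has to run on both sides of this merge, one option is a fallback import. The sketch below is only an illustration: `import_first` is a hypothetical helper, not part of marin, and it is demonstrated with standard-library modules so that it runs without marin installed.

```python
import importlib


def import_first(attr, module_names):
    """Return `attr` from the first module in `module_names` that provides it."""
    for module_name in module_names:
        try:
            module = importlib.import_module(module_name)
        except ImportError:
            continue  # module is missing in this version; try the next candidate
        if hasattr(module, attr):
            return getattr(module, attr)
    raise ImportError(f"{attr!r} not found in any of {list(module_names)}")


# Stand-in demonstration with stdlib modules (no marin required).
sqrt = import_first("sqrt", ["no_such_module_for_demo", "math"])
print(sqrt(9.0))  # 3.0
```

With marin installed, the equivalent call would presumably be `import_first("versioned", ["marin.execution.types", "marin.execution.executor"])`, which prefers the new location and falls back to the old one.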

docs/tutorials/first-experiment.md

Lines changed: 1 addition & 1 deletion
@@ -50,7 +50,7 @@ The tokenized dataset is a directory containing one file per shard of the datase

 ```python
 from levanter.data.text import TextLmDatasetFormat
-from marin.execution.executor import versioned
+from marin.execution.types import versioned
 from experiments.tokenization import default_tokenize
 from experiments.marin_models import marin_tokenizer

lib/finelog/AGENTS.md

Lines changed: 1 addition & 3 deletions
@@ -16,9 +16,7 @@ Start with the shared instructions in `/AGENTS.md`. Finelog-specific notes:
 - `src/finelog/store/` — `MemStore` (in-memory) and `DuckDBLogStore` (Parquet + DuckDB)
 - `src/finelog/server/` — `LogServiceImpl`, `StatsServiceImpl`, ASGI builder, CLI launcher
 - `src/finelog/client/` — `LogClient` (single user-facing entry; covers logs and stats),
-  `RemoteLogHandler`, error types in `errors.py`. `proxy.py` hosts
-  `LogServiceProxy`, an internal server-side adapter used when iris mounts the
-  log service as a forwarding proxy; not re-exported.
+  `RemoteLogHandler`, error types in `errors.py`.
 - `tests/` — store + server tests
 - `deploy/` — Dockerfile, k8s manifests, GCP snippets

Lines changed: 29 additions & 0 deletions
@@ -0,0 +1,29 @@
+# finelog log server for the CoreWeave US-EAST-02A cluster (cw-us-east-02a).
+#
+# Deployed as an in-cluster k8s Deployment + ClusterIP Service in the `iris`
+# namespace, the same namespace as the iris controller. iris references it from
+# its cluster config via `log_server_config: cw-us-east-02a`, which resolves to
+# the endpoint `k8s://finelog-cw-use02a.iris` ->
+# finelog-cw-use02a.iris.svc.cluster.local:10001.
+#
+# Archive: segments offload to Cloudflare R2 (s3://), the same object store the
+# cluster uses for controller state. `finelog deploy up` mints the
+# `finelog-cw-use02a-env` Secret from the operator's R2 creds + object_storage_endpoint
+# and projects it into the pod via envFrom (see deploy/k8s/02-deployment).
+#
+# Deploy: export KUBECONFIG=~/.kube/coreweave-iris-gpu and the R2 creds, then
+#   export R2_ACCESS_KEY_ID=... R2_SECRET_ACCESS_KEY=...
+#   uv run finelog deploy up cw-us-east-02a
+name: finelog-cw-use02a
+port: 10001
+image: ghcr.io/marin-community/finelog:latest
+remote_log_dir: s3://marin-na/finelog/cw-us-east-02a
+deployment:
+  k8s:
+    namespace: iris
+    # storage_class omitted -> the cluster's default StorageClass (shared-vast,
+    # VAST CSI) backs the PVC. Segments offload to R2, but size the local cache
+    # generously to absorb ingest bursts + L0->L1 compaction before offload.
+    storage_gb: 250
+    # R2 endpoint for the s3:// archive; folded into the minted creds Secret.
+    object_storage_endpoint: https://74981a43be0de7712369306c7b19133d.r2.cloudflarestorage.com
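
The header comment in this config says that `k8s://finelog-cw-use02a.iris` resolves to `finelog-cw-use02a.iris.svc.cluster.local:10001`. A minimal sketch of that mapping follows, assuming the endpoint has the form `k8s://<service>.<namespace>` and the port comes from the config's `port` field. `resolve_k8s_endpoint` is a hypothetical name used for illustration, not the resolver that iris or finelog actually ships.

```python
from urllib.parse import urlparse


def resolve_k8s_endpoint(endpoint: str, port: int) -> str:
    """Map k8s://<service>.<namespace> to the in-cluster DNS name plus port."""
    parsed = urlparse(endpoint)
    if parsed.scheme != "k8s":
        raise ValueError(f"expected a k8s:// endpoint, got {endpoint!r}")
    # Assumed layout: everything before the first dot is the Service name,
    # the remainder is the namespace.
    service, sep, namespace = parsed.netloc.partition(".")
    if not (service and sep and namespace):
        raise ValueError(f"expected k8s://<service>.<namespace>, got {endpoint!r}")
    return f"{service}.{namespace}.svc.cluster.local:{port}"


print(resolve_k8s_endpoint("k8s://finelog-cw-use02a.iris", 10001))
# finelog-cw-use02a.iris.svc.cluster.local:10001
```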

lib/finelog/config/marin.yaml

Lines changed: 1 addition & 0 deletions
@@ -2,6 +2,7 @@ name: finelog-marin
 port: 10001
 image: ghcr.io/marin-community/finelog:latest
 remote_log_dir: gs://marin-us-central2/finelog/marin
+client_url: iap+https://iris-marin.oa.dev/proxy/system.log-server
 deployment:
   gcp:
     project: hai-gcp-models

lib/finelog/deploy/k8s/02-deployment.yaml.tmpl

Lines changed: 9 additions & 0 deletions
@@ -44,6 +44,15 @@ spec:
             - name: rpc
               containerPort: {{ port }}
               protocol: TCP
+          # S3 credentials for an s3:// remote_log_dir. `finelog deploy up` mints
+          # the `{{ name }}-env` Secret (R2 AWS_* creds + AWS_ENDPOINT_URL) from
+          # the operator's env and projects it here. optional=true keeps the pod
+          # schedulable when no Secret exists (gs:// or local archives, where GCS
+          # uses workload identity). The explicit env below wins on key collision.
+          envFrom:
+            - secretRef:
+                name: {{ name }}-env
+                optional: true
           env:
             - name: FINELOG_PORT
               value: "{{ port }}"
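
The template comment above states two behaviors: explicit `env` entries win over `envFrom` values on key collision, and `optional: true` lets the pod start when the Secret does not exist. The sketch below models only that precedence in plain Python. `effective_env` and the sample values are illustrative assumptions, not finelog or Kubernetes code.

```python
def effective_env(secret_data, explicit_env):
    """Model the container environment built from envFrom plus explicit env.

    secret_data: key/value pairs projected from the Secret, or None when the
                 optional Secret is absent.
    explicit_env: key/value pairs from the container's `env:` list.
    """
    merged = dict(secret_data or {})  # a missing optional Secret contributes nothing
    merged.update(explicit_env)       # explicit env overrides colliding keys
    return merged


# Hypothetical values for illustration only.
secret = {
    "AWS_ACCESS_KEY_ID": "example-key",
    "AWS_ENDPOINT_URL": "https://r2.example.invalid",
    "FINELOG_PORT": "9999",
}
explicit = {"FINELOG_PORT": "10001"}

print(effective_env(secret, explicit)["FINELOG_PORT"])  # 10001
print(effective_env(None, explicit))  # {'FINELOG_PORT': '10001'}
```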

lib/finelog/rust/Cargo.lock

Lines changed: 1 addition & 0 deletions
Some generated files are not rendered by default.
