README.md: 1 addition & 1 deletion
@@ -530,7 +530,7 @@ Once `share.remote` is configured, read commands auto-fetch and import when the
Hybrid mode is supported too: keep normal Discord credentials configured and set `share.remote`. `discrawl sync --update=auto` and `discrawl messages --sync` import the Git snapshot first, usually as a changed-shard delta, then use live Discord for latest-message deltas. Use `sync --all-channels` or `sync --full` when you intentionally want a broader live repair/backfill pass.
-
Git snapshots publish non-DM archive tables and cached non-DM attachment media by default. DMs, desktop wiretap rows, DM media, and local secrets are never exported. Use `publish --no-media` to omit cached media files.
+
Git snapshots publish non-DM archive tables and cached non-DM attachment media by default. Cached media is written as gzip-compressed files under `media/` and restored to raw local cache files on import. Older snapshots that contain raw media files still import, and the next media-enabled `publish` rewrites the media tree into gzip form. DMs, desktop wiretap rows, DM media, and local secrets are never exported. Use `publish --no-media` to omit cached media files.
Subscribers can use `subscribe --no-media` or `update --no-media` to import only SQLite rows and skip restoring cached files.
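The gzip storage and raw-file fallback described in this hunk could look roughly like the following Python sketch. It is not discrawl's implementation: the function names, the `.gz` suffix, and the magic-byte check are assumptions made for illustration only.

```python
import gzip
import shutil
import tempfile
from pathlib import Path

GZIP_MAGIC = b"\x1f\x8b"  # first two bytes of every gzip stream (RFC 1952)


def publish_media(cache_file: Path, snapshot_media_dir: Path) -> Path:
    """Write a gzip-compressed copy of one cached file into the snapshot."""
    snapshot_media_dir.mkdir(parents=True, exist_ok=True)
    # Assumption: compressed files get a ".gz" suffix; the real naming may differ.
    target = snapshot_media_dir / (cache_file.name + ".gz")
    with cache_file.open("rb") as src, gzip.open(target, "wb") as dst:
        shutil.copyfileobj(src, dst)
    return target


def import_media(snapshot_file: Path, cache_dir: Path) -> Path:
    """Restore one snapshot file to raw bytes in the local cache."""
    cache_dir.mkdir(parents=True, exist_ok=True)
    with snapshot_file.open("rb") as probe:
        is_gzip = probe.read(2) == GZIP_MAGIC
    if is_gzip:
        target = cache_dir / snapshot_file.name.removesuffix(".gz")
        with gzip.open(snapshot_file, "rb") as src, target.open("wb") as dst:
            shutil.copyfileobj(src, dst)
    else:
        # Older snapshot layout: the file is already raw, so copy it unchanged.
        # Caveat: a raw attachment that is itself a gzip file would be
        # misdetected by this naive check; a real tool needs a stronger signal.
        target = cache_dir / snapshot_file.name
        shutil.copyfile(snapshot_file, target)
    return target


def demo() -> tuple[bytes, bytes]:
    """Round-trip one compressed file and one legacy raw file."""
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        original = root / "cache" / "image.png"
        original.parent.mkdir()
        original.write_bytes(b"\x89PNG fake image bytes" * 100)
        published = publish_media(original, root / "snapshot" / "media")
        restored = import_media(published, root / "restored")

        legacy = root / "snapshot" / "media" / "old.bin"
        legacy.write_bytes(b"raw legacy bytes")
        restored_legacy = import_media(legacy, root / "restored")
        return restored.read_bytes(), restored_legacy.read_bytes()
```

Calling `demo()` returns the restored bytes for both cases, so a caller can compare them with the originals to confirm that compression is lossless and that raw files pass through untouched.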
Media backup is a two-step publisher workflow: first fetch bytes with `discrawl sync --with-media` or `discrawl attachments fetch`, then publish with `discrawl publish --push`. Scheduled publishers that should include media can set `sync.attachment_media = true` and leave `share.media = true`, which is the default. `publish` never downloads missing Discord files by itself; it exports only media already present in the local cache.
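As a rough illustration of how the two settings named above interact, the sketch below models the decision in Python. The dictionary layout mirrors the `sync.attachment_media` and `share.media` keys from the text; the helper names are hypothetical, and the assumption that `sync.attachment_media` is off unless set is inferred from the text rather than stated in it.

```python
def sync_fetches_media(config: dict, with_media_flag: bool = False) -> bool:
    """True when a sync run would download attachment bytes into the cache."""
    # Assumption: sync.attachment_media is treated as false when absent.
    configured = config.get("sync", {}).get("attachment_media", False)
    return bool(with_media_flag or configured)


def publish_exports_media(config: dict, no_media_flag: bool = False) -> bool:
    """True when publish would copy cached non-DM media into the snapshot."""
    # share.media defaults to true according to the text above.
    configured = config.get("share", {}).get("media", True)
    return bool(configured and not no_media_flag)


def media_reaches_snapshot(config: dict) -> bool:
    """Media lands in Git only if it was fetched first and publish exports it."""
    return sync_fetches_media(config) and publish_exports_media(config)
```

With an empty configuration, `publish_exports_media({})` is true but `media_reaches_snapshot({})` is false under these assumptions, which matches the point that `publish` exports only what an earlier fetch step already cached.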
- media bytes are stored under `cache_dir/media`, not in SQLite
- SQLite stores attachment metadata, content hash, cached media path, fetch status, and errors
- Discord CDN URLs can expire or be removed; those fetches are recorded as failed with their HTTP status, commonly `404`
-
- `attachments fetch` only populates the local cache; run `publish --push` afterward to copy cached non-DM media into the Git snapshot repo
-
- `publish` backs up cached non-DM media files by default; use `publish --no-media` to omit them
+
- `attachments fetch` only populates the local cache; run `publish --push` afterward to copy cached non-DM media into the Git snapshot repo as gzip-compressed files
+
- `publish` backs up cached non-DM media files by default and migrates older raw snapshot media to gzip form; use `publish --no-media` to omit them
- `@me` DM media is local-only and is not published to Git snapshots
docs/commands/sync.md: 1 addition & 1 deletion
@@ -68,7 +68,7 @@ discrawl sync --with-media
- `--latest-only` is the default for untargeted `sync`. Use `--all-channels` to opt out without doing a full historical crawl.
- `--with-media` records expired or removed Discord CDN URLs as failed fetches with the HTTP status, commonly `404`.
-
- `--with-media` updates the local cache only; run `publish --push` afterward to include cached non-DM media in the Git backup.
+
- `--with-media` updates the local cache only; run `publish --push` afterward to include cached non-DM media in the Git backup as gzip-compressed files.
- `--since` does not mark older history as complete, so a later `sync --full` without `--since` can continue the backfill.
- Long runs emit periodic progress logs to stderr.
- Heartbeat logs (`message sync waiting`) name the oldest active channel and per-channel page activity if in-flight channels stop completing for a while.
docs/configuration.md: 1 addition & 1 deletion
@@ -122,7 +122,7 @@ Set `discord.token_source = "keyring"` if you want to require keyring lookup and
- changing `[search.embeddings]` provider/model/input version retargets pending jobs and resets prior attempts; existing vectors for another identity remain in SQLite but are not used for semantic search
- changing `db_path` does not migrate existing data; copy the file yourself if you want to keep history
- `sync.attachment_media = true` makes `sync` behave like `sync --with-media`; media bytes are cached under `cache_dir/media`, and CDN `404`/other fetch failures are recorded on attachment rows
-
- `share.media = false` makes publish/update/auto-update omit or skip restoring cached media; `subscribe --no-media` writes this for Git-only readers. With the default `share.media = true`, publish/update include cached non-DM media, but publish does not fetch missing Discord files by itself.
+
- `share.media = false` makes publish/update/auto-update omit or skip restoring cached media; `subscribe --no-media` writes this for Git-only readers. With the default `share.media = true`, publish/update include cached non-DM media as gzip-compressed snapshot files, but publish does not fetch missing Discord files by itself.
- `[share.filter]` narrows only `publish` output; sync can still keep a richer local archive
- `share.filter.public_only` exports only channels visible to the guild
`@everyone` role after category/channel permission overwrites; private
docs/security.md: 1 addition & 1 deletion
@@ -42,7 +42,7 @@ Attachment binaries are not stored in SQLite. Only attachment metadata, optional
Set `sync.attachment_text = false` if you want to keep attachment metadata and filenames but disable attachment body fetches for text indexing.
-
Git snapshots include cached non-DM media files by default. Use `publish --no-media` to omit them. `publish` exports only files already in the local cache; it does not fetch missing Discord media. DM media under `@me` stays local-only.
+
Git snapshots include cached non-DM media files by default. Use `publish --no-media` to omit them. `publish` exports only files already in the local cache; it does not fetch missing Discord media. Published media is gzip-compressed under `media/`, while import still accepts older raw-media snapshots for backward compatibility. DM media under `@me` stays local-only.
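The rule that DM media under `@me` stays local-only can be pictured as a filter applied before export. The Python sketch below is an illustration under assumptions: the `CachedMedia` record and its fields are hypothetical and do not describe discrawl's actual schema or cache layout.

```python
from dataclasses import dataclass

DM_GUILD_ID = "@me"  # DMs are identified by "@me" in the text above


@dataclass(frozen=True)
class CachedMedia:
    """Hypothetical record for one file in the local media cache."""
    guild_id: str
    relative_path: str


def publishable(items: list[CachedMedia]) -> list[CachedMedia]:
    """Keep only non-DM media; anything under "@me" never leaves the machine."""
    return [item for item in items if item.guild_id != DM_GUILD_ID]
```

For example, given one item with `guild_id="@me"` and one with a regular guild ID, `publishable` returns only the second, so DM files are dropped before any snapshot writing happens.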