Releases: internetarchive/Zeno
v2.0.23
What's Changed
- Fix pause controller race condition by @AltayAkkus in #553
- refactor: hq source package select statements by @willmhowes in #579
- chore(deps): bump the go-modules group with 2 updates by @dependabot[bot] in #576
- chore(deps): bump golang from 1.26.0-alpine3.22 to 1.26.1-alpine3.22 in the ci group by @dependabot[bot] in #575
- chore(deps): bump the go-modules group across 1 directory with 4 updates by @dependabot[bot] in #583
- chore(deps): bump codecov/codecov-action from 5.5.2 to 5.5.3 in the github-actions group by @dependabot[bot] in #582
- chore(deps): bump the go-modules group with 5 updates by @dependabot[bot] in #586
- chore(deps): bump codecov/codecov-action from 5.5.3 to 6.0.0 in the github-actions group by @dependabot[bot] in #585
- chore(deps): bump golang.org/x/image from 0.32.0 to 0.38.0 by @dependabot[bot] in #587
New Contributors
- @AltayAkkus made their first contribution in #553
Full Changelog: v2.0.22...v2.0.23
v2.0.22
What's Changed
- workaround: add recover to prevent panic in PDF outlink extractor by @yzqzss in #565
- Revert "workaround: add recover to prevent panic in PDF outlink extractor" by @yzqzss in #566
- debug: api: enable pprof by default for convenience by @yzqzss in #568
- fix: strict regex performance issue by @yzqzss in #570
- chore(deps): bump github.com/hashicorp/consul/api from 1.33.2 to 1.33.3 in the go-modules group by @dependabot[bot] in #564
- chore(deps): bump the go-modules group with 3 updates by @dependabot[bot] in #571
Full Changelog: v2.0.21...v2.0.22
v2.0.21
What's Changed
- Detect and ignore path loops by @NGTmeaty in #558
- Add ItemFailed status when seencheck fails rather than panic by @NGTmeaty in #557
- chore(deps): bump golang from 1.25.6-alpine3.22 to 1.25.7-alpine3.22 in the ci group by @dependabot[bot] in #559
- chore(deps): bump the go-modules group across 1 directory with 8 updates by @dependabot[bot] in #561
- chore(deps): bump golang from 1.25.7-alpine3.22 to 1.26.0-alpine3.22 in the ci group by @dependabot[bot] in #562
Full Changelog: v2.0.20...v2.0.21
v2.0.20
What's Changed
- Use memory pools where needed to help reduce allocations by @NGTmeaty in #541
- adjust disk space threshold for CI/CD environments by @yzqzss in #546
- chore(deps): bump the go-modules group with 2 updates by @dependabot[bot] in #534
- chore(deps): bump codecov/codecov-action from 5.5.1 to 5.5.2 in the github-actions group by @dependabot[bot] in #533
- chore(deps): bump the go-modules group with 2 updates by @dependabot[bot] in #537
- chore(deps): bump the go-modules group with 4 updates by @dependabot[bot] in #544
- chore(deps): bump mlugg/setup-zig from 2.1.0 to 2.2.0 in the github-actions group by @dependabot[bot] in #543
- chore(deps): bump mlugg/setup-zig from 2.2.0 to 2.2.1 in the github-actions group by @dependabot[bot] in #549
- chore(deps): bump the go-modules group across 1 directory with 5 updates by @dependabot[bot] in #550
- chore(deps): bump golang from 1.25.5-alpine3.22 to 1.26rc2-alpine3.22 in the ci group by @dependabot[bot] in #547
Full Changelog: v2.0.19...v2.0.20
v2.0.19
v2.0.18
What's Changed
- chore(deps): bump golang from 1.25.1-alpine3.22 to 1.25.2-alpine3.22 in the ci group by @dependabot[bot] in #507
- chore(deps): bump golang from 1.25.2-alpine3.22 to 1.25.3-alpine3.22 in the ci group by @dependabot[bot] in #508
- fix: "error while reseting" by @yzqzss in #512
- Add Zeno get list command for archiving URLs from files by @Copilot in #514
Full Changelog: v2.0.17...v2.0.18
v2.0.17
What's Changed
- Make Zeno stop automatically when local queue crawl is finished by @Copilot in #496
- chore(deps): bump the go-modules group with 3 updates by @dependabot[bot] in #494
- Revert #465 by @NGTmeaty in #503
- chore(deps): bump the go-modules group with 2 updates by @dependabot[bot] in #505
Full Changelog: v2.0.16...v2.0.17
v2.0.16
What's Changed
- Make Zeno to be signalable by HQ via websocket by @yzqzss in #429
- Add support for confirmed messages through WS by @NGTmeaty in #451
- More versatile HQ client init by @vbanos in #442
- chore(deps): bump github.com/ulikunitz/xz from 0.5.12 to 0.5.14 in the go_modules group by @dependabot[bot] in #454
- Add support for Sentry by @NGTmeaty in #456
- Bump gowarc to v0.8.88 by @CorentinB in #457
- dependency: upgrade go-css-lexer to v0.1.0 by @yzqzss in #423
- Optimize domainscrawl domain matching by @CorentinB in #452
- Fix domainscrawl race in tests by @CorentinB in #459
- Add comprehensive GitHub Copilot instructions with development best practices for Zeno web crawler by @Copilot in #461
- Add GCC 12+ requirement documentation with Ada URL parser context by @Copilot in #463
- refactor TestPauseResumeE2E with synctest for accuracy by @yzqzss in #466
- Fix config validation to report errors for invalid configuration files by @Copilot in #458
- chore(deps): bump the github-actions group with 2 updates by @dependabot[bot] in #467
- Simplify HQ websocket functions by @vbanos in #469
- chore(deps): bump the go-modules group with 10 updates by @dependabot[bot] in #468
- refine preprocess error reporting by @vbanos in #474
- Refactor finisher by @vbanos in #476
- Return error instead of using panic() in the Reactor by @vbanos in #475
- Switch default digest algorithm from SHA1 to BLAKE3 by @CorentinB in #450
- Optionally send outlinks to 2nd HQ project by @vbanos in #465
- chore(deps): bump the go-modules group with 5 updates by @dependabot[bot] in #478
- fix(docker): upgrades go version to 1.25.1 by @wilsoniya in #480
- Remove duplicate data-srcset by @vbanos in #486
- Refactor item errors by @vbanos in #484
- Optimise encodeNonUTF8QueryURLs by @vbanos in #485
- Improve startPipeline error handling by @vbanos in #481
- Expose --max-outlinks by @yzqzss in #487
- Replace context.TODO() with the current component's context by @vbanos in #488
- Drop panic from preprocessor seencheck by @vbanos in #489
- Return errors in config init instead of os.Exit by @vbanos in #483
- Improve Archiver error reporting by @vbanos in #482
- fix: ensure LQ context is cancelled after seeds are reset. by @NGTmeaty in #493
- Order local queue URLs by number of hops and timestamp by @NGTmeaty in #490
New Contributors
- @Copilot made their first contribution in #461
- @wilsoniya made their first contribution in #480
Full Changelog: v2.0.15...v2.0.16
v2.0.15
What's Changed
- Add --warc-digest-algorithm by @CorentinB in #449
- Add faster domain lookup for domainscrawl by @CorentinB in #445
- Use sync.Map in RateBucket instead of manual locking by @vbanos in #427
- Move reddit.com asset URL fix to SanitizeAssetsOutlinks by @vbanos in #416
- Redundant newItem.SetStatus(models.ItemFresh) by @vbanos in #425
- Add live reload of exclusion lists (and switch to go 1.25) by @CorentinB in #428
- Avoid creating a fresh context for each log call by @vbanos in #434
- Modernize code by @CorentinB in #433
- Create rootContext and share with components by @vbanos in #435
- Revert "Create rootContext and share with components" by @yzqzss in #436
- chore(deps): bump actions/checkout from 4 to 5 in the github-actions group by @dependabot[bot] in #438
- chore(deps): bump the go-modules group across 1 directory with 3 updates by @dependabot[bot] in #439
- chore(deps): bump NGTmeaty/go-release-action from 1 to 2 in the github-actions group by @dependabot[bot] in #440
- refactor(test): replace the socket logging with net.Pipe() by @yzqzss in #441
- chore(deps): bump the go-modules group with 2 updates by @dependabot[bot] in #448
- chore(deps): bump codecov/codecov-action from 5.4.3 to 5.5.0 in the github-actions group by @dependabot[bot] in #447
- chore(deps): bump github.com/go-viper/mapstructure/v2 from 2.3.0 to 2.4.0 in the go_modules group by @dependabot[bot] in #444
Full Changelog: v2.0.14...v2.0.15
v2.0.14
What's Changed
- New CSS parser with go-css-lexer by @yzqzss in #415
- Improve extractLinksFromPage() to process line by line by @yzqzss in #410
- Exit with retcode by @yzqzss in #402
- chore(deps): bump the go-modules group with 4 updates by @dependabot[bot] in #405
- Less "feed is empty" logs by @yzqzss in #398
- E2E testing and start tracking coverage by @yzqzss in #403
- chore(deps): bump github.com/go-logfmt/logfmt from 0.4.0 to 0.6.0 in the go-modules group by @dependabot[bot] in #407
- chore(deps): bump mlugg/setup-zig from 2.0.4 to 2.0.5 in the github-actions group by @dependabot[bot] in #408
- fix: double hops by @yzqzss in #409
- handle non-utf8 html document and their URLs by @yzqzss in #370
- chore(deps): bump the go-modules group with 2 updates by @dependabot[bot] in #412
- Optimize extractLinksFromPage by splitting words instead of lines by @yzqzss in #414
Full Changelog: v2.0.13...v2.0.14