Skip to content

Releases: internetarchive/Zeno

v2.0.23

09 Apr 02:56
0b16c53

Choose a tag to compare

What's Changed

  • Fix pause controller race condition by @AltayAkkus in #553
  • refactor: hq source package select statements by @willmhowes in #579
  • chore(deps): bump the go-modules group with 2 updates by @dependabot[bot] in #576
  • chore(deps): bump golang from 1.26.0-alpine3.22 to 1.26.1-alpine3.22 in the ci group by @dependabot[bot] in #575
  • chore(deps): bump the go-modules group across 1 directory with 4 updates by @dependabot[bot] in #583
  • chore(deps): bump codecov/codecov-action from 5.5.2 to 5.5.3 in the github-actions group by @dependabot[bot] in #582
  • chore(deps): bump the go-modules group with 5 updates by @dependabot[bot] in #586
  • chore(deps): bump codecov/codecov-action from 5.5.3 to 6.0.0 in the github-actions group by @dependabot[bot] in #585
  • chore(deps): bump golang.org/x/image from 0.32.0 to 0.38.0 by @dependabot[bot] in #587

New Contributors

Full Changelog: v2.0.22...v2.0.23

v2.0.22

06 Mar 03:20
01c3fa6

Choose a tag to compare

What's Changed

  • workaround: add recover to prevent panic in PDF outlink extractor by @yzqzss in #565
  • Revert "workaround: add recover to prevent panic in PDF outlink extractor" by @yzqzss in #566
  • debug: api: enable pprof by default for convenience by @yzqzss in #568
  • fix: strict regex performance issue by @yzqzss in #570
  • chore(deps): bump github.com/hashicorp/consul/api from 1.33.2 to 1.33.3 in the go-modules group by @dependabot[bot] in #564
  • chore(deps): bump the go-modules group with 3 updates by @dependabot[bot] in #571

Full Changelog: v2.0.21...v2.0.22

v2.0.21

16 Feb 06:27
d475a60

Choose a tag to compare

What's Changed

  • Detect and ignore path loops by @NGTmeaty in #558
  • Add ItemFailed status when seencheck fails rather than panic by @NGTmeaty in #557
  • chore(deps): bump golang from 1.25.6-alpine3.22 to 1.25.7-alpine3.22 in the ci group by @dependabot[bot] in #559
  • chore(deps): bump the go-modules group across 1 directory with 8 updates by @dependabot[bot] in #561
  • chore(deps): bump golang from 1.25.7-alpine3.22 to 1.26.0-alpine3.22 in the ci group by @dependabot[bot] in #562

Full Changelog: v2.0.20...v2.0.21

v2.0.20

04 Feb 17:45
8d29277

Choose a tag to compare

What's Changed

  • feat: update gowarc by @NGTmeaty in #554

  • Use memory pools where needed to help reduce allocations by @NGTmeaty in #541

  • fix: connection is nil by @yzqzss in #545

  • adjust disk space threshold for CI/CD environments by @yzqzss in #546

  • chore(deps): bump the go-modules group with 2 updates by @dependabot[bot] in #534

  • chore(deps): bump codecov/codecov-action from 5.5.1 to 5.5.2 in the github-actions group by @dependabot[bot] in #533

  • chore(deps): bump the go-modules group with 2 updates by @dependabot[bot] in #537

  • chore(deps): bump the go-modules group with 4 updates by @dependabot[bot] in #544

  • chore(deps): bump mlugg/setup-zig from 2.1.0 to 2.2.0 in the github-actions group by @dependabot[bot] in #543

  • chore(deps): bump mlugg/setup-zig from 2.2.0 to 2.2.1 in the github-actions group by @dependabot[bot] in #549

  • chore(deps): bump the go-modules group across 1 directory with 5 updates by @dependabot[bot] in #550

  • chore(deps): bump golang from 1.25.5-alpine3.22 to 1.26rc2-alpine3.22 in the ci group by @dependabot[bot] in #547

Full Changelog: v2.0.19...v2.0.20

v2.0.19

10 Dec 18:38
73cf7ad

Choose a tag to compare

What's Changed

Full Changelog: v2.0.18...v2.0.19

v2.0.18

07 Nov 22:26
73c7dde

Choose a tag to compare

What's Changed

  • chore(deps): bump golang from 1.25.1-alpine3.22 to 1.25.2-alpine3.22 in the ci group by @dependabot[bot] in #507
  • chore(deps): bump golang from 1.25.2-alpine3.22 to 1.25.3-alpine3.22 in the ci group by @dependabot[bot] in #508
  • fix: "error while reseting" by @yzqzss in #512
  • Add Zeno get list command for archiving URLs from files by @Copilot in #514

Full Changelog: v2.0.17...v2.0.18

v2.0.17

07 Oct 00:27
33782fc

Choose a tag to compare

What's Changed

  • Make Zeno stop automatically when local queue crawl is finished by @Copilot in #496
  • chore(deps): bump the go-modules group with 3 updates by @dependabot[bot] in #494
  • Revert #465 by @NGTmeaty in #503
  • chore(deps): bump the go-modules group with 2 updates by @dependabot[bot] in #505

Full Changelog: v2.0.16...v2.0.17

v2.0.16

23 Sep 22:57
1f0d58f

Choose a tag to compare

What's Changed

  • Make Zeno to be signalable by HQ via websocket by @yzqzss in #429
  • Add support for confirmed messages through WS by @NGTmeaty in #451
  • More versatile HQ client init by @vbanos in #442
  • chore(deps): bump github.com/ulikunitz/xz from 0.5.12 to 0.5.14 in the go_modules group by @dependabot[bot] in #454
  • Add support for Sentry by @NGTmeaty in #456
  • Bump gowarc to v0.8.88 by @CorentinB in #457
  • dependency: upgrade go-css-lexer to v0.1.0 by @yzqzss in #423
  • Optimize domainscrawl domain matching by @CorentinB in #452
  • Fix domainscrawl race in tests by @CorentinB in #459
  • Add comprehensive GitHub Copilot instructions with development best practices for Zeno web crawler by @Copilot in #461
  • Add GCC 12+ requirement documentation with Ada URL parser context by @Copilot in #463
  • refactor TestPauseResumeE2E with synctest for accuracy by @yzqzss in #466
  • Fix config validation to report errors for invalid configuration files by @Copilot in #458
  • chore(deps): bump the github-actions group with 2 updates by @dependabot[bot] in #467
  • Simplify HQ websocket functions by @vbanos in #469
  • chore(deps): bump the go-modules group with 10 updates by @dependabot[bot] in #468
  • refine preprocess error reporting by @vbanos in #474
  • Refactor finisher by @vbanos in #476
  • Return error instead of using panic() in the Reactor by @vbanos in #475
  • Switch default digest algorithm from SHA1 to BLAKE3 by @CorentinB in #450
  • Optionally send outlinks to 2nd HQ project by @vbanos in #465
  • chore(deps): bump the go-modules group with 5 updates by @dependabot[bot] in #478
  • fix(docker): upgrades go version to 1.25.1 by @wilsoniya in #480
  • Remove duplicate data-srcset by @vbanos in #486
  • Refactor item errors by @vbanos in #484
  • Optimise encodeNonUTF8QueryURLs by @vbanos in #485
  • Improve startPipeline error handling by @vbanos in #481
  • Expose --max-outlinks by @yzqzss in #487
  • Replace context.TODO() with the current component's context by @vbanos in #488
  • Drop panic from preprocessor seencheck by @vbanos in #489
  • Return errors in config init instead of os.Exit by @vbanos in #483
  • Improve Archiver error reporting by @vbanos in #482
  • fix: ensure LQ context is cancelled after seeds are reset. by @NGTmeaty in #493
  • Order local queue URLs by number of hops and timestamp by @NGTmeaty in #490

New Contributors

  • @Copilot made their first contribution in #461
  • @wilsoniya made their first contribution in #480

Full Changelog: v2.0.15...v2.0.16

v2.0.15

26 Aug 23:21
2ae11a9

Choose a tag to compare

What's Changed

  • Headless/Headful mode by @yzqzss in #356

  • Add --warc-digest-algorithm by @CorentinB in #449

  • Add faster domain lookup for domainscrawl by @CorentinB in #445

  • Simplify HTML extractor logging by @vbanos in #417

  • Simplify Archiver logging by @vbanos in #418

  • Simplify Preprocessor logging by @vbanos in #419

  • Add max outlinks option by @vbanos in #420

  • Drop unused IncRedirects URL method by @vbanos in #426

  • Use sync.Map in RateBucket instead of manual locking by @vbanos in #427

  • Move reddit.com asset URL fix to SanitizeAssetsOutlinks by @vbanos in #416

  • Redundant newItem.SetStatus(models.ItemFresh) by @vbanos in #425

  • Simplify postprocessor logging by @vbanos in #424

  • Minor code dedup in Prometheus stats by @vbanos in #432

  • Remove multiple redundant stats.Init() by @vbanos in #430

  • Remove multiple redundant log.Start() by @vbanos in #431

  • Add live reload of exclusion lists (and switch to go 1.25) by @CorentinB in #428

  • Avoid creating a fresh context for each log call by @vbanos in #434

  • Modernize code by @CorentinB in #433

  • Create rootContext and share with components by @vbanos in #435

  • Revert "Create rootContext and share with components" by @yzqzss in #436

  • chore(deps): bump actions/checkout from 4 to 5 in the github-actions group by @dependabot[bot] in #438

  • chore(deps): bump the go-modules group across 1 directory with 3 updates by @dependabot[bot] in #439

  • Add CF challenge stat by @NGTmeaty in #437

  • chore(deps): bump NGTmeaty/go-release-action from 1 to 2 in the github-actions group by @dependabot[bot] in #440

  • refactor(test): replace the socket logging with net.Pipe() by @yzqzss in #441

  • fix multiLogger data race by @yzqzss in #446

  • chore(deps): bump the go-modules group with 2 updates by @dependabot[bot] in #448

  • chore(deps): bump codecov/codecov-action from 5.4.3 to 5.5.0 in the github-actions group by @dependabot[bot] in #447

  • chore(deps): bump github.com/go-viper/mapstructure/v2 from 2.3.0 to 2.4.0 in the go_modules group by @dependabot[bot] in #444

Full Changelog: v2.0.14...v2.0.15

v2.0.14

06 Aug 17:09
f160c5b

Choose a tag to compare

What's Changed

  • New CSS parser with go-css-lexer by @yzqzss in #415
  • Improve extractLinksFromPage() to process line by line by @yzqzss in #410
  • Exit with retcode by @yzqzss in #402
  • chore(deps): bump the go-modules group with 4 updates by @dependabot[bot] in #405
  • Less "feed is empty" logs by @yzqzss in #398
  • E2E testing and start tracking coverage by @yzqzss in #403
  • chore(deps): bump github.com/go-logfmt/logfmt from 0.4.0 to 0.6.0 in the go-modules group by @dependabot[bot] in #407
  • chore(deps): bump mlugg/setup-zig from 2.0.4 to 2.0.5 in the github-actions group by @dependabot[bot] in #408
  • fix: double hops by @yzqzss in #409
  • handle non-utf8 html document and their URLs by @yzqzss in #370
  • chore(deps): bump the go-modules group with 2 updates by @dependabot[bot] in #412
  • Optimize extractLinksFromPage by splitting words instead of lines by @yzqzss in #414

Full Changelog: v2.0.13...v2.0.14