Releases: iipc/jwarc
Releases · iipc/jwarc
0.16.4
v0.16.3: Release 0.16.3
Bugs fixed
- The ARC parser now accepts non-ASCII characters in URIs. (The ARC
format was fairly loosely defined so we've been progressively
relaxing the grammar as we discovered counter-examples.)
v0.16.2: Release 0.16.2
Bugs fixed
- Fixed calculation of position for uncompressed (W)ARCs when the trailer is not 4 bytes
- ARC parser now accepts any character except CTLs and spaces in the URL field
v0.16.1: Release 0.16.1
Bug fixes
- The ARC parser now tries to recover if the trailer is missing
- The ARC parser now copes with the MIME field being missing or a single token
- Lenient HTTP parser now ignores multiple CRs at the end of header lines
- CDXTool now skips the record and continues if it fails to parse the HTTP message
v0.16.0: Release 0.16.0
New features
- WarcReader will now emit a warning and attempt to recover when encountering a record with a missing or truncated trailer
- Added WarcReader.onWarning(handler) which can be used to report recoverable errors
Bugs fixed
- The ARC parser now handles the special value "no-type" in the MIME field
- The ARC parser now accepts URLs containing "[" or "]"
v0.15.0: Release 0.15.0
New features:
- Added validate tool which checks parse errors, validating digests and other headers #60 (Sebastian Nagel)
- WarcReader gained a calculateBlockDigest() mode which populates a corresponding WarcRecord.calculatedBlockDigest() #60 (Sebastian Nagel)
- WarcDigest: SHA-2 support, base64 and encoding auto-detection #59 (Sebastian Nagel)
Bugs fixed:
- Setting the record version after calling date() would produce the incorrect WARC-Date precision #58
- The lenient HTTP parser now accepts requests missing the HTTP version field (improves compatibility with the non-standard records produced by the ArchiveWeb.page browser extension)
v0.14.0: Release 0.14.0
New features:
- Saveback tool for reconstructing WARC records from replay systems
Bugs fixed:
- Replay proxy doesn't start because of sw.js file not found #57
v0.13.1: Release 0.13.1
Bugs fixed:
- GunzipChannel fails on payload with uncompressed size exceeding int_max #54 (Sebastian Nagel)
v0.13.0: Release 0.13.0
v0.12.0: Release 0.12.0
New features
- Added contains(name, value) to MessageHeaders for look for values in comma-list headers
Bugs fixed