Better Closed Caption Parsing and Indexing Changes

@evamaxfield evamaxfield released this 15 Jun 18:07
· 150 commits to main since this release

This release is a new minor version!

This release includes two changes:

  1. Safer WebVTT parsing, specifically to handle Boston closed caption files -- thanks to @isaacna
  2. The index pipeline has been split into two. This is a breaking change: deployment maintainers should run make update-from-cookiecutter in their deployment repositories to pull in these changes.
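
To illustrate the kind of problem the WebVTT change addresses, here is a minimal sketch of newline-tolerant caption handling. It is not the project's actual parser: the function names (cue_texts, sentences), the sample captions, and the regex-based sentence splitter are all assumptions made for this example. The idea is to join the lines of each cue with spaces before splitting into sentences, so a line break inside a cue never cuts a sentence in half.

```python
import re

# Hypothetical sample captions, written for this example only.
SAMPLE_VTT = """WEBVTT

00:00:01.000 --> 00:00:04.000
Good evening and welcome
to the council meeting.

00:00:04.500 --> 00:00:07.000
We will begin with
public comment. Thank you.
"""

# A cue timing line; the hours field is optional in WebVTT.
TIMING = re.compile(
    r"^(?:\d{2,}:)?\d{2}:\d{2}\.\d{3}\s+-->\s+(?:\d{2,}:)?\d{2}:\d{2}\.\d{3}"
)


def cue_texts(vtt: str) -> list[str]:
    """Return one string per cue, with internal newlines collapsed to spaces."""
    texts = []
    # Blocks (header, cues) are separated by blank lines.
    for block in re.split(r"\n\s*\n", vtt.strip()):
        lines = block.splitlines()
        for i, line in enumerate(lines):
            if TIMING.match(line):
                # Everything after the timing line is the cue payload.
                parts = [p.strip() for p in lines[i + 1:] if p.strip()]
                if parts:
                    texts.append(" ".join(parts))
                break
    return texts


def sentences(vtt: str) -> list[str]:
    """Naive sentence segmentation over the joined cue text (assumed approach)."""
    joined = " ".join(cue_texts(vtt))
    # Split after ., ! or ? when followed by whitespace.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", joined) if s.strip()]


if __name__ == "__main__":
    for sentence in sentences(SAMPLE_VTT):
        print(sentence)
```

A real deployment would likely rely on a dedicated WebVTT library and a proper sentence tokenizer; the regex splitter here only keeps the sketch dependency-free.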

What's Changed

  • bugfix/fix-newline-webvtt-sentence-segmentation by @isaacna in #193
  • feature/reduce-and-fan-ngram-index by @JacksonMaxfield in #189

Full Changelog: v3.0.16...v3.1.0