Better Closed Caption Parsing and Indexing Changes
We have made it a new minor version!
This release include two changes:
- Better / safer WebVTT parsing (specifically to address Boston closed caption files) -- thanks to @isaacna
- The index pipeline has been split into two. This is a breaking change and deployment maintainers should update their deployment repositories with the
make update-from-cookiecutter
to pull in these changes.
What's Changed
- bugfix/fix-newline-webvtt-sentence-segmentation by @isaacna in #193
- feature/reduce-and-fan-ngram-index by @JacksonMaxfield in #189
Full Changelog: v3.0.16...v3.1.0