This release finalized the removal of Python dependency and containment of tokenization/preprocessing in the UDS tokenization service, along with initial Otel tracing support.
What's Changed
- Switch to pre-built vLLM wheels for CPU builds by @sagearc in #268
- fix: Update storage connector to support vllm v0.14.1 by @kfirtoledo in #269
- Fix vllm ci by @hyeongyun0916 in #265
- Optimize get_file_name to improve lookup performance by @rshavitt in #264
- build: Add a wheel creation for the release by @kfirtoledo in #270
- Refactor: Replace daulet/tokenizers with vLLM tokenizer by @hyeongyun0916 in #254
- build: split wheel build into a separate GitHub Actions workflow by @kfirtoledo in #272
- cleanup: remove deprecated daulet tokenizer by @hyeongyun0916 in #274
- fix: Fix block_hash to use big-endian by @kfirtoledo in #282
- fix: Truncate prompt if
truncate_prompt_tokensset in request by @albertoperdomo2 in #281 - Test and document extra field type support by @albertoperdomo2 in #273
- fix: Fix CI wheels to support arm64 by @kfirtoledo in #279
- refactor: align tokenizer interface with vLLM architecture by @hyeongyun0916 in #278
- docs: Add usage example to UDS tokenization service by @sagearc in #294
- Remove deprecated prefixstore by @sagearc in #296
- misc: UDS server mock for testing and UDS tokenizer unit tests by @sagearc in #297
- feat: FS backend structured logger by @albertoperdomo2 in #300
- refactor: isolate embedded tokenizers behind build tags by @sagearc in #303
- build: enable image build for uds service by @zdtsw in #307
- 🌱 Standardize governance workflows, tooling, and Dependabot by @clubanderson in #311
- 🌱 Remove legacy typo and link checker workflows by @clubanderson in #324
- update(build): UDS dockerfile enhancement + github workflow by @zdtsw in #328
- ci: add service UDS tokenizer tests by @sagearc in #335
- test: e2e tests against uds (tcp) tokenization service by @sagearc in #334
- feat: configurable tokenizers directory by @sagearc in #340
- fix: markdown lint by @zdtsw in #338
- build: Add pre-commit checks to CI by @kfirtoledo in #342
- PVC Evictor by @guygir in #215
- Initial distributed tracing instrumentation by @sallyom in #48
- add @liu-cong to maintainers by @vMaroon in #360
- fix: increase gRPC keepalive ping interval to match server minimum by @pierDipi in #363
- update: change make target for tests to only run UDS and add support for podman by @zdtsw in #354
- fix: github action re-run not working + bump version by @zdtsw in #355
- feat: add map for UDS for local model by @zdtsw in #352
- deps(go): bump the kubernetes group with 3 updates by @dependabot[bot] in #350
- Remove services/ exclusion from pre-commit checks by @yankay in #362
- Ignore large reformatting PRs in git blame by @yankay in #364
New Contributors
- @rshavitt made their first contribution in #264
- @albertoperdomo2 made their first contribution in #281
- @zdtsw made their first contribution in #307
- @sallyom made their first contribution in #48
- @dependabot[bot] made their first contribution in #350
Full Changelog: v0.5.0...v0.6.0