|
1 | 1 | # v6.3.0 |
2 | 2 |
|
3 | | -## Key Features |
4 | | - |
5 | | -### Fix Rag Pipeline Events |
6 | | -This PR addresses issues with the RAG pipeline ingestion and deletion flow, improving the reliability of collection ID resolution. Key fixes include: |
7 | | -- Resolved a bug where pipelines storing a user-entered collection name (instead of UUID) as would fail to find the collection. Added a fallback lookup in . |
8 | | -- Fixed EventBus update mismatches on deployment. |
9 | | - |
10 | | -### RAG Citations Preview Side Panel |
11 | | -This enhancement introduces a new RAG citation preview side panel accessible from the Chat UI, improving how RAG citations appear in chat bubbles. Additionally, it adds a Dismiss all button for dismissing all notifications in a notification stack. |
12 | | - |
13 | | -### VLLM Updates |
14 | | -This update improves the VLLM image by: |
15 | | -- Updating the base VLLM image to use the latest deep-learning VLLM from AWS (public.ecr.aws/deep-learning-containers/vllm:0.15-gpu-py312-ec2) |
16 | | -- Adding explicit GPU settings for ECS host images, including adding the nvidia runtime to containers |
17 | | -- Adding memory reservation as a field for configuring images |
18 | | -- Properly setting tensor parallelization based on the number of GPUs |
19 | | -- Updating example model hosting configurations to tune for improved performance |
20 | | - |
21 | | -### Improved Loading Configuration Text Formatting |
22 | | -This change fixes the styling of the Loading Configuration... text to match the rest of the LISA UI by using Cloudscape's components. |
23 | | - |
24 | | -### Dockerfile Patching for Embedded Models |
25 | | -The Dockerfiles for embedding models (instructor, tei), text generation models (tgi), and VLLM were not running during the build process, meaning OS packages inherited from the base images were not updating when deploying. This change ensures that each time the images are built and pushed to the LISA ECR, the host OS packages are upgraded to their latest available versions. |
26 | | - |
27 | | -### Claude Code Setup Guide for LISA Serve Integration |
28 | | -This addition provides documentation on how to get Claude Code working with models hosted via LISA Serve. |
29 | | - |
30 | | -### Resolve max_tokens Handling for Non-Anthropic Models on Anthropic Routes |
31 | | -This fix resolves an issue where the parameter was not being properly handled for non-Anthropic models when using Anthropic routes. |
32 | | - |
33 | | -### Improved RAG PDF Parsing Quality |
34 | | -This fix improves the quality of text extraction from PDF documents ingested into the RAG pipeline. It addresses issues with excessive whitespace and the inclusion of invisible Unicode characters that degraded retrieval quality. |
35 | | - |
36 | | -### Dependency Updates |
37 | | -This PR updates various dependencies across the LISA codebase. |
| 3 | +## UI Updates |
| 4 | +- Added RAG citation document preview side panel in Chat UI |
| 5 | +- Exposed the document preview panel in the document library for viewing documents |
| 6 | +- Added "Dismiss all" button for notification stacks |
| 7 | +- Fixed "Loading Configuration..." text styling to match LISA UI using Cloudscape components |
| 8 | +- Added last updated date/time to session displays |
38 | 9 |
|
39 | | -### Removal of LISA Serve V1 Endpoints |
40 | | -These endpoints have been on the deprecation path for 2 years and have been dead code since LiteLLM was introduced to the baseline. This PR removes all V1 endpoints and the supporting infrastructure. |
| 10 | +## Other Key Changes |
| 11 | +- Updated VLLM image to latest AWS deep-learning base with GPU settings for ECS, memory reservation, and tensor parallelization from GPU count |
| 12 | +- Dockerfiles for embedding (instructor, tei), text generation (tgi), and VLLM now run OS package upgrades during build |
| 13 | +- Removed deprecated LISA Serve V1 endpoints and supporting infrastructure |
| 14 | +- Updated dependencies across the codebase |
41 | 15 |
|
42 | | -### UI Validation Fix |
43 | | -This fix addresses an issue with UI validation. |
| 16 | +## Bug Fixes |
| 17 | +- Fixed RAG pipeline collection ID resolution (find_by_id_or_name fallback) and EventBus update mismatches on deployment |
| 18 | +- Resolved max_tokens handling for non-Anthropic models on Anthropic routes |
| 19 | +- Improved RAG PDF parsing quality (excessive whitespace and invisible Unicode characters) |
| 20 | +- Addressed consistency of UI validation warnings for field format and required fields |
| 21 | +- Added missing required role for batch ingestion |
| 22 | +- Added cache clearing at login to prevent cache corruption issues |
44 | 23 |
|
45 | | -## Key Changes |
46 | | -- **Dependency Updates**: Updated various dependencies across the LISA codebase. |
47 | | -- **Dockerfile Patching**: Ensured OS packages are upgraded to their latest available versions during the build process for embedded models, text generation models, and VLLM. |
48 | | -- **RAG Pipeline Improvements**: Fixed issues with collection ID resolution and EventBus update mismatches in the RAG pipeline. |
49 | | -- **RAG Citations UI**: Introduced a new RAG citation preview side panel and a Dismiss all button for notifications. |
50 | | -- **VLLM Updates**: Improved the VLLM image with updated base, GPU settings, memory reservation, and tensor parallelization configurations. |
51 | | -- **Endpoint Removal**: Removed deprecated LISA Serve V1 endpoints. |
52 | | -- **UI Validation Fix**: Addressed an issue with UI validation. |
| 24 | +## Documentation |
| 25 | +- Added Claude Code setup guide for LISA Serve integration |
| 26 | +- Updated deployment guide |
53 | 27 |
|
54 | 28 | ## Acknowledgements |
55 | | -* @121983012+jmharold |
56 | | -* @32586639+gingerknight |
57 | | -* @49699333+dependabot[bot] |
58 | | -* @99225408+Ernest-Gray |
59 | 29 | * @bedanley |
| 30 | +* @Ernest-Gray |
60 | 31 | * @estohlmann |
| 32 | +* @gingerknight |
| 33 | +* @jmharold |
61 | 34 |
|
62 | 35 | **Full Changelog**: https://github.com/awslabs/LISA/compare/v6.2.1..v6.3.0 |
63 | 36 |
|
|
0 commit comments