Skip to content

feat(promptcompression): pluggable domain-aware NLP compression pipeline#1564

Draft
rootfs wants to merge 1 commit intovllm-project:mainfrom
rootfs:feat/pluggable-prompt-compression
Draft

feat(promptcompression): pluggable domain-aware NLP compression pipeline#1564
rootfs wants to merge 1 commit intovllm-project:mainfrom
rootfs:feat/pluggable-prompt-compression

Conversation

@rootfs
Copy link
Copy Markdown
Collaborator

@rootfs rootfs commented Mar 15, 2026

This feat splits the compression into pre-classification and pre-inference so to adopt different compression optimizers.

@rootfs rootfs requested a review from Xunzhuo as a code owner March 15, 2026 17:49
@netlify
Copy link
Copy Markdown

netlify bot commented Mar 15, 2026

Deploy Preview for vllm-semantic-router ready!

Name Link
🔨 Latest commit e25225d
🔍 Latest deploy log https://app.netlify.com/projects/vllm-semantic-router/deploys/69b6f12a2578b300082fa2f9
😎 Deploy Preview https://deploy-preview-1564--vllm-semantic-router.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

@github-actions
Copy link
Copy Markdown
Contributor

👥 vLLM Semantic Team Notification

The following members have been identified for the changed files in this PR and have been automatically assigned:

📁 config

Owners: @rootfs, @Xunzhuo
Files changed:

  • config/prompt-compression/coding.yaml
  • config/prompt-compression/default.yaml
  • config/prompt-compression/medical.yaml
  • config/prompt-compression/multi-turn.yaml
  • config/prompt-compression/security.yaml

📁 src

Owners: @rootfs, @Xunzhuo, @wangchen615
Files changed:

  • src/semantic-router/cmd/compressdemo/main.go
  • src/semantic-router/pkg/config/model_config_types.go
  • src/semantic-router/pkg/extproc/processor_req_body_routing.go
  • src/semantic-router/pkg/extproc/req_filter_inference_compression.go
  • src/semantic-router/pkg/extproc/router.go
  • src/semantic-router/pkg/extproc/router_build.go
  • src/semantic-router/pkg/promptcompression/README.md
  • src/semantic-router/pkg/promptcompression/compressor.go
  • src/semantic-router/pkg/promptcompression/config_claude_code.yaml
  • src/semantic-router/pkg/promptcompression/config_default.yaml
  • src/semantic-router/pkg/promptcompression/opt_builtin.go
  • src/semantic-router/pkg/promptcompression/opt_extensions.go
  • src/semantic-router/pkg/promptcompression/optimizer.go
  • src/semantic-router/pkg/promptcompression/pipeline.go
  • src/semantic-router/pkg/promptcompression/pipeline_selector.go
  • src/semantic-router/pkg/promptcompression/pipeline_test.go

vLLM

🎉 Thanks for your contributions!

This comment was automatically generated based on the OWNER files in the repository.

@rootfs rootfs marked this pull request as draft March 15, 2026 17:50
@Xunzhuo
Copy link
Copy Markdown
Member

Xunzhuo commented Mar 17, 2026

Sorry about this. #1553 introduced a large recent refactor, which caused conflicts here. Please rebase onto main and resolve them.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants