Skip to content

What's the future plan for torchchat serving #1491

Open
@jenniew

Description

I see current torchchat serving provides basic serving function. I'm wondering what the future plan for serving. What's the target of torchchat serve? Will it provide more optimized and high performance serving features(like Continuous batching, prefix-caching, chunked prefill, etc.)

Activity

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Assignees

Labels

enhancementNew feature or requesttriagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate module

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions