Skip to content

有人测过Qwen2和Qwen2.5 1.5B下的推理速度差异吗?之前有套VLLM的架构发现从2变到2.5时延提高了,不知道是不是需要改一些东西 #1232

Unanswered
Harryjun asked this question in Q&A
Discussion options

You must be logged in to vote

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants