Commit 3185731

remove duplicate content & add performance diagram
Signed-off-by: Rein Yang <ruiruyang2@gmail.com>
1 parent 50b8f35 commit 3185731

6 files changed: 73 additions & 322 deletions

docs/design/feature/async_chunk_design.md

Lines changed: 71 additions & 319 deletions
Large diffs are not rendered by default.

Four further files, not rendered: 50.6 KB, 151 KB, 144 KB, 143 KB

vllm_omni/benchmarks/patch/patch.py

Lines changed: 2 additions & 3 deletions
@@ -150,9 +150,8 @@ async def async_request_openai_chat_omni_completions(
         generated_text += content or ""
     elif modality == "audio":
         if output.audio_ttfp == 0.0:
-            audio_first_timestamp = timestamp
-            output.audio_ttfp = timestamp - st
-        audio_generate_time = timestamp - audio_first_timestamp
+            output.audio_ttfp = timestamp - st
+        audio_generate_time = timestamp - st
         if content != "":
             audio_bytes = base64.b64decode(content)
             seg = AudioSegment.from_file(io.BytesIO(audio_bytes))
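
The effect of this hunk on the two audio metrics can be illustrated with a small standalone sketch. It is not code from `patch.py`: the `AudioTiming` container and the `measure_audio_timing` helper are hypothetical names introduced for illustration, and it assumes `audio_generate_time` is refreshed on every audio chunk rather than only on the first (the rendered diff does not preserve indentation, so this is a guess).

```python
from dataclasses import dataclass


@dataclass
class AudioTiming:
    """Hypothetical container for illustration; not part of patch.py."""
    audio_ttfp: float = 0.0           # time to first audio packet, in seconds
    audio_generate_time: float = 0.0  # request start to latest audio chunk, in seconds


def measure_audio_timing(st, chunk_timestamps):
    """Apply the post-commit arithmetic to a list of audio chunk arrival times."""
    timing = AudioTiming()
    for timestamp in chunk_timestamps:
        if timing.audio_ttfp == 0.0:
            # First audio chunk: latency measured from the request start `st`.
            timing.audio_ttfp = timestamp - st
        # Assumed to run for every chunk: elapsed time since the request start.
        # Before the commit this was measured from the first audio chunk.
        timing.audio_generate_time = timestamp - st
    return timing


timing = measure_audio_timing(st=10.0, chunk_timestamps=[10.5, 11.0, 12.0])
print(timing.audio_ttfp, timing.audio_generate_time)  # 0.5 2.0
```

With the same inputs, the pre-commit formula would have reported a generate time of 12.0 - 10.5 = 1.5, because it excluded the wait before the first audio chunk.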
