
Pull requests: tenstorrent/tt-inference-server


Pull requests list

H2D D2H sockets
#2119 opened Feb 14, 2026 by dmadicTT Draft
Add additional loggers
#2118 opened Feb 13, 2026 by fivanovicTT Draft
Add v1 api prefix
#2105 opened Feb 12, 2026 by vpetrovicTT Draft
Samt/uv fixed fea qb fix depend
#2102 opened Feb 12, 2026 by stisiTT Draft
Ben/v1
#2101 opened Feb 12, 2026 by bgoelTT Draft
Idjuric/test dp2
#2100 opened Feb 12, 2026 by idjuricTT
Dmadic tt/h2d d2h model runner
#2099 opened Feb 12, 2026 by dmadicTT Draft
Use Whisper DP2 by default
#2097 opened Feb 11, 2026 by idjuricTT
cpp_server connect vllm e2e
#2092 opened Feb 11, 2026 by knovokmetTT
Remove TTNN from C++ server Docker image
#2073 opened Feb 10, 2026 by ztorlakTT
Allow Flux to use inference steps 4
#2062 opened Feb 10, 2026 by vpetrovicTT
Refactor of eval tests
#2033 opened Feb 6, 2026 by vpetrovicTT Draft
Fix build
#2028 opened Feb 6, 2026 by idjuricTT
enable eval forge llm
#2018 opened Feb 5, 2026 by knovokmetTT Draft
Flux motif eval test
#2017 opened Feb 5, 2026 by vpetrovicTT
GPT-OSS evals and RAM fix (labels: bug, Models CI onboarding)
#1941 opened Jan 29, 2026 by stisiTT Draft
Dmadic tt/prefill server poc
#1936 opened Jan 29, 2026 by dmadicTT Draft
Uplift for QWen3-8B
#1885 opened Jan 26, 2026 by sott0n