Skip to content

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models#6553

Open
gyou2021 wants to merge 6 commits intomicrosoft:masterfrom gyou2021:configurable_autoTP

Commits

Commits on Oct 25, 2024

Commits on Jan 8, 2025

Commits on Jan 13, 2025

Commits on Jan 17, 2025