Skip to content

musa: refine MooreThreads device detection and assignment #936

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

yeahdongcn
Copy link
Contributor

Testing Done

root@worker36086:/ws# uv run ./ktransformers/ktransformers/util/vendors.py 
cpu_device: cpu
cpu_device2: cpu
gpu0: musa:0
x: tensor([[ 2.5273,  2.2146,  0.2072],
        [-0.7591, -0.6740, -0.5080],
        [-0.8306,  0.6573, -0.9665]])
x_gpu: tensor([[ 2.5273,  2.2146,  0.2072],
        [-0.7591, -0.6740, -0.5080],
        [-0.8306,  0.6573, -0.9665]], device='musa:0')
x_cpu1: tensor([[ 2.5273,  2.2146,  0.2072],
        [-0.7591, -0.6740, -0.5080],
        [-0.8306,  0.6573, -0.9665]])
x_cpu2: tensor([[ 2.5273,  2.2146,  0.2072],
        [-0.7591, -0.6740, -0.5080],
        [-0.8306,  0.6573, -0.9665]])

@yeungtuzi
Copy link

什么时候S4000能用起来啊

@Atream Atream requested a review from Azure-Tang April 2, 2025 07:01
@james0zan
Copy link
Member

什么时候S4000能用起来啊

应该快了,我们之前主要是要等 0.2.4 版本的发布。这个版本重构比较大,需要基于它 merge 多显卡支持。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants