Skip to content
This repository has been archived by the owner on Oct 19, 2024. It is now read-only.
This repository has been archived by the owner on Oct 19, 2024. It is now read-only.

python3 -m alpa.test_install error #974

Open
@sandy99-w

Description

Please describe the bug
python3 -m alpa.test_install 运行失败
Please describe the expected behavior

System information and environment

  • OS Platform and Distribution (e.g., Linux Ubuntu 16.04, docker): ubuntu 20.04 docker
  • Python version: 3.9.0
  • CUDA version: 11.2
  • NCCL version:
  • cupy version:
  • GPU model and memory: H800 80G
  • Alpa version: alpa==1.0.0.dev0
  • TensorFlow version:
  • JAX version: jaxlib-0.3.22

To Reproduce
Steps to reproduce the behavior:
Image

Screenshots
If applicable, add screenshots to help explain your problem.

按照相同的配置在A40上安装成功,但是H800上install test失败,参考alpa官方提供的install流程通过源码安装,代码取的master分支;
尝试升级了cuda版本,升级版本后alpa编译失败:
Image

Code snippet to reproduce the problem

Additional information
Add any other context about the problem here or include any logs that would be helpful to diagnose the problem.

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions