Skip to content

feat: Add CUDA 13.2 Dockerfile#78673

Closed
gouzil wants to merge 4 commits intoPaddlePaddle:developfrom
gouzil:feat/add_cuda132_dockerfile
Closed

feat: Add CUDA 13.2 Dockerfile#78673
gouzil wants to merge 4 commits intoPaddlePaddle:developfrom
gouzil:feat/add_cuda132_dockerfile

Conversation

@gouzil
Copy link
Copy Markdown
Member

@gouzil gouzil commented Apr 14, 2026

PR Category

Execute Infrastructure

PR Types

Devs

Description

添加 CUDA 13.2 基础镜像

是否引起精度变化

Copilot AI review requested due to automatic review settings April 14, 2026 09:35
@paddle-bot
Copy link
Copy Markdown

paddle-bot Bot commented Apr 14, 2026

你的PR提交成功,感谢你对开源项目的贡献!
请关注后续CI自动化测试结果,详情请参考Paddle-CI手册
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

@paddle-bot paddle-bot Bot added the contributor External developers label Apr 14, 2026
@gouzil gouzil requested a review from swgu98 April 14, 2026 09:35
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Adds a new manylinux build Dockerfile targeting CUDA 13.2 to support building Paddle binaries on a CUDA 13.2 + cuDNN devel Ubuntu 24.04 base image.

Changes:

  • Introduces tools/dockerfile/manylinux/Dockerfile-132 based on nvcr.io/nvidia/cuda:13.2.0-cudnn-devel-ubuntu24.04.
  • Sets Python version to 3.12 and uses pip --break-system-packages when installing build/runtime Python dependencies.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment on lines +44 to +45
WORKDIR /home
RUN wget -q https://cmake.org/files/v3.31/cmake-3.31.0-linux-x86_64.tar.gz && \
Copy link

Copilot AI Apr 14, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

CMake is downloaded and extracted without any integrity verification (checksum/signature). To reduce supply-chain risk and improve reproducibility, pin and verify the tarball (e.g., SHA256) before extracting, or install CMake via a trusted package source.

Suggested change
WORKDIR /home
RUN wget -q https://cmake.org/files/v3.31/cmake-3.31.0-linux-x86_64.tar.gz && \
WORKDIR /home
ARG CMAKE_SHA256=<PINNED_CMAKE_3_31_0_LINUX_X86_64_TAR_GZ_SHA256>
RUN wget -q https://cmake.org/files/v3.31/cmake-3.31.0-linux-x86_64.tar.gz && \
echo "${CMAKE_SHA256} cmake-3.31.0-linux-x86_64.tar.gz" | sha256sum -c - && \

Copilot uses AI. Check for mistakes.
Comment thread tools/dockerfile/manylinux/Dockerfile-132 Outdated
@@ -0,0 +1,71 @@
# A image for building paddle binaries
Copy link

Copilot AI Apr 14, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Header comment grammar: "A image" should be "An image".

Suggested change
# A image for building paddle binaries
# An image for building paddle binaries

Copilot uses AI. Check for mistakes.

ENV HOME /root

RUN apt-get update --allow-unauthenticated && \
Copy link

Copilot AI Apr 14, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

apt-get update --allow-unauthenticated disables APT package signature verification, which weakens supply-chain security for this build image. If not strictly required, drop --allow-unauthenticated and fix any key/cert issues instead (or explicitly add the needed repository keys).

Suggested change
RUN apt-get update --allow-unauthenticated && \
RUN apt-get update && \

Copilot uses AI. Check for mistakes.
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
@gouzil gouzil requested review from SigureMo and risemeup1 April 14, 2026 13:05
# ENV variables
ARG WITH_GPU
ARG WITH_AVX
ARG PYTHON_VERSION=3.12
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这个镜像里是不是直接 3.14?@swgu98

Copy link
Copy Markdown
Member Author

@gouzil gouzil Apr 14, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

得加 ppa:deadsnakes/ppa 源才能上 3.14

@SigureMo
Copy link
Copy Markdown
Member

fleet CI 需要 merge 下最新 develop

@SigureMo
Copy link
Copy Markdown
Member

CUDA 13.2 解决编译问题可能也要解决类似 #78712 的问题,比如 setup.py 里要加 13.2,注意从 13.0 开始我们要支持 arm,不能限定 x86

@gouzil
Copy link
Copy Markdown
Member Author

gouzil commented Apr 18, 2026

CUDA 13.2 解决编译问题可能也要解决类似 #78712 的问题,比如 setup.py 里要加 13.2,注意从 13.0 开始我们要支持 arm,不能限定 x86

发台机子来让我测测 [旺柴]

那其实这里直接下载 cmake 可能不太合适,得用 apt 或者 pip 装,还有 bcebos 上的 ccache

Comment on lines +41 to +42
libnccl2=2.29.7-1+cuda13.2 \
libnccl-dev=2.29.7-1+cuda13.2 && \
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nccl 应该打在镜像里吗?这里出问题,是不是因为 setup.py 里没有加 nvidia-nccl-cu13 依赖?@swgu98 @risemeup1

@gouzil
Copy link
Copy Markdown
Member Author

gouzil commented Apr 23, 2026

see: #78720

@gouzil gouzil closed this Apr 23, 2026
@SigureMo SigureMo deleted the feat/add_cuda132_dockerfile branch April 23, 2026 03:38
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

contributor External developers

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants