Disable NVIDIA_TF32_OVERRIDE by default for better precision. by A-nnonymous · Pull Request #75476 · PaddlePaddle/Paddle

A-nnonymous · 2025-09-23T09:07:08Z

PR Category

Operator Mechanism

PR Types

Bug fixes

Description

默认不开启cublas中的tf32 策略。
修改前flag NVIDIA_TF32_OVERRIDE在CUBLAS等库里默认为1，会引入tf32来加速fp32，牺牲精度。
现与torch行为对齐，参考链接如下:

监控结果：

避免FP32 gemm或不带bias的linear 中，13位尾数裁剪带来的大幅精度损失。后续需要进行更细粒度的类型管控。

pcard-93348

A-nnonymous · 2025-09-24T02:49:01Z

/re-run all-failed

codecov-commenter · 2025-09-24T05:40:09Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
⚠️ Please upload report for BASE (develop@2dfc418). Learn more about missing BASE report.

Additional details and impacted files

@@             Coverage Diff             @@
##             develop    #75476   +/-   ##
===========================================
  Coverage           ?   100.00%           
===========================================
  Files              ?         1           
  Lines              ?         2           
  Branches           ?         0           
===========================================
  Hits               ?         2           
  Misses             ?         0           
  Partials           ?         0

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

A-nnonymous · 2025-09-24T06:02:04Z

/re-run all-failed

A-nnonymous · 2025-09-24T08:13:39Z

/re-run all-failed

A-nnonymous · 2025-10-09T07:45:34Z

/re-run all-failed

wanghuancoder

LGTM

…5476)

…Paddle#75476)" This reverts commit fcf3c3f.

#75907) * Revert "Disable CUBLAS TF32 for default for better precision. (#75476)" This reverts commit fcf3c3f. * Update __init__.py test=document_fix

Disable CUBLAS TF32 for default for better precision.

09bca70

A-nnonymous requested review from phlrain and qili93 as code owners September 23, 2025 09:07

A-nnonymous changed the title ~~Disable CUBLAS TF32 for default for better precision.~~ Disable NVIDIA_TF32_OVERRIDE by default for better precision. Sep 23, 2025

sneaxiy approved these changes Sep 23, 2025

View reviewed changes

swgu98 added the skip-ci: template label Oct 11, 2025

wanghuancoder approved these changes Oct 11, 2025

View reviewed changes

phlrain approved these changes Oct 13, 2025

View reviewed changes

wanghuancoder merged commit fcf3c3f into PaddlePaddle:develop Oct 13, 2025
119 of 131 checks passed

SigureMo pushed a commit to cattidea/Paddle that referenced this pull request Oct 14, 2025

Disable CUBLAS TF32 for default for better precision. (PaddlePaddle#7…

84622c2

…5476)

A-nnonymous mentioned this pull request Oct 17, 2025

Revert "Disable NVIDIA_TF32_OVERRIDE by default for better precision." #75907

Merged

A-nnonymous added a commit to A-nnonymous/Paddle that referenced this pull request Oct 17, 2025

Revert "Disable CUBLAS TF32 for default for better precision. (Paddle…

c167bf2

…Paddle#75476)" This reverts commit fcf3c3f.

A-nnonymous mentioned this pull request Nov 20, 2025

cherrypick TF32 related modifications to fleety12 #76479

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Disable NVIDIA_TF32_OVERRIDE by default for better precision.#75476

Disable NVIDIA_TF32_OVERRIDE by default for better precision.#75476
wanghuancoder merged 1 commit intoPaddlePaddle:developfrom
A-nnonymous:fix_default_gemm_prec

A-nnonymous commented Sep 23, 2025 •

edited

Loading

Uh oh!

A-nnonymous commented Sep 24, 2025

Uh oh!

codecov-commenter commented Sep 24, 2025

Uh oh!

A-nnonymous commented Sep 24, 2025

Uh oh!

A-nnonymous commented Sep 24, 2025

Uh oh!

A-nnonymous commented Oct 9, 2025

Uh oh!

wanghuancoder left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Conversation

A-nnonymous commented Sep 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Category

PR Types

Description

Uh oh!

A-nnonymous commented Sep 24, 2025

Uh oh!

codecov-commenter commented Sep 24, 2025

Codecov Report

Uh oh!

A-nnonymous commented Sep 24, 2025

Uh oh!

A-nnonymous commented Sep 24, 2025

Uh oh!

A-nnonymous commented Oct 9, 2025

Uh oh!

wanghuancoder left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

A-nnonymous commented Sep 23, 2025 •

edited

Loading