[Cherry-pick Fleety_12] Bigtensor and api precision #76023
Closed
zhengshengning wants to merge 24 commits into PaddlePaddle:fleety_12 from …
Conversation
…en op type is 'div' (PaddlePaddle#75237)
…nsor is floating (PaddlePaddle#75238): align LinspaceKernel; update meta; update gpu kernel; fix LinspaceKernelInner; improve kernel
… * (1 + tan(x)^2) (PaddlePaddle#75335): Tan backward calculation: dx = dout * (1 + tan(x)^2); see the tan-gradient sketch after this list
…onal.grid_sample to align with torch accuracy (PaddlePaddle#75355): accuracy_stable_grid_sample; follow-up fixes
…ch precision (PaddlePaddle#75503): accuracy_stable_sin; accuracy_stable_cos; follow-up fixes
…ackward (PaddlePaddle#75525): fix precision for float16 of paddle.tan backward; fix else branch of CudaTanGradFunctor
…dle#75549): accuracy_stable_expm1; follow-up fix
…ional.softplus to double (PaddlePaddle#75426): fix beta and threshold of Softplus to double; fix test_softplus_activation_fuse_pass; fix test_activation_zero; fix float of SoftplusDoubleGradKernel to double; add op_patches for softplus; add yaml for ops/yaml/legacy; fix infershape/operator for FLOAT64; add SoftPlusOpTranscriber; fix coverage; fix dcu
…addlePaddle#75799): accuracy_stable_log; follow-up fixes
…ble (PaddlePaddle#75816): accuracy_stable_logit; add LogitOpTranscriber; fix coverage; fix yaml
accuracy_stable_log_sigmoid; fix test_activation_stride_op.py
…e paddle.nn.functional.leaky_relu API to double (PaddlePaddle#75547)
…le#75856): fix funcs; gpu; revise the PADDLE_ENFORCE message; fix cpu error; fix dcu; follow-up fixes
feature: add specialized LogSigmoidFunctor and CudaLogSigmoidFunctor for complex numbers. The specializations use direct formulas for better accuracy and stability on complex inputs. refactor: cache exp(-x) in both functors to avoid redundant computation for complex types. refactor: modify the formula in LogSigmoidFunctor to make it numerically stable; see the stable-form sketch after this list.
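The tan backward rule referenced in PaddlePaddle#75335 and PaddlePaddle#75525 follows from the identity d/dx tan(x) = sec^2(x) = 1 + tan^2(x). A minimal sketch of that rule, not Paddle's actual functor (the name is illustrative):

```cpp
#include <cmath>

// Illustrative only: dx = dout * (1 + tan(x)^2).
// Reusing tan(x) avoids a second trig evaluation (e.g. cos(x)) and
// matches the identity d/dx tan(x) = 1 + tan(x)^2.
template <typename T>
T TanGradSketch(T x, T dout) {
  T t = std::tan(x);
  return dout * (static_cast<T>(1) + t * t);
}
```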
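On the LogSigmoid stability point: the naive -log(1 + exp(-x)) overflows for large negative x because exp(-x) blows up. For real inputs the standard stable identity is log(sigmoid(x)) = min(x, 0) - log1p(exp(-|x|)). A minimal sketch of that identity, not the Paddle functor itself (which, per the commit, additionally specializes for complex types and caches exp(-x)):

```cpp
#include <algorithm>
#include <cmath>

// Stable log-sigmoid: log(1/(1+exp(-x))) = min(x,0) - log1p(exp(-|x|)).
// exp is only ever called on a non-positive argument, so it cannot
// overflow; log1p keeps precision when exp(-|x|) is close to zero.
template <typename T>
T LogSigmoidSketch(T x) {
  return std::min(x, static_cast<T>(0)) -
         std::log1p(std::exp(-std::abs(x)));
}
```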
PR Category
Operator Mechanism
PR Types
New features
Description
Cherry-pick the following PRs from paddle develop into fleety_12:
Big Tensor:
#75856
#75523
#75383
Bit-for-bit precision alignment:
#75717
#75379
#75588
#75605
#75799
#75341
#75503
#75355
#75363
#75426
#75454
#75367
#75335
#75525
#75549
#75816
#75547
#74638
#75237
#75238
#75965
#75898
Some of the fluid, pir, onednn, pass, composite operator, and auto parallel changes exist because the kernel signatures were changed from float to double, so as not to lose attribute precision and to stay aligned with Torch. These kernels, however, have been float-typed all the way from the Maker of the final operator library, which raised many compatibility issues, for example:
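One concrete shape of the problem, sketched with hypothetical values (not taken from the PRs): an op definition that still declares an attribute as float silently truncates whatever double value the API passes down, even when the kernel now accepts double.

```cpp
#include <cstdio>

int main() {
  // Hypothetical attribute value; 0.1 is not exactly representable,
  // so float (~7 significant digits) cannot round-trip what double
  // (~16 significant digits) holds.
  double beta = 0.1;
  float legacy_attr = static_cast<float>(beta);       // float slot in the op Maker
  double seen_by_kernel = static_cast<double>(legacy_attr);
  std::printf("declared:    %.17g\n", beta);          // 0.10000000000000001
  std::printf("kernel sees: %.17g\n", seen_by_kernel);// 0.10000000149011612
  return 0;
}
```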
In addition, after the computation logic of some operators was adjusted, the corresponding composite operators had to adjust their composition logic to match, which led to a number of composite operator changes.