
add ForwardDiff@1 #378

Open · wants to merge 9 commits into main from py/forwarddiff-1

Conversation

penelopeysm (Member) commented Mar 28, 2025:

Hooray, all tests passing (except known Enzyme failures!)

Closes #377
Closes #376

penelopeysm (Member, Author) commented Mar 30, 2025:

The test environment still doesn't resolve to FD=1, since that would apparently downgrade a bunch of other packages. I'm slowly tracking them down, starting with NNlib in FluxML/NNlib.jl#637 (comment), but it might be a while until everything's updated.

In the meantime, I removed ForwardDiff 0.10 from the test compat, to force the tests to run on FD=1.
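
(For illustration, that compat edit is roughly equivalent to the following Pkg operations; the actual change in this PR is a manual edit of the test compat entry, so treat this as a hedged sketch.)

using Pkg

# Point Pkg at the test environment and narrow ForwardDiff's compat
# entry to major version 1, so the resolver can no longer pick 0.10.
Pkg.activate("test")
Pkg.compat("ForwardDiff", "1")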

penelopeysm (Member, Author) commented Mar 30, 2025:

ForwardDiff test fails at test/interface.jl:189. To reproduce the failing test:

using Distributions, Bijectors, ForwardDiff, LinearAlgebra, Test
dist = Dirichlet([1000 * one(Float64), eps(Float64)])
b = Bijectors.SimplexBijector()
r = rand(dist)
x = if any(r .> 0.9999)
    [0.0, 1.0][sortperm(r)]
else
    r
end
y = b(x)
ForwardDiff.jacobian(inverse(b), y)[1:(end - 1), :]
@test logabsdet(ForwardDiff.jacobian(inverse(b), y)[1:(end - 1), :])[1] ≈
    logabsdetjac(inverse(b), y)

I bisected the failure to JuliaDiff/ForwardDiff.jl#695: it seems that the new (more correct) inequality comparisons are interfering with

function _clamp(x, a, b)
    T = promote_type(typeof(x), typeof(a), typeof(b))
    ϵ = _eps(T)
    clamped_x = ifelse(x < a, convert(T, a), ifelse(x > b, convert(T, b), x))
    DEBUG && _debug("x = $x, bounds = $((a, b)), clamped_x = $clamped_x")
    return clamped_x
end
because a and b are reals, whereas x is a ForwardDiff.Dual. Specifically, in this instance, we have

x = Dual(1.0, eps(Float64))
a = 0
b = 1

and before that PR, x < a and x > b both return false, so clamped_x is just set to x.

On the other hand, after that PR, x > b now returns true, so clamped_x is now convert(T, b) = Dual(1.0, 0).

It seems to me, mathematically, that _clamp is not differentiable at the points $x = a$ or $x = b$, so the new behaviour (derivative set to 0, true for $x \to a^-$ or $x \to b^+$) is no less correct than the old behaviour (derivative is preserved, true for $x \to a^+$ or $x \to b^-$).
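
(As a sanity check, here is a minimal standalone sketch of that behaviour; clamp01 is a hypothetical stand-in for _clamp with fixed bounds 0 and 1, not the library code.)

using ForwardDiff

# A fixed-bounds analogue of _clamp: out-of-range inputs are replaced by
# zero(x)/one(x), which for a Dual carry zero partials.
clamp01(x) = ifelse(x < 0, zero(x), ifelse(x > 1, one(x), x))

ForwardDiff.derivative(clamp01, 0.5)  # 1.0: x is returned, partial preserved
ForwardDiff.derivative(clamp01, 2.0)  # 0.0: the bound is returned, partial dropped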


Resolved by changing the parameters of the Dirichlet distribution so that it doesn't generate samples like [0, 1] for which the derivative is undefined.

penelopeysm (Member, Author) commented Mar 31, 2025:

Failing tests in test/transform.jl were reported in JuliaDiff/ForwardDiff.jl#738 and fixed in JuliaDiff/ForwardDiff.jl#739.

@penelopeysm penelopeysm marked this pull request as draft March 31, 2025 23:41
@penelopeysm penelopeysm marked this pull request as ready for review April 3, 2025 19:31
@penelopeysm penelopeysm force-pushed the py/forwarddiff-1 branch 2 times, most recently from f257ca9 to 2583065, April 3, 2025 21:03
@yebai yebai requested review from torfjelde and sunxd3 April 8, 2025 05:39
@sunxd3 sunxd3 requested a review from Copilot April 8, 2025 10:53
@Copilot Copilot AI left a comment:

Copilot reviewed 2 out of 5 changed files in this pull request and generated 1 comment.

Files not reviewed (3)
  • src/Bijectors.jl: Language not supported
  • test/interface.jl: Language not supported
  • test/transform.jl: Language not supported

sunxd3 (Member) commented Apr 8, 2025:

On Buildkite: a short while ago I asked the JuliaGPU folks to add this, since GPU tests need to run on a server with a GPU, which standard GitHub CI doesn't provide. We don't currently have GPU testing set up, and I think that's why the Buildkite check is failing. (It doesn't look great to have a failing check, though; I'll fix it soon.)

sunxd3 (Member) left a comment:

Looks great!

@@ -145,12 +145,10 @@ end
 @testset "Multivariate" begin
     vector_dists = [
         Dirichlet(2, 3),
-        Dirichlet([1000 * one(Float64), eps(Float64)]),
-        Dirichlet([eps(Float64), 1000 * one(Float64)]),
+        Dirichlet([10.0, 0.1]),
sunxd3 (Member) commented:
I am not sure these new tests can replace the current ones. Tests like

Dirichlet([1000 * one(Float64), eps(Float64)]),
Dirichlet([eps(Float64), 1000 * one(Float64)]),

are aimed at the numerical stability of very extreme Dirichlet distributions, i.e. ones where one axis has a very tiny probability mass on average.

penelopeysm (Member, Author) commented Apr 10, 2025:

It's actually the sample that's the problem. For the sample x = [1.0, 0.0], the transformed variable is y = [36.0436], which is outside the range in which Float64 is numerically stable.

The issue comes from these lines:

@inbounds z = LogExpFunctions.logistic(y[1] - log(T(K - 1)))
@inbounds x[1] = _clamp((z - ϵ) / (one(T) - 2ϵ), 0, 1)

As y[1] tends to +Inf, z tends to 1, and the expression (z - ϵ) / (one(T) - 2ϵ) tends towards 1.0000000000000002. If that expression is greater than 1, then it gets _clamped to 1, and the derivative is set to 0.

The difference between FD 0.10 and FD 1.0 is that the new version sets the derivative to 0 if (z - ϵ) / (one(T) - 2ϵ) is greater than, or equal to, 1. That in turn means there is a larger range of y[1] for which the derivative gets clamped. Unfortunately, y[1] = 36.0436 in Float64 falls into that category (35.8 would have been fine; alternatively, BigFloat is fine up until around 175).
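
(To get a feel for that window, here is a hedged standalone sketch for the K = 2 case, where the log(T(K - 1)) term vanishes; saturates is a hypothetical helper, not library code.)

using LogExpFunctions

# true once the clamp boundary is reached, i.e. once the derivative
# would be zeroed under the FD 1.0 behaviour described above
function saturates(y1)
    ϵ = eps(typeof(y1))
    return (logistic(y1) - ϵ) / (one(y1) - 2ϵ) >= 1
end

saturates(30.0)       # false: logistic(30) is still comfortably below 1
saturates(40.0)       # true: logistic(40) rounds to 1.0 in Float64
saturates(big"40.0")  # false: BigFloat only saturates around y1 ≈ 175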

As far as I can tell, the fact that it used to work with FD 0.10 might have been a happy accident. I wrote more about this in a comment above, but (to me) it makes sense for FD to set the derivative to 0 at the point (z - ϵ) / (one(T) - 2ϵ) == 1.

penelopeysm (Member, Author) commented Apr 10, 2025:

I am still not fully sure how to resolve this though, which is why I haven't really come back to this PR. Obviously changing the sample fixes the tests (and the easiest way to change the sample was to change the distribution from which it was drawn), but I can't tell if there's a workaround in the code that makes it work again for (z - ϵ) / (one(T) - 2ϵ) == 1.0, or more generally for large y.

penelopeysm (Member, Author) commented:

Pinging @devmotion for your thoughts too :)

A member commented:

Hmm, I'm not sure. I've always had the feeling that this stick-breaking transform (explained in, e.g., the Stan docs) can be numerically problematic, and I've always thought these eps workarounds are unsatisfying. But I'm not sure what exactly would break if they were removed; it might be interesting to see.
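
(For readers unfamiliar with the transform, here is a minimal unclamped sketch of the K = 2 stick-breaking pair, following the construction in the Stan docs; sb_forward and sb_inverse are hypothetical names, not Bijectors' API.)

using LogExpFunctions

# Forward: first simplex coordinate x₁ ∈ (0, 1) → unconstrained y₁
sb_forward(x1) = logit(x1)    # for K = 2 the log(K - 1) shift is zero
# Inverse: unconstrained y₁ → first simplex coordinate
sb_inverse(y1) = logistic(y1)

sb_inverse(sb_forward(0.3))   # ≈ 0.3: round trip works away from the boundary
sb_inverse(sb_forward(1.0))   # logit(1.0) == Inf; logistic saturates to 1.0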

yebai (Member) commented Apr 23, 2025:

@penelopeysm, please feel free to merge.
