Conversation

@vincentarelbundock (Owner) commented Aug 31, 2025

@arcruz0

I’ve been thinking about autodiff a lot recently and have become convinced that a tight integration with marginaleffects is highly desirable. The gains are substantial and the implementation straightforward.

So here’s my proposal:

  1. Create a marginaleffectsAD package in Python to host all the JAX functions.
  2. Create a helper function in the R version of marginaleffects that inspects a marginaleffects call, dispatches to the appropriate function from the Python package, and falls back to finite differences whenever necessary.

There are three main benefits to this approach, the first being the most important:

  1. The same JAX functions can be re-used in both the R and Python versions of marginaleffects.
  2. We need to host much less boilerplate code in R.
  3. This allows tight integration with an autodiff() function, directly in the R package, without having to rely on hacks like global options.

The Python interface would look something like this:

import marginaleffectsAD as mad
mad.logit.predictions.jacobian_byG(beta, X, groups, num_groups)

In addition to your existing functions, I added support for Probit and Poisson, as well as an initial implementation of comparisons() and avg_comparisons() to compute ATE / G-computation.
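To make this concrete, here is a rough sketch of how such a grouped Jacobian could be written with JAX. This is illustrative only: the call signature mirrors the example above, but the helper name predictions_byG and the body are hypothetical, not the actual marginaleffectsAD implementation.

import jax
import jax.numpy as jnp

def predictions_byG(beta, X, groups, num_groups):
    # logit predictions averaged within each group (e.g., by treatment arm)
    p = jax.nn.sigmoid(X @ beta)
    sums = jax.ops.segment_sum(p, groups, num_segments=num_groups)
    counts = jax.ops.segment_sum(jnp.ones_like(p), groups, num_segments=num_groups)
    return sums / counts

# Jacobian of the group-averaged predictions with respect to beta,
# which is exactly what the delta method needs for standard errors
jacobian_byG = jax.jacobian(predictions_byG, argnums=0)

The ATE / G-computation case in comparisons() would look similar: average the predictions under two counterfactual model matrices, take the difference, and differentiate with respect to beta.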

I sent you an invitation as a contributor to the marginaleffectsAD Python package repo, in case you would like to move core development efforts there.

In this PR, I implement a prototype of this idea on the R side. The code below shows that the workflow could be very nice for users.

library(marginaleffects)

# install python dependencies
autodiff(install = TRUE)

# activate autodiff
autodiff(TRUE)

mod <- glm(carb ~ ., data = mtcars, family = poisson)

avg_comparisons(mod)
JAX is fast!


 Term Contrast Estimate Std. Error      z Pr(>|z|)   S   2.5 %  97.5 %
 am      1 - 0 -0.51569    1.43986 -0.358   0.7202 0.5 -3.3378 2.30639
 cyl     +1     0.23779    0.72932  0.326   0.7444 0.4 -1.1916 1.66723
 disp    +1    -0.01349    0.00806 -1.674   0.0942 3.4 -0.0293 0.00231
 drat    +1     0.78415    1.40614  0.558   0.5771 0.8 -1.9718 3.54014
 gear    +1     0.84879    1.14651  0.740   0.4591 1.1 -1.3983 3.09590
 hp      +1     0.00467    0.01143  0.408   0.6830 0.5 -0.0177 0.02706
 mpg     +1    -0.05458    0.15711 -0.347   0.7283 0.5 -0.3625 0.25334
 qsec    +1    -0.32770    0.45950 -0.713   0.4757 1.1 -1.2283 0.57290
 vs      1 - 0 -0.60648    1.33246 -0.455   0.6490 0.6 -3.2180 2.00509
 wt      +1     2.20869    1.81349  1.218   0.2233 2.2 -1.3457 5.76306

Type: response
# accuracy
auto <- function() {
    autodiff(TRUE)
    avg_comparisons(mod) |> suppressMessages()
}

finite <- function() {
    autodiff(FALSE)
    avg_comparisons(mod)
}

# results are within numerical tolerance
a <- auto()
f <- finite()
all.equal(a$estimate, f$estimate)
[1] TRUE
all.equal(a$std.error, f$std.error)
[1] "Mean relative difference: 6.694612e-07"
# benchmark
library(microbenchmark)
microbenchmark(
    auto(),
    finite()
)
Warning in microbenchmark(auto(), finite()): less accurate nanosecond times to
avoid potential integer overflows

Unit: milliseconds
     expr      min       lq     mean   median       uq      max neval cld
   auto() 22.23348 22.85422 24.75146 23.29194 24.91074 85.21522   100  a 
 finite() 58.03616 60.51901 62.56966 61.90844 63.32104 77.45339   100   b

@vincentarelbundock (Owner, Author)

I would be really interested in people's experience with this.

  • Is the user interface easy and intuitive?
  • Did you have problems setting it up?
  • Did you run into unexpected errors?
  • Are the warnings/fallbacks clear and informative?

The feature set is currently limited to lm and glm (logit, probit, and poisson), and we do not yet support arguments like hypothesis and wts. But the speed is pretty amazing. In models with many parameters, I get a 5-15x speedup.

Tagging people who seem to like bleeding edge stuff: @strengejacke @mattansb @andrewheiss @saudiwin

# Install the dev version of `marginaleffects`
remotes::install_github("vincentarelbundock/marginaleffects@autodiff")

library(marginaleffects)
library(microbenchmark)

# Install Python autodiff dependencies
autodiff(install = TRUE)

# Activate autodiff
autodiff(TRUE)

# Download data and fit a large model
dat <- get_dataset("airbnb")
mod <- glm(TV ~ ., data = dat, family = binomial)

# Average Predictions
finite <- function() {
    autodiff(FALSE)
    predictions(mod, type = "response")
}

auto <- function() {
    autodiff(TRUE)
    predictions(mod, type = "response")
}

microbenchmark(finite(), auto(), times = 5)

# Average Treatment Effect
finite <- function() {
    autodiff(FALSE)
    avg_comparisons(mod, variables = "Heating")
}

auto <- function() {
    autodiff(TRUE)
    avg_comparisons(mod, variables = "Heating")
}

microbenchmark(finite(), auto(), times = 5)

@strengejacke (Contributor)

What's autodiff? Does it require Python?
(if the answer to the 2nd question is "yes", I'm out ;-))

@vincentarelbundock (Owner, Author)

Autodiff gives you faster and more accurate derivatives, which is what we need for standard errors.

It requires an installation of Python on your machine, but you shouldn't have to interact with Python at all.

autodiff(install=TRUE) should do everything for you from R, using the reticulate package.
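If you're curious about the mechanics, here is a toy sketch (made-up numbers, not package code) of how an autodiff gradient feeds into a delta-method standard error:

import jax
import jax.numpy as jnp

# toy stand-ins for a fitted logit model
X = jnp.array([[1.0, 0.2], [1.0, -1.5], [1.0, 0.7]])  # model matrix
beta = jnp.array([0.1, 0.5])                          # coefficients
vcov = jnp.array([[0.04, 0.0], [0.0, 0.09]])          # coefficient covariance

def avg_prediction(b):
    # average fitted probability
    return jnp.mean(jax.nn.sigmoid(X @ b))

# exact gradient via autodiff (finite differences only approximate this)
g = jax.grad(avg_prediction)(beta)

# delta-method standard error of the average prediction
se = jnp.sqrt(g @ vcov @ g)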

@strengejacke (Contributor)

Ah ok. But I don't have Python installed... 😬

@t-kalinowski (Contributor)

You don't need to install Python - reticulate bootstraps everything it needs on its own.

@vincentarelbundock (Owner, Author)

@t-kalinowski I think I said it to you in person, but that thing is magic.

@saudiwin commented Sep 1, 2025

So, obviously I don't have much to add in terms of autodiff expertise, but the move makes a lot of sense to me. Upgrading the differentiation engine to something state-of-the-art that is robust to many parameters will make it useful for many more kinds of applications in science & industry. Maybe even deep learning? Not sure they want marginaleffects, but it's just a thought.

The obvious lacuna is, of course, support for ordered beta regression 😁

It's a standard GLM so it shouldn't be too hard, right?

@saudiwin commented Sep 1, 2025

And re: reticulate, yes, that is really cool, though at the same time a Python installation is tech debt. But as long as it's an optional part of the package, I don't think it's a problem.

There will always be someone running some weird hacked version of Linux in a smart washing machine who won't be able to install it, and CRAN will punish you if you make it a default & they can't install it, etc.

@mattansb (Contributor) commented Sep 2, 2025

This is very cool!

I haven't tried it out yet, but is it currently limited only to lm and glm classes? Or more generally these types of models (e.g., rms::ols())?

(@t-kalinowski I also have to chime in on the praise - reticulate is awesome. I keep telling people that because of it, R is even better than Python in Python! 😉)

@vincentarelbundock (Owner, Author)

@mattansb

I haven’t tried it out yet, but is it currently limited only to lm and glm classes? Or more generally these types of models (e.g., rms::ols())?

All we need to do is write a predict() function in JAX to make predictions based on the model.matrix and vector of coefficients. rms::ols() and rms::lrm() are thus easy to support (except for things like penalty).
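As a rough sketch (hypothetical code, not what marginaleffectsAD actually contains), the JAX side for a binary lrm() model could be as small as:

import jax
import jax.numpy as jnp

def predict_lrm(beta, X):
    # fitted probabilities from the model matrix and coefficient vector,
    # which is all the R side needs to hand over
    return jax.nn.sigmoid(X @ beta)

# Jacobian of every fitted value with respect to beta, used for the
# delta-method standard errors compared below
predict_lrm_jacobian = jax.jacobian(predict_lrm, argnums=0)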

Here we get a 13x speedup.

library(microbenchmark)
library(marginaleffects)
library(rms)
dat <- get_dataset("airbnb")
mod <- lrm(TV ~ ., data = dat)

finite <- function() {
  autodiff(FALSE)
  predictions(mod, type = "fitted")
}

auto <- function() {
  autodiff(TRUE)
  predictions(mod, type = "fitted")
}

p1 <- auto()
p2 <- finite()

all.equal(p1$estimate, p2$estimate)

    [1] TRUE

all.equal(p1$std.error, p2$std.error, tol = 1e-6)

    [1] TRUE

microbenchmark(finite(), auto(), times = 5)

    Unit: milliseconds
         expr       min        lq      mean    median        uq       max neval cld
     finite() 2590.1808 2596.2159 2604.5910 2598.2415 2602.0976 2636.2194     5  a 
       auto()  186.3906  241.0184  253.8272  253.9089  270.5217  317.2962     5   b

@mattansb (Contributor) commented Sep 2, 2025

Wild!

@strengejacke (Contributor)

Just to summarize:

  1. I install the reticulate package
  2. I install marginaleffects from this PR
  3. I run autodiff(TRUE)

And then, whenever possible, marginaleffects uses JAX for the Jacobian internally, which is much faster (and sometimes also more accurate)?

@strengejacke (Contributor) commented Sep 2, 2025

Related to this comment: https://bsky.app/profile/bbolker.bsky.social/post/3lxpwy3nsb222

Since glmmTMB is based on TMB, is it still worth having dedicated support for glmmTMB and autodiff in marginaleffects, or do you not expect large benefits in terms of speed?

@vincentarelbundock (Owner, Author)

Since glmmTMB is based on TMB, is it still worth having dedicated support for glmmTMB and autodiff in marginaleffects, or do you not expect large benefits in terms of speed?

I don't expect to be able to support glmmTMB at all. (But maybe I'm wrong!)

@vincentarelbundock (Owner, Author) commented Sep 2, 2025

You also need to call autodiff(install = TRUE) and then autodiff(TRUE); autodiff() is a new exported marginaleffects function.

@vincentarelbundock merged commit ee2ea3a into main on Sep 4, 2025
10 checks passed
@vincentarelbundock deleted the autodiff branch on September 6, 2025
@teecrow commented Sep 16, 2025

As a suggestion for the documentation/help file for autodiff(): it could be useful to clarify whether autodiff(autodiff=TRUE) needs to be run once ever or once per session. (It's clear that running it a second time after restarting the session does not trigger the same Python installs as the first time I ran it, but it's not clear to me whether autodiff will now be used by default for supported models in every new session or not.)

@vincentarelbundock (Owner, Author)

Thanks.

TBH, I think the current documentation is explicit enough, because it already says that TRUE enables and FALSE disables.

If you want to submit a PR with improved wording, I'll be happy to review it.

@teecrow commented Sep 16, 2025

What I mean is: once enabled, is it enabled forever? Or does it need to be enabled for every new R session? (Forgive me if this is obvious to those more R-savvy; I consider myself only intermediate in my understanding of these things.)

If the former, the wording could be:

autodiff only needs to be enabled once: it will persist across sessions.

If the latter, the wording could be:

autodiff needs to be enabled once at the start of each new R session.

@vincentarelbundock (Owner, Author)

ah, I see the potential for confusion now. It should not persist across sessions, so the second wording would be correct.

@andrewheiss (Contributor)

(like how tinytex::install_tinytex() only ever has to be run once on a computer vs. autodiff(TRUE), which has to be run once per session)

@vincentarelbundock (Owner, Author)

Improved the docs here. Thanks for the suggestion!

67f1edc
