Dear broom maintainer,
the problem
I was runnning lm on a dataset with NA values, and found augment doesn't work with na.action = na.exclude
code
df <- data.frame(
id = 1:10,
x = rnorm(10),
y = rnorm(10)
)
df$x[5] = NA
broom::augment(lm(y~x, data = df, na.action = na.exclude))
output
> Error in `$<-`:
! Assigned data `predict(x, na.action = na.pass, ...) %>% unname()` must be compatible with existing data.
✖ Existing data has 9 rows.
✖ Assigned data has 10 rows.
ℹ Only vectors of size 1 are recycled.
Caused by error in `vectbl_recycle_rhs_rows()`:
! Can't recycle input of size 10 to size 9.
Run `rlang::last_trace()` to see where the error occurred.
remove the na.action = na.exclude option will work. Actually, the following z1 and z2
z1 = lm(y~x, data = df, na.action = na.exclude)
z2 = lm(y~x, data = df) # the default na.action is na.omit
have the same model, coefficients, residuals components, making me really wonder how the na.exclude and na.omit will influence augment's behavior
I'm not entirely sure if the issue we saw above originating from the augment function or the lm function. I would greatly appreciate any insights or guidance you could offer on this matter. Thank you in advance for your assistance.
Thanks!
sessioninfo
> sessionInfo()
R version 4.2.3 (2023-03-15)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Ubuntu 22.04.3 LTS
Matrix products: default
BLAS: /usr/lib/x86_64-linux-gnu/blas/libblas.so.3.10.0
LAPACK: /usr/lib/x86_64-linux-gnu/lapack/liblapack.so.3.10.0
locale:
[1] LC_CTYPE=C.UTF-8 LC_NUMERIC=C LC_TIME=C.UTF-8 LC_COLLATE=C.UTF-8 LC_MONETARY=C.UTF-8 LC_MESSAGES=C.UTF-8
[7] LC_PAPER=C.UTF-8 LC_NAME=C LC_ADDRESS=C LC_TELEPHONE=C LC_MEASUREMENT=C.UTF-8 LC_IDENTIFICATION=C
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] purrr_1.0.2 broom_1.0.5 tidyr_1.3.0 dplyr_1.1.3 furrr_0.3.1 future_1.33.0
loaded via a namespace (and not attached):
[1] parallelly_1.36.0 rstudioapi_0.15.0 knitr_1.44 magrittr_2.0.3 tidyselect_1.2.0 R6_2.5.1 rlang_1.1.1 fansi_1.0.5
[9] globals_0.16.2 tools_4.2.3 parallel_4.2.3 xfun_0.40 utf8_1.2.4 cli_3.6.1 digest_0.6.33 tibble_3.2.1
[17] lifecycle_1.0.3 vctrs_0.6.4 codetools_0.2-19 glue_1.6.2 compiler_4.2.3 pillar_1.9.0 generics_0.1.3 backports_1.4.1
[25] listenv_0.9.0 pkgconfig_2.0.3
Dear
broommaintainer,the problem
I was runnning
lmon a dataset with NA values, and foundaugmentdoesn't work withna.action = na.excludecode
output
remove the
na.action = na.excludeoption will work. Actually, the followingz1andz2have the same
model,coefficients,residualscomponents, making me really wonder how thena.excludeandna.omitwill influenceaugment's behaviorI'm not entirely sure if the issue we saw above originating from the
augmentfunction or thelmfunction. I would greatly appreciate any insights or guidance you could offer on this matter. Thank you in advance for your assistance.Thanks!
sessioninfo