Scaling performed within IntegrateLayers and RPCAIntegration and issue with regressing out variables #9853

JTumulty · 2025-04-25T18:28:24Z

JTumulty
Apr 25, 2025

I have noticed that when performing integration with IntegrateLayers using the RPCAIntegration method, that many of the features I attempted to regress out in my PCA (e.g. nCount RNA, and percent mito), are now strongly enriched in certain regions of the umap after integration with RPCA. This totally defeats the purpose of regressing them out in my PCA!

I suspect that this has to do with the step in the RPCAIntegration method where each layer is rescaled before FindIntegrationAnchors is run. I believe that, no matter what, this rescaling step should be more clearly specified somewhere. Even better, the RPCAIntegration function should either fetch the variable used in regression from the original ScaleData run, or you should be able to specify them so that these same variables are used in any scaling performed within the integration functions.

Seeing as I have seen a lot of people having trouble with other integration methods and cell cycle regression I think this is an important topic to discuss. Let me know if anyone else has observed this effect or has thoughts!

JTumulty · 2025-04-25T18:49:04Z

JTumulty
Apr 25, 2025
Author

After running FindIntegrationAnchors independently, I don't think this issue is having as large of an effect as I previously thought. However, I still wonder if it is worth it to use the scaled.data that the user actually provides as opposed to one recalculated within the integration function.

I also think it is worth discussing this topic in general. I am not sure there is a clear answer for dealing with feature regression and integration, especially for a method like RPCA where you do not get back corrected data on which you could run subsequent scaling.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Scaling performed within IntegrateLayers and RPCAIntegration and issue with regressing out variables #9853

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Scaling performed within IntegrateLayers and RPCAIntegration and issue with regressing out variables #9853

Uh oh!

JTumulty Apr 25, 2025

Replies: 1 comment

Uh oh!

JTumulty Apr 25, 2025 Author

JTumulty
Apr 25, 2025

JTumulty
Apr 25, 2025
Author