Replies: 1 comment
-
|
Thanks for your query and details on the background of your problem!
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I'm looking into compositionally-valid causal inference methodologies that can be applied to timeseries. Are there any methods that you have stumbled across that would be appropriate?
I've found directed acyclic graphs used for compositional data in causal inference networks: https://academic.oup.com/ije/article/49/4/1307/5802547
Also, reciprocal log ratios that strengthen convergent cross mapping:
https://www.biorxiv.org/content/10.1101/2021.01.25.428037v1.full
A method is considered “compositionally-valid” when it is specifically designed to account for the unique properties of compositional data—data where each observation is a vector of non-negative values that sum to a fixed amount, such as next generation sequencing (NGS) count tables. In these tables, the absolute counts of each feature (like genes or microbes) depend on the sequencing depth, which is arbitrary and can vary between samples. Instead of raw counts, what matters biologically are the relative proportions or abundances of these features. Compositionally-valid methods, such as those involving log-ratio transformations (centered or isometric) or Aitchison distance, address this by analyzing the data in a transformed space where standard statistical and causal inference techniques can be safely applied. This approach prevents spurious results that can arise when traditional methods are used, since changes in one feature’s abundance necessarily affect the proportions of all others due to the constant-sum constraint. By ensuring that analyses are conducted on relative rather than absolute values, compositionally-valid methods are robust to differences in sequencing depth and enable more accurate, biologically meaningful conclusions, which is especially important for time series causal inference across heterogeneous sample.
@jakobrunge What I've been doing is doing a Center Log-Ratio transform of my data before I use TIGRAMITE but ideally there could be a method that is designed specifically for this type of data if available.
Beta Was this translation helpful? Give feedback.
All reactions