Holdout Sample #1021
Replies: 1 comment
-
|
Hi @rajeshpaleti999, Thank you for contacting us! I would like to inform you that Meridian is based on Bayesian Causal Inference, whose primary intent is the accurate estimation of causal marketing effects. It is not intended to be used for future predictions as a forecasting model. A high R-squared doesn't guarantee a good causal inference model, and the best predictive model might not be the best for causal inference. Hence, a model with 99% out-of-sample R-squared can still be a poor model for causal inference. We recommend a holdout sample balanced across geos and time periods, with similar observations for each. An imbalanced sample can lead to insufficient training data for geo or time effects. Meridian doesn't specify a holdout sample by default; you must define one and ensure its balance.
In short, the two methods are testing different things. The time-series split tests forecasting ability, while the random split tests the model's ability to generalize to unseen data from the same time period. Given that Meridian's primary purpose is causal inference, the random split is the more appropriate method for evaluating the model's fit. I hope this helps. Feel free to reach out if you have any questions or suggestions regarding Meridian. Thank you, Google Meridian Support Team |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
I ran the model under two scenarios of holdout samples:
The goodness-of-fit metrics, particularly, R-squared was particularly worse in the Option 1 (i.e., time series split) whereas the performance was comparable between training and holdout samples in Option 2 (i.e., random split). In both cases, I am using 80-20 split between training and holdout samples.
I remember reading that Meridian recommends random split (option 2). Is this correct? If so, how to reconcile with the practice in time-series models to not peek into the future while validating/testing models?
Beta Was this translation helpful? Give feedback.
All reactions