Replies: 4 comments 9 replies
-
|
Can you check on the latest master? I fixed a bug last week... |
Beta Was this translation helpful? Give feedback.
-
|
Can you post a minimal code example with synthetic data? |
Beta Was this translation helpful? Give feedback.
-
|
There might still be a bug, but a remark: if all your variables are categorical, then rather use the Gsquared or CMIsymb CI tests. |
Beta Was this translation helpful? Give feedback.
-
|
The error was that your categorical variable contained only one class and sklearn cannot deal with this. I modified the deviance score to return 0.0 in this case. Check again. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hello everyone,
I'm currently using PCMCI+ as part of my bachelor's thesis on process optimization in injection molding using causal machine learning. I encountered a limitation when applying the Regression CI method with discrete variables.
Some of my variables have exactly two categories, and others have three. The issue arises when I set three discrete variables (regardless of their specific values): the method throws an error and fails to run (ValueError: Unknown label type: continuous. Maybe you are trying to fit a classifier, which expects discrete classes on a regression target with continuous values). However, if I only specify two variables as discrete—regardless of whether they have two or three categories—Regression CI runs without problems and yields reasonable results.
Is there a known limitation in Regression CI regarding the number of discrete variables that can be used simultaneously? Could this be related to how the data are formatted or encoded?
I’d appreciate any clarification or suggestions on how to handle this case.
Thanks in advance!
Beta Was this translation helpful? Give feedback.
All reactions