adds vif metric from stats models by kikiluvbrains · Pull Request #676 · mne-tools/mne-nirs

kikiluvbrains · 2026-03-25T16:48:57Z

Reference issue

Multi-collinearity #413.

What does this implement/fix?

Add metrics to quantify collinearity in the design matrix. One simple way to deal with high multi-collinearity (typically VIF > 4–5) is to combine very similar regressors or drop the ones that are problematic.

There are other ways to handle high VIF, such as using principal component analysis (PCA), but I haven’t explored those yet, I am open to suggestions.

for more information, see https://pre-commit.ci

larsoner

Looks like a good start!

larsoner · 2026-04-03T21:52:27Z

 import mne
 import numpy as np
+from mne.utils import logger
+from statsmodels.stats.outliers_influence import variance_inflation_factor


statsmodels is not currently a dependency, so this functionality should probably be optional (or we can start to require statsmodels... but for a single check in a single function it seems like overkill.) So can you

Nest this import in a try/except, then only proceed with the VIF analysis if it's present (and logger.info that the check was skipped if it wasn't done)

Add to some group statsmodels so that some tests / CIs use it

Add some test that when statsmodels is installed (e.g., using pytest.importorskip("statsmodels") at the top of the test) the VIF is reported for some make_first_level_design_matrix calls? Bonus points if you add one that logger.warns because it's bad, or find one that's already bad and assert it.

We could add a new group full to the optional-dependencies here that has statsmodels in it for example

mne-nirs/pyproject.toml

Line 53 in 5acfff0

docs = ["sphinx", "sphinx-gallery", "pydata_sphinx_theme", "numpydoc", "matplotlib"]

Then make sure at least some of the CIs use it.

And CircleCI should definitely use it, so we can see the output in some examples...

Hey, thank you so much for your feedback!

I think we should opt out of statsmodels, I wrote an equivalent test for variance_inflation_factor, and also verified with some test data which gave me equivalent results to statsmodels functionality.

let me know what you think.

Yeah if it's simple enough feel free to do it that way

You could even write a little helper test that uses statsmodels (only in the test!) to verify equivalence of the helper function if you want (and then have CIs install it just for testing)

sounds good, I will add in a little helper test

…nirs into glm-multicollinearity

for more information, see https://pre-commit.ci

larsoner · 2026-04-15T17:02:10Z

Let me know when I should look!

…nirs into glm-multicollinearity

for more information, see https://pre-commit.ci

kikiluvbrains · 2026-04-17T17:36:57Z

@larsoner I am ready for you to take a look, currently am trying to pass all checks

for more information, see https://pre-commit.ci

kikiluvbrains · 2026-04-18T13:45:08Z

@larsoner

I do have an optional flag for "vif_export" as a way to test against stasmodel, but it causes this error in some of the html examples as shown in circleCI, let me know what you think, and best way I can patch this?

        ../examples/general/plot_11b_kf2_fingertapping.py failed leaving traceback:
    
        Traceback (most recent call last):
          File "/home/circleci/project/examples/general/plot_11b_kf2_fingertapping.py", line 416, in <module>
            design_matrix = make_first_level_design_matrix(
                            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
          File "/home/circleci/project/mne_nirs/experimental_design/_experimental_design.py", line 106, in make_first_level_design_matrix
            dm = make_first_level_design_matrix(
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
        TypeError: make_first_level_design_matrix() got an unexpected keyword argument 'vif_export'

larsoner · 2026-04-21T18:52:08Z

 import numpy as np

+# for the comparison of vif we need these two libraries
+from statsmodels.stats.outliers_influence import variance_inflation_factor


This should be imported inside the test in case people don't have it installed, and in the test you can hav.

pytest.importorskip("statsmodels")

larsoner · 2026-04-21T18:52:43Z

+    # wheras statsmodel has their own implmentation before extracting the vif values
+    # note vif will come with a level of uncertainity +/- 0.05 of what is reported
+    for key in vif:
+<<<<<<< HEAD


Looks like some problem with rebasing

larsoner · 2026-04-21T18:53:01Z

+    vif export : bool, optional
+        deafult set to false, if set to True will export vif values; 


Hmm... this probably shouldn't be part of the public API, so let's remove it

If we really had to have a way to get the values for the test we would want to create a private _make_first_level_design_matrix with this option available and the public make_first_level_design_matrix would always call it with return_vif=False, and the test could use the private one with return_vif=True

larsoner · 2026-04-21T18:55:47Z

+    for name, vif_idx in zip(predictor_names, vif_all):
+        msg = f"{name} with VIF of {vif_idx:.3f}"
+        if vif_idx > 4:
+            logger.warning("High collinearity " + msg)
+        else:
+            logger.info(msg)


You are already logging the VIFs so in the test you can do something like the following to recover the values:

from mne.utils import use_log_level, catch_logging ... def some_test(): ... with use_log_level("info"), catch_logging() as log: ... make_first_level_design_matrix(...) log = log.getvalue() vifs = np.array([line.split()[-1] for line in log.splitlines() if " VIF " in line], float)

kikiluvbrains and others added 2 commits March 25, 2026 12:37

add vif metric from stats models

7e463dd

[pre-commit.ci] auto fixes from pre-commit.com hooks

426f2b1

for more information, see https://pre-commit.ci

larsoner reviewed Apr 3, 2026

View reviewed changes

kikiluvbrains and others added 3 commits April 11, 2026 17:15

remove need of statsmodel

dfc290a

Merge branch 'glm-multicollinearity' of github.com:kikiluvbrains/mne-…

ca18ea5

…nirs into glm-multicollinearity

[pre-commit.ci] auto fixes from pre-commit.com hooks

5838b68

for more information, see https://pre-commit.ci

kikiluvbrains and others added 4 commits April 17, 2026 13:17

add test to comapre mne vif implementation to statsmodels vif

a4b4cdf

Merge branch 'glm-multicollinearity' of github.com:kikiluvbrains/mne-…

28b0fe5

…nirs into glm-multicollinearity

[pre-commit.ci] auto fixes from pre-commit.com hooks

bdee657

for more information, see https://pre-commit.ci

Merge branch 'main' into glm-multicollinearity

f7bd800

kikiluvbrains and others added 4 commits April 17, 2026 15:20

allow for optional export of vif to enable comparison with statsmodels

40552b1

allow for optional export of vif to enable comparison with statsmodels

b4a0522

[pre-commit.ci] auto fixes from pre-commit.com hooks

a53e2b4

for more information, see https://pre-commit.ci

re-run circlecI

4f3ef0d

larsoner reviewed Apr 21, 2026

View reviewed changes

		vif export : bool, optional
		deafult set to false, if set to True will export vif values;

Conversation

kikiluvbrains commented Mar 25, 2026

Reference issue

What does this implement/fix?

Uh oh!

larsoner left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

larsoner commented Apr 15, 2026

Uh oh!

kikiluvbrains commented Apr 17, 2026

Uh oh!

kikiluvbrains commented Apr 18, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants