Refactor _handle_events_reading to allow extracting annotation information stand-alone #1389

matthiasdold · 2025-04-03T12:47:36Z

PR Description

Aim:
The _handle_events_reading in read.py reads onsets, duration and descriptions/event_ids from the events.tsv, then filters the columns for n\a values etc. and finally sets annotations to the raw object based on these filtered version of the events.tsv data.

    annot_from_raw = raw.annotations.copy()
    annot_from_events = mne.Annotations(onset=ons, duration=durs, description=descrs)
    raw.set_annotations(annot_from_events)

On default, this will encode information from three columns within the events.tsv.

In order to extract additional information from BIDS dataset within MOABB it would be necessary to ensure that all annotations that end up in the raw and whatever is extracted from the events.tsv are aligned. To allow for this, it would be necessary to have any filtering that happens in L526-602 accessible standalone (without the need to provide a raw).

Proposition:
Simply have the first part of the _handle_events_reading as its own function which returns the filtered version of the onset, description, duration and event_ids. Assuming that these columns would form a primary key within the events.tsv, we could always map additional meta info to the annotations in the raw unambiguously.

Merge checklist

Maintainer, please confirm the following before merging.
If applicable:

All comments are resolved
This is not your own PR
All CIs are happy
PR title starts with [MRG]
whats_new.rst is updated
New contributors have been added to CITATION.cff
PR description includes phrase "closes <#issue-number>"

welcome · 2025-04-03T12:47:39Z

Hello! 👋 Thanks for opening your first pull request here!
Please read the contributor guide, and please follow the steps outlined in the "Instructions for first-time contributors" section therein. ❤️ We will try to get back to you soon. 🚴🏽‍♂️

…wargs

sappelhoff

Hi @matthiasdold to make this function public, it would need to get a more extensive docstr (numpydoc), and preferably also be used in a public example.

You may (or not) also need to add it here in some way: https://github.com/mne-tools/mne-bids/blob/main/mne_bids/__init__.py

There are also some test failures here that I haven't looked into yet.

matthiasdold · 2025-04-04T09:27:53Z

Hi @matthiasdold to make this function public, it would need to get a more extensive docstr (numpydoc), and preferably also be used in a public example.

You may (or not) also need to add it here in some way: https://github.com/mne-tools/mne-bids/blob/main/mne_bids/__init__.py

There are also some test failures here that I haven't looked into yet.

Hi @sappelhoff, thanks for checking this out.
I actually think it could be kept private / _events_file_to_annotation_kwargs function as in the first commit. The reason why I was later putting it to public was that I could not get the docs to build and making it public + introducing it to the __init__.py at least had sphynx find it.

Unfortunately, I still cannot get the docs to build, neither locally nor here on the server. Which is very puzzling to me, as the change really is just separating the existing functions... I will have another try later this afternoon, but any hints would be much appreciated.

Nevermind the above, you comment explains everything actually. I just did not know how Sphinx works properly.

It is working now, but coverage goes down at bit. Would you want me to implement a test for this explicitly?

Also, I had to add the html-noplot back to the Makefile under /doc to build locally, as suggested in the CONTRIBUTING.md.

codecov · 2025-04-04T16:26:19Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 97.49%. Comparing base (5d4320f) to head (6559e55).
Report is 3 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1389      +/-   ##
==========================================
+ Coverage   97.48%   97.49%   +0.01%     
==========================================
  Files          40       40              
  Lines        9023     9060      +37     
==========================================
+ Hits         8796     8833      +37     
  Misses        227      227

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

sappelhoff

Yes, it would be great if we could cover any differences introduced via this PR.

So you ended up leaving the function private 🤔 I think that if you want to use this in another library, it may be worth making it public. For that you could take one example function in mne-bids that is public, and see where it has to be declared, and then do the same for your function.

Otherwise, you always take the risk that is inherent in using private functions of another software package: backward incompatible changes may come at any time and without notice 🤔

If you decide that you do NOT want to make the function public and that you DO want to take the risk of using a private function, then you would at least have to adjust your changelog entry, as we usually do not talk about changes to private functions so explicitly. You could write something a bit more generic, like "refactoring of events filtering", under code health or so.

WDYT @drammock @hoechenberger ?

drammock · 2025-04-07T22:42:17Z

WDYT @drammock

I wasn't super familiar with the internals of what MNE-BIDS did in terms of reading events.tsv prior to looking at this PR. My feeling (from a quick skim and quick think) is that:

there should be a public func that reads events.tsv files.
I'm conflicted about whether it should return an actual annotations object, a dict suitable for passing to mne.Annotations, or a dataframe. I guess the Annotations and the DataFrame can be quite easily constructed from the dict, so I guess the dict is most general/flexible.
for users who want an events array, they can use mne.events_from_annotations and we should mention that in the docstring.
The changelog should only mention this new public function; users won't care / don't need to know that it was part of a private function that was refactored into a separate thing.

matthiasdold · 2025-04-08T05:18:00Z

2. I'm conflicted about whether it should return an actual annotations object, a dict suitable for passing to `mne.Annotations`, or a dataframe. I guess the Annotations and the DataFrame can be quite easily constructed from the dict, so I guess the dict is most general/flexible.

Operating on a (pandas) dataframe would also lead to a more readable code, IMHO, as we can use standard filtering instead of _drop. Although just a subset of the functionality, the dataframe is what I am using in the meantime for moabb.

I didn't go the dataframe route as I wanted to be minimally invasive. Happy to provide a dataframe version as well.

for more information, see https://pre-commit.ci

sappelhoff · 2025-04-08T07:37:19Z

Thanks for your view, Dan.

I agree with your points 1, 3 and 4 -- and regarding point 2, I think returning a dict is the way to go. We do not have pandas as a required dependency of mne-bids, and I don't think the current "feature" is enough to warrant it.

PierreGtch · 2025-04-09T08:48:23Z

@matthiasdold I think you need double backquotes for code samples. This might be why your doc build fails
https://www.sphinx-doc.org/en/master/usage/restructuredtext/basics.html

for more information, see https://pre-commit.ci

matthiasdold · 2025-04-09T09:12:16Z

@matthiasdold I think you need double backquotes for code samples. This might be why your doc build fails https://www.sphinx-doc.org/en/master/usage/restructuredtext/basics.html

Thx for the suggestion @PierreGtch - unfortunately this seems not to be the issue. As for the :func: part, all other lines only had single backquotes, so I would not expect this to be a porblem there (the error message states that mne_bids.events_file_to_annotations_kwargs cannot be found, however the import works for the pytest... )

I am a bit clueless here...

PierreGtch · 2025-04-09T09:46:50Z

@matthiasdold your issue is not anymore with building the doc but before, when the ci is trying to pull the code
It is probably unrelated to you

doc/whats_new.rst

sappelhoff · 2025-04-10T16:00:14Z

yes, something is fishy with circleci. I just opened #1391 with tiny changes to see if I get a circleci issue there as well. If I don't, then maybe the failures ARE related to the PR here, but it looks almost like they aren't.

drammock · 2025-04-10T16:17:35Z

something is fishy with circleci.

the error looks totally related to me:
https://app.circleci.com/pipelines/github/mne-tools/mne-bids/6995/workflows/6d22be6e-fce2-43b0-a5d2-a0712a9fd896/jobs/9247?invite=true#step-108-0_142

/home/circleci/project/doc/whats_new.rst:43: WARNING: py:func reference target not found: mne_bids.events_file_to_annotation_kwargs [ref.func]

same error here:
https://github.com/mne-tools/mne-bids/actions/runs/14379592851/job/40319969519?pr=1389#step:6:257

/home/runner/work/mne-bids/mne-bids/doc/whats_new.rst:43: WARNING: py:func reference target not found: mne_bids.events_file_to_annotation_kwargs [ref.func]

drammock · 2025-04-10T16:22:13Z

add the new public function name to doc/api.rst

sappelhoff · 2025-04-10T18:35:02Z

the error looks totally related to me:

I was talking about this one:

https://app.circleci.com/pipelines/github/mne-tools/mne-bids/6993/workflows/1783535a-8a54-4b94-ac2b-ace4331f6e3f/jobs/9245

sappelhoff · 2025-04-10T18:35:26Z

yet #1391 is unaffected, so it must be something about this PR.

sappelhoff · 2025-04-10T18:37:03Z

ok fine, the errors that circleci now provides all seem meaningful and related :-) no idea what I was after in #1389 (comment)

@matthiasdold this should be possible for you to address. I am +1 for merging once everything is green. thanks a lot!

drammock · 2025-04-10T19:01:27Z

mne_bids/read.py

+    Notes
+    -----
+    The function handles the following cases:
+    - If the `trial_type` column is available, it uses it for event descriptions.


add blank line before bullet list starts

mne_bids/read.py

drammock · 2025-04-10T19:06:04Z

here's the rendered docstring:
https://output.circle-artifacts.com/output/job/1e8c4f31-cedf-4fcd-a9ef-94de47d8437b/artifacts/0/dev/generated/mne_bids.events_file_to_annotation_kwargs.html

see how the notes section bullet list isn't formatted right, and the formatting in the Returns section is inconsistent.

matthiasdold · 2025-04-10T20:07:02Z

Thanks a lot @drammock for the detailed explanation! I should get it sorted out now. My first time working with circleci (as you of course must have guessed), so I was not aware that I could look at the artifacts. Thanks for your patients @drammock and @sappelhoff 😅

drammock

LGTM, thanks for the thorough test. Left a few nitpick comments that you can feel free to ignore or implement.

mne_bids/tests/test_read.py

drammock · 2025-04-10T21:38:05Z

mne_bids/tests/test_read.py

+    assert (ev_kwargs_filtered["onset"] == dext_f["onset"].astype(float).values).all()
+    assert (
+        ev_kwargs_filtered["duration"]
+        == dext_f["duration"].replace("n/a", "0.0").astype(float).values
+    ).all()
+    assert (ev_kwargs_filtered["description"] == dext_f["trial_type"].values).all()
+    assert (
+        ev_kwargs_filtered["duration"][0] == 0.0
+    )  # now idx=0, as first row is filtered out


idem. Could also consider pd.testing.assert_frame_equal but that would require converting the dict to dataframe first.

As we are using a dict for the return type, I would just stick to comparing the iterables here. Implemented all other suggestions though.

mne_bids/tests/test_read.py

for more information, see https://pre-commit.ci

matthiasdold · 2025-04-16T06:48:52Z

Once mne-tools/mne-python#13213 is implemented in mne, we could also add the additional metadata to the Annotations object. This could be part of this PR or a separate one. Let me know what you prefer @drammock and @sappelhoff

sappelhoff

Thanks @matthiasdold!

welcome · 2025-04-16T08:00:22Z

🎉 Congrats on merging your first pull request! 🥳 Looking forward to seeing more from you in the future! 💪

sappelhoff · 2025-04-16T08:00:34Z

This could be part of this PR or a separate one.

a separate one, please! :)

matthiasdold added 2 commits April 3, 2025 14:12

refactoring to get events_file info standalone

c2d078a

shortened docstring

61fd6bb

added authors and whats_new

c3f3317

matthiasdold mentioned this pull request Apr 3, 2025

Additional metadata from BIDS events.tsv NeuroTechX/moabb#744

Open

matthiasdold added 2 commits April 3, 2025 15:36

remove _ for private naming convention of events_file_to_annotation_k…

8428c75

…wargs

corrected _ in events_file_to_annotation_kwargs

f17ef79

sappelhoff reviewed Apr 4, 2025

View reviewed changes

matthiasdold added 3 commits April 4, 2025 18:01

added html-noplot directive to check local doc builds without plots

44a885e

back to private naming convention

49fa312

removing func handles as modified are private functions

2942010

sappelhoff reviewed Apr 4, 2025

View reviewed changes

matthiasdold added 2 commits April 7, 2025 17:32

added unit tests for events_file_to_annotation_kwargs

f98ac2b

added example to docstring

2dd91cc

matthiasdold force-pushed the standalone_events branch from 680b8e1 to 2dd91cc Compare April 7, 2025 15:55

docstring for pytest function

8868355

matthiasdold force-pushed the standalone_events branch from fdac9c1 to 8868355 Compare April 7, 2025 15:59

updated whats_new and keeping the Makefile changes local

4fdf132

matthiasdold force-pushed the standalone_events branch from e94019c to 4fdf132 Compare April 8, 2025 05:30

pre-commit-ci bot and others added 2 commits April 8, 2025 05:30

[pre-commit.ci] auto fixes from pre-commit.com hooks

d7f92da

for more information, see https://pre-commit.ci

exposing events_file_to_annotation_kwargs at mne_bids level

10f634c

trying double backticks

4e6b554

matthiasdold force-pushed the standalone_events branch from eda7182 to 4e6b554 Compare April 9, 2025 08:55

[pre-commit.ci] auto fixes from pre-commit.com hooks

02d26b2

for more information, see https://pre-commit.ci

PierreGtch mentioned this pull request Apr 9, 2025

Add metadata to Annotations mne-tools/mne-python#13199

Open

sappelhoff reviewed Apr 10, 2025

View reviewed changes

doc/whats_new.rst Show resolved Hide resolved

Update doc/whats_new.rst

cce5558

sappelhoff mentioned this pull request Apr 10, 2025

more op to path conversions #1391

Merged

7 tasks

added events_file_to_anntation_kwargs to doc/api.rst

9c75050

drammock reviewed Apr 10, 2025

View reviewed changes

mne_bids/read.py Show resolved Hide resolved

fixing docstring

b62ae70

drammock approved these changes Apr 10, 2025

View reviewed changes

PierreGtch mentioned this pull request Apr 15, 2025

Annotations metadata mne-tools/mne-python#13213

Closed

matthiasdold and others added 2 commits April 16, 2025 08:37

pytest adjustments

a4ce5e5

[pre-commit.ci] auto fixes from pre-commit.com hooks

6559e55

for more information, see https://pre-commit.ci

sappelhoff approved these changes Apr 16, 2025

View reviewed changes

sappelhoff merged commit 486597a into mne-tools:main Apr 16, 2025
25 checks passed

Refactor _handle_events_reading to allow extracting annotation information stand-alone #1389

Refactor _handle_events_reading to allow extracting annotation information stand-alone #1389

Uh oh!

Conversation

matthiasdold commented Apr 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Description

Merge checklist

Uh oh!

welcome bot commented Apr 3, 2025

Uh oh!

sappelhoff left a comment

Choose a reason for hiding this comment

Uh oh!

matthiasdold commented Apr 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Apr 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

sappelhoff left a comment

Choose a reason for hiding this comment

Uh oh!

drammock commented Apr 7, 2025

Uh oh!

matthiasdold commented Apr 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sappelhoff commented Apr 8, 2025

Uh oh!

PierreGtch commented Apr 9, 2025

Uh oh!

matthiasdold commented Apr 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

PierreGtch commented Apr 9, 2025

Uh oh!

Uh oh!

sappelhoff commented Apr 10, 2025

Uh oh!

drammock commented Apr 10, 2025

Uh oh!

drammock commented Apr 10, 2025

Uh oh!

sappelhoff commented Apr 10, 2025

Uh oh!

sappelhoff commented Apr 10, 2025

Uh oh!

sappelhoff commented Apr 10, 2025

Uh oh!

drammock Apr 10, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

drammock commented Apr 10, 2025

Uh oh!

matthiasdold commented Apr 10, 2025

Uh oh!

drammock left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

drammock Apr 10, 2025

Choose a reason for hiding this comment

Uh oh!

matthiasdold Apr 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

matthiasdold commented Apr 16, 2025

Uh oh!

sappelhoff left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

welcome bot commented Apr 16, 2025

Uh oh!

matthiasdold commented Apr 3, 2025 •

edited

Loading

matthiasdold commented Apr 4, 2025 •

edited

Loading

codecov bot commented Apr 4, 2025 •

edited

Loading

matthiasdold commented Apr 8, 2025 •

edited

Loading

matthiasdold commented Apr 9, 2025 •

edited

Loading

matthiasdold Apr 16, 2025 •

edited

Loading