WIP: Experimental parallel #723


Closed

bruAristimunha wants to merge 10 commits into NeuroTechX:develop from bruAristimunha:experimental_parallel

Collaborator

bruAristimunha commented Mar 10, 2025

No description provided.

bruAristimunha added 7 commits

March 10, 2025 19:55


          starting refactor of the evaluation

2e16478


          testing parallel evaluation..

1218d93


          Merge branch 'develop' into experimental_parallel

40135ba


          improving the output

34f3b0e


          chaging the verbosity

0785de9


          starting to play with refactoring the evaluation...

1a90f69


          trying to flat...

aa88a94

Collaborator Author

bruAristimunha commented Mar 11, 2025

This is so hard >.<

bruAristimunha and others added 3 commits

March 11, 2025 20:14


          updating the splitter

633b8a2


          removing the memory cache

084a144


          draw from pierre's discussion

ea5b2c9

Collaborator

PierreGtch commented Mar 13, 2025

Discussion with @bruAristimunha, action plan for the refactoring:

We will have one shared evaluation method in the base evaluation class which will:
- Load only the metadata
- Split the metadata into different cv folds (calling methods defined in the subclasses)
- Check which cv folds already have a result and can be skipped (except if overwrite=True)
- Then, two options:
  - If lazy_loading=False, load the data of the cv folds that still need to be computed
  - If lazy_loading=True, do nothing
- Launch, in parallel, a shared train_and_evaluate function on all cv folds (if lazy_loading=False, pass the pre-loaded data)
- Gather the results in a dataframe and return it
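
The steps above could be sketched roughly as follows. This is a minimal, standard-library-only illustration and not MOABB's actual API: `split_folds`, `load_data`, `train_and_evaluate`, `evaluate`, and the `done_folds` argument are all hypothetical names, a thread pool stands in for whatever parallel backend is chosen, and a list of row dicts stands in for the dataframe.

```python
from concurrent.futures import ThreadPoolExecutor

# All names below are hypothetical placeholders, not MOABB's real API.


def split_folds(metadata, n_folds=2):
    """Stand-in for the subclass-defined splitter: round-robin over row indices."""
    indices = list(range(len(metadata)))
    folds = []
    for k in range(n_folds):
        test = indices[k::n_folds]
        train = [i for i in indices if i not in test]
        folds.append({"fold": k, "train": train, "test": test})
    return folds


def load_data(metadata):
    """Stand-in for the expensive loading step: one fake epoch per metadata row."""
    return [f"epoch-{row['id']}" for row in metadata]


def train_and_evaluate(fold, metadata, data=None):
    """Placeholder for the shared per-fold function; returns one result row."""
    if data is None:  # lazy_loading=True: the worker loads the data itself
        data = load_data(metadata)
    test_epochs = [data[i] for i in fold["test"]]
    return {"fold": fold["fold"], "n_test": len(test_epochs)}


def evaluate(metadata, done_folds=(), overwrite=False, lazy_loading=False, n_jobs=2):
    """Split, skip finished folds, optionally pre-load, run in parallel, gather."""
    folds = split_folds(metadata)
    # Skip folds that already have a result, unless overwrite is requested.
    todo = [f for f in folds if overwrite or f["fold"] not in done_folds]
    # Simplification: pre-load everything once when not lazy and work remains.
    data = None if lazy_loading or not todo else load_data(metadata)
    with ThreadPoolExecutor(max_workers=n_jobs) as pool:
        # pool.map keeps the results in the same order as the submitted folds.
        rows = list(pool.map(lambda f: train_and_evaluate(f, metadata, data), todo))
    return rows  # in practice these rows would be gathered into a dataframe
```

For example, `evaluate([{"id": i} for i in range(4)], done_folds={0})` would only compute fold 1, while adding `overwrite=True` would recompute both folds.
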

The shared train_and_evaluate method, defined in the base evaluation class:
- (It only handles one cv fold and one pipeline)
- Loads the data:
  - If lazy_loading=False (data+metadata provided), selects only the epochs corresponding to this fold
  - If lazy_loading=True (only metadata provided), loads the data. In this case, we recommend that users activate the BIDS cache mechanism to avoid pre-processing the same data multiple times.
- Trains the pipeline on the cv fold
- Computes the CodeCarbon emissions, training time, etc.
- Groups the test data by session
- For each test session, computes the pipeline's score
- Pushes all the test session scores
- Returns the list of test session scores
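
A per-fold function along these lines might look like the sketch below. It is only an illustration under stated assumptions: the signature, the `loader` argument, the metadata keys (`label`, `session`), and the toy `MajorityClassifier` are hypothetical, accuracy stands in for the paradigm's scoring function, and emissions tracking and the step that pushes results to storage are left out.

```python
import time
from collections import Counter, defaultdict


class MajorityClassifier:
    """Toy stand-in for a pipeline: always predicts the most frequent training label."""

    def fit(self, X, y):
        self.label_ = Counter(y).most_common(1)[0][0]
        return self

    def predict(self, X):
        return [self.label_ for _ in X]


def train_and_evaluate(pipeline, fold, metadata, data=None, loader=None):
    """Handle one cv fold and one pipeline; return one score row per test session."""
    if data is None:  # lazy_loading=True: only metadata was passed in
        data = loader(metadata)

    # Select only the epochs that belong to this fold's training set.
    X_train = [data[i] for i in fold["train"]]
    y_train = [metadata[i]["label"] for i in fold["train"]]

    # Train and record the training time.
    start = time.perf_counter()
    pipeline.fit(X_train, y_train)
    train_time = time.perf_counter() - start

    # Group the test indices by session.
    by_session = defaultdict(list)
    for i in fold["test"]:
        by_session[metadata[i]["session"]].append(i)

    # Score the trained pipeline on each test session separately.
    results = []
    for session, idx in sorted(by_session.items()):
        y_true = [metadata[i]["label"] for i in idx]
        y_pred = pipeline.predict([data[i] for i in idx])
        accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(idx)
        results.append({"session": session, "score": accuracy, "time": train_time})
    return results
```

Returning one row per test session keeps the function independent of how results are stored, so the caller can decide whether to write them to disk or only collect them.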

PierreGtch mentioned this pull request

Add type hints to evaluation classes #732

Open

bruAristimunha closed this

