Add mixin class for data types with un-named (sub)group properties, e.g. ProcessingModule #705

ehennestad · 2025-05-01T08:48:08Z

Motivation

How to test the behavior?

processingModule = types.core.ProcessingModule();
processingModule.add('MyTimeSeries', types.core.TimeSeries);
processingModule.MyTimeSeries

ans = 

  TimeSeries with properties:

     starting_time_unit: 'seconds'
    timestamps_interval: 1
        timestamps_unit: 'seconds'
                   data: []
              data_unit: ''
               comments: 'no comments'
                control: []
    control_description: ''
        data_continuity: ''
        data_conversion: 1
            data_offset: 0
        data_resolution: -1
            description: 'no description'
          starting_time: []
     starting_time_rate: []
             timestamps: []

Cases to write unit tests for:

Edge cases mentioned in issue

Adding data types to a container type

A data type is added to a container, i.e. aProcessingModule.add('namedDataType'):
- Is the data type added to the underlying types.untyped.Set?
- Is the data type added as a dynamic property to the container type (i.e. a ProcessingModule) on nwbRead?

Removing data types from a container type

Test that data object is removed from underlying subgroup (Set object) when using remove on the parent data object
Test that data object is removed from the parent data object if the object is removed from the subgroup (Set object)

Checklist

Have you ensured the PR description clearly describes the problem and solutions?
Have you checked to ensure that there aren't other open or previously closed Pull Requests for the same change?
If this PR fixes an issue, is the first line of the PR description fix #XX where XX is the issue number?


codecov · 2025-05-01T19:00:51Z

Codecov Report

Attention: Patch coverage is 97.76952% with 6 lines in your changes missing coverage. Please review.

Project coverage is 94.85%. Comparing base (16862b7) to head (bba8564).

Files with missing lines	Patch %	Lines
+matnwb/+mixin/HasUnnamedGroups.m	97.39%	5 Missing ⚠️
+types/+untyped/Anon.m	0.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #705      +/-   ##
==========================================
- Coverage   95.05%   94.85%   -0.21%     
==========================================
  Files         161      163       +2     
  Lines        5868     6124     +256     
==========================================
+ Hits         5578     5809     +231     
- Misses        290      315      +25


ehennestad · 2025-05-03T10:46:41Z

This is currently possible in MatNWB, but it creates an invalid group in the NWB file, where two different data type objects are merged into the same group:

nwbFile = tests.factory.NWBFile();
module = types.core.ProcessingModule();

nwbFile.processing.set('test', module);

T = struct2table(struct('name', {'a', 'b'}, 'value', {1, 2}));
dynamicTable = util.table2nwb(T);

nwbFile.processing.get('test').description = 'a test';
nwbFile.processing.get('test').nwbdatainterface.set('test', tests.factory.TimeSeriesWithTimestamps)
nwbFile.processing.get('test').dynamictable.set('test', dynamicTable)

nwbExport( nwbFile, 'test_same_name_in_different_subgroups.nwb')

Easy to prevent for this new syntax, but not for the legacy syntax.
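
One way to picture the check that the new syntax makes possible: before adding an object, look for its name in every un-named subgroup of the module. The sketch below is only an illustration that uses containers.Map objects as stand-ins for the subgroups; the variable names and the nameExists helper shown here are hypothetical and are not taken from the PR's implementation.

```matlab
% Hypothetical stand-ins for the un-named subgroups of a ProcessingModule.
nwbdatainterface = containers.Map();
dynamictable = containers.Map();
nwbdatainterface('test') = 'placeholder for a TimeSeries';

subgroups = struct( ...
    'nwbdatainterface', nwbdatainterface, ...
    'dynamictable', dynamictable);

% True if any subgroup already holds an object with the given name.
nameExists = @(groups, name) any(structfun(@(m) isKey(m, name), groups));

if nameExists(subgroups, 'test')
    disp('Name "test" is already used in a subgroup; not adding it again.')
else
    dynamictable('test') = 'placeholder for a DynamicTable';
end
```

With a check like this, the second object is never stored under the clashing name, so the two subgroups cannot be merged into one group on export. The legacy syntax writes to each Set directly, which is why the same check is harder to enforce there.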


ehennestad · 2025-05-05T14:27:39Z

+file/fillClass.m

renamed depnm to superclassNames

add matnwb.mixin.HasUnnamedGroups as superclass to data types with un-named subgroups

ehennestad · 2025-05-05T14:28:21Z

+schemes/+utility/listGeneratedTypes.m

Convenience method for listing class names (or short names) of all generated types

bendichter · 2025-05-05T14:42:47Z

Do we need all of this renaming logic? What if someone tries to add two objects, Time-Series and Time_Series?

Why can't we just change module.datainterfaces.get("Foo") to module.get("Foo") or module("Foo")? Then we don't need to mess with any of this name changing stuff. We can expose module.Foo if the name allows it, and if not then revert to the other syntax.

ehennestad · 2025-05-05T21:19:25Z

> Do we need all of this renaming logic? What if someone tries to add two objects, Time-Series and Time_Series?

That will look like this:

module = types.core.ProcessingModule('description', 'a module with two timeseries');
module.add('time_series', types.core.TimeSeries)
module.add('time-series', types.core.TimeSeries)

>> module

module = 

Warning: The following named elements of "ProcessingModule" are remapped to have valid MATLAB names, but will be written to file with
their actual names:
       ValidName        ActualName
    _______________    _____________

    "time_series_1"    "time-series"
 
  ProcessingModule with properties:

      description: 'a module with two timeseries'
    time_series_1: [1×1 types.core.TimeSeries]
      time_series: [1×1 types.core.TimeSeries]

This is actually quite similar to how MATLAB handles it in, e.g., readtable.

Given a test.csv file:

time-series	time_series
1	2
3	4

>> readtable('test.csv')
Warning: Column headers from the file were modified to make them valid MATLAB identifiers before creating variable names for the
table. The original column headers are saved in the VariableDescriptions property.
Set 'VariableNamingRule' to 'preserve' to use the original column headers as table variable names. 

ans =

  2×2 table

    time_series    time_series_1
    ___________    _____________

         1               2      
         3               4

> Why can't we just change module.datainterfaces.get("Foo") to module.get("Foo") or module("Foo")? Then we don't need to mess with any of this name changing stuff. We can expose module.Foo if the name allows it, and if not then revert to the other syntax.

First, I would like to avoid using module.get because it's not a very common MATLAB syntax, and it's difficult for a user to know when to use module.get and when to use standard dot-indexing, e.g. module.description.

Implementing parenthesis indexing is quite straightforward with the matlab.mixin.indexing.RedefinesParen class, but it is only supported from R2021b. It is possible to override subsref for compatibility with older releases, but I would really not recommend doing that.

Also, I think having one syntax for "valid" names and another syntax for other names would be confusing.

In summary, the renaming logic would provide full support for reading files from pynwb where names contain spaces or other symbols that are not supported in MATLAB. It would also take care of edge cases like the one in your example, and it would be easier to use in general, as there would only be one type of indexing syntax to remember.
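
To make the remapping concrete, here is a minimal sketch that reproduces the alias table shown above with MATLAB's built-in matlab.lang.makeValidName and matlab.lang.makeUniqueStrings. It assumes a two-step strategy (sanitize, then de-duplicate) and is not necessarily how the mixin implements it; the variable names are illustrative only.

```matlab
% Names in the order they were added to the module.
actualNames = {'time_series', 'time-series'};

% Step 1: replace characters that are invalid in MATLAB identifiers.
validNames = matlab.lang.makeValidName(actualNames);

% Step 2: resolve collisions by appending a numeric suffix.
aliases = matlab.lang.makeUniqueStrings(validNames);

% Show the mapping; the names written to file would stay unchanged.
disp(table(string(aliases(:)), string(actualNames(:)), ...
    'VariableNames', {'ValidName', 'ActualName'}))
```

Under these assumptions, 'time-series' ends up with the alias time_series_1 while 'time_series' keeps its own name, matching the display above.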

bendichter · 2025-05-05T21:50:50Z

You're not warning on the add command? I'm really not into magically renaming. Now the order in which the data elements are added to the file object matters and the user needs to learn the renaming rule. I think module.get('my-name') is so much simpler and sufficiently solves the problem without getting into tricky territory. I understand it's a bit unusual syntax but I've just been in too many situations where renaming causes more confusion than it is worth.
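
The order dependence can be illustrated with a short sketch. It assumes aliases are derived with the built-in matlab.lang.makeValidName and matlab.lang.makeUniqueStrings functions, which may differ from the rule the PR actually uses; toAlias and the other variable names are hypothetical.

```matlab
% Hypothetical alias rule: sanitize names, then de-duplicate in insertion order.
toAlias = @(names) matlab.lang.makeUniqueStrings(matlab.lang.makeValidName(names));

orderA = {'time_series', 'time-series'};   % underscore variant added first
orderB = {'time-series', 'time_series'};   % hyphen variant added first

% Map each actual name to the alias it receives under each insertion order.
aliasesA = containers.Map(orderA, toAlias(orderA));
aliasesB = containers.Map(orderB, toAlias(orderB));

disp(aliasesA('time-series'))   % alias when the hyphen variant is added second
disp(aliasesB('time-series'))   % alias when the hyphen variant is added first
```

Under this rule the same object name receives a different alias depending on which element was added first.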

ehennestad · 2025-05-05T22:12:11Z

> You're not warning on the add command?

Good point, there could be an extra warning on the add command.

> I'm really not into magically renaming

It's not really renaming, just creating an alias which is used for the dynamic properties that enables dot-indexing and autocompletion. module.nwbdatainterface.get('my-name') would still work and creating a shortcut as module.get('my-name') would be a quick addition.

How about keeping/providing both modes?

A typical MATLAB user would likely specify names using valid camelCase or snake_case and not encounter the name remapping at all. If they load a file created with Python using a different naming convention, their preferred mode might vary. For example, I prefer to explore new objects in the command window, where the alias names are presented and where I can use them for dot-indexing. Someone who prefers to use the "real" names could do so with module.get('my-name').
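
A rough sketch of how the two modes could coexist: objects stay stored under their actual names, and a separate alias-to-actual-name map serves lookups by alias. The containers.Map objects and variable names below are hypothetical stand-ins, not the mixin's data structures.

```matlab
% Objects stored under their actual (on-file) names.
actualNames = {'time_series', 'time-series'};
objects = {'first TimeSeries placeholder', 'second TimeSeries placeholder'};
items = containers.Map(actualNames, objects);

% Alias map, assuming aliases come from MATLAB's built-in name utilities.
aliases = matlab.lang.makeUniqueStrings(matlab.lang.makeValidName(actualNames));
aliasToActual = containers.Map(aliases, actualNames);

% Mode 1: look up by the real name, as module.get('time-series') would.
viaActualName = items('time-series');

% Mode 2: look up by alias, as dot-indexing with module.time_series_1 would.
viaAlias = items(aliasToActual('time_series_1'));

disp(isequal(viaActualName, viaAlias))
```

Both lookups reach the same stored object, so the alias only affects how the object is addressed in MATLAB, not what is written to file.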

ehennestad · 2025-05-08T14:10:19Z

1. warning on read if name aliases are used
2. .get syntax
3. name mapping should be publicly accessible
4. how-to-guide


ehennestad added 15 commits April 29, 2025 21:23

Create HasGroups.m

078d0ce

Update HasGroups.m

f924435

Try using redefinesParen

Update fillClass.m

05b4f67

Add HasGroup mixin class to classes with anonymous subgroups

Update HasGroups.m

e1ee36e

Fix HasGroups with redefinesDot. This is one option, but only supported from R2021b and later

Create listGeneratedTypes.m

53ac391

Utility method for listing names of generated classes for neurodata types

Update MetaClass.m

95387a6

Add TypeName property as a convenience for getting the short name of a data type

Update HasGroup mixin

6e73808

- Renamed to HasUnnamedGroups
- Use dynamic props instead of overriding indexing
- Add "add" method

Update HasUnnamedGroups.m

abc0363

Use callback functions instead of event listeners. The dependency between container groups and their sets is very specific, and an event/listener system would be too general for this case, i.e. a Set does not generally have to notify about items added/removed.

Update Set.m

0b3363d

- Add callback function properties accessible by matnwb.mixin.HasUnnamedGroups
- Add add method
- Add optional inputs for controlling behavior of set method

Update HasUnnamedGroups.m

3fb2965

Rename and reorder methods for better logical composition

Update matnwb generator

410429b

- Add the HasUnnamedGroups mixin to the relevant data type classes

Update generated classes

95e6cf7

Update HasUnnamedGroups.m

4a8b47a

Add minimal support for contained "types.untyped.Anon" objects and add NotImplemented warnings/errors in cases where these are not supported

Update AnonTest.m

31bc60b

Suppress warning

Refactor fillClass

f830485

Rename variable depnm to superclassNames

ehennestad added 10 commits May 2, 2025 10:19

Start adding name remapping to valid matlab names

fd16470

Add name-mapping strategy for invalid or duplicate names

1f75bf1

Fix bugs related to name remapping and custom display property names

cb79ebd

Update Set.m

ebef8ab

Only invoke callback if the set operation adds a new element, but not for overrides

Add unit test for HasUnnamedGroupsMixin

8263dd1

Add extra tests

9d45779

Merge branch 'main' into add-mixin-for-type-with-group-properties

feeedc4

Update tests, remove unused/redundant method

addb0fc

Merge branch 'add-mixin-for-type-with-group-properties' of https://gi…

0cbec56

…thub.com/NeurodataWithoutBorders/matnwb into add-mixin-for-type-with-group-properties

Minor fixes in comments

2d69d9a

ehennestad added 3 commits May 3, 2025 22:11

Refactor HasUnnamedGroups

5d8a8ba

Improve variable naming. Remove unused code.

Refactor HasUnnamedGroups

e7c4c0b

Simplify some code sections. Create nameExists method to check if a name is already used in subgroup containers.

Merge branch 'main' into add-mixin-for-type-with-group-properties

f661e44

Add new verification in test for Set

160f7b2

ehennestad marked this pull request as ready for review May 4, 2025 19:56

Update HasUnnamedGroupsTest.m

9596254

Add verification to the test that the object is added and removed on the underlying subgroup object when using methods on the parent object

ehennestad requested a review from bendichter May 4, 2025 21:33

Merge branch 'main' into add-mixin-for-type-with-group-properties

b2e27f2

ehennestad changed the title ~~Add mixin class for data types with un-named (sub)group properties, i.e ProcessingModule~~ Add mixin class for data types with un-named (sub)group properties, e.g. ProcessingModule May 5, 2025

ehennestad commented May 5, 2025

View reviewed changes

Merge branch 'main' into add-mixin-for-type-with-group-properties

13d7165

Merge branch 'main' into add-mixin-for-type-with-group-properties

7c231c9

ehennestad added 4 commits May 10, 2025 21:08

Add get methods on the mixin class + unittest

5aa5568

Add function on NwbFile to retrieve remapped names

4c31aab

Add method for listing remapped names in NwbFile

1385098

+ unittest + factory functions for new unittest

Fix wrong property name for type in test

bba8564
