feat: Add check_time_intervals_duration to verify TimeIntervals table duration and include unit tests.
#635
base: dev
Conversation
…le duration and include unit tests.
for more information, see https://pre-commit.ci
…st practices for tables
Codecov Report ✅ All modified and coverable lines are covered by tests.

```
@@            Coverage Diff            @@
##              dev     #635     +/-  ##
=========================================
+ Coverage   73.03%   76.95%   +3.92%
=========================================
  Files          47       47
  Lines        1587     1610     +23
=========================================
+ Hits         1159     1239     +80
+ Misses        428      371     -57
```
h-mayorquin left a comment:
Some questions.
```python
    return None


@register_check(importance=Importance.CRITICAL, neurodata_type=TimeIntervals)
```
Should this be a critical error? We are basically saying that they won't ever be such cases. Seems too strong to me.
We should have a way of handling "things that are usually errors but there is a small probability that the user knows what they are doing and they can move forward"
The solution could be a non-trivial barrier to enabling this, such as an environment variable that skips this error (a CLI argument would be hard to propagate to DANDI), or something along those lines.
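One possible shape for such an escape hatch is a comma-separated opt-out list in an environment variable. This is only a sketch of the idea; the variable name `NWBINSPECTOR_SKIP_CRITICAL_CHECKS` and the helper below are hypothetical, not part of the inspector's API:

```python
import os


def critical_check_is_skipped(check_name: str) -> bool:
    """Return True if the user has explicitly opted out of a critical check.

    Hypothetical convention: a comma-separated list of check names, e.g.
    NWBINSPECTOR_SKIP_CRITICAL_CHECKS="check_time_intervals_duration".
    """
    skipped = os.environ.get("NWBINSPECTOR_SKIP_CRITICAL_CHECKS", "")
    return check_name in {name.strip() for name in skipped.split(",") if name.strip()}
```

Because reading an environment variable requires no CLI plumbing, the same opt-out would work unchanged when the inspector runs inside DANDI.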
```python
        end_times.append(float(time_intervals["stop_time"][-1]))

    # Check for other time columns
    for column_name in time_intervals.colnames:
```
I think that all the values in the time columns should be smaller than `max(time_intervals["stop_time"].data)`, right? Maybe that could be a check on its own.
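A standalone version of that idea might look like the sketch below. To keep it self-contained, a plain dict of lists stands in for a `TimeIntervals` table, and the `_time` suffix convention for identifying time columns is an assumption of this sketch:

```python
def time_columns_exceeding_max_stop_time(columns: dict[str, list[float]]) -> list[str]:
    """Return names of time columns containing values larger than max(stop_time).

    Sketch only: `columns` is a plain dict standing in for a TimeIntervals
    table, and columns are treated as time columns if their name ends in
    "_time" (an assumed convention).
    """
    stop_times = columns.get("stop_time", [])
    if not stop_times:
        return []
    max_stop = max(stop_times)
    offending = []
    for name, values in columns.items():
        if name.endswith("_time") and name != "stop_time":
            if any(value > max_stop for value in values):
                offending.append(name)
    return offending
```

A real check would wrap this in `@register_check` and return an `InspectorMessage` naming the offending columns.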
```python
    end_times = []

    # Check for start_time and stop_time columns
    if "start_time" in time_intervals.colnames and len(time_intervals["start_time"]) > 0:
```
Here and with the other time columns we are assuming that the time columns are already well-ordered. Is there a way of running checks in order? Maybe this check does not make sense if the other fails and the output will confuse rather than clarify if the ascending order check fails.
Yes, this check requires that the times are in order. If they are not, this check may fail to raise a message. I suppose we could just read all the data. That's probably fine in most cases -- I doubt we'll come across many datasets where these columns are large. Do you think it's better to just read all of the time arrays?
It is definitely safer.
If we want to be more efficient maybe we can just combine these two checks:
nwbinspector/src/nwbinspector/checks/_tables.py
Lines 52 to 80 in d8382c9
```python
@register_check(importance=Importance.BEST_PRACTICE_VIOLATION, neurodata_type=TimeIntervals)
def check_time_interval_time_columns(
    time_intervals: TimeIntervals, nelems: Optional[int] = NELEMS
) -> Optional[InspectorMessage]:
    """
    Check that time columns are in ascending order.

    Parameters
    ----------
    time_intervals: TimeIntervals
    nelems: int, optional
        Only check the first {nelems} elements. This is useful in case the columns are
        very long so you don't need to load the entire array into memory. Use None to
        load the entire arrays.
    """
    unsorted_cols = []
    for column in time_intervals.columns:
        if column.name == "start_time":
            if not is_ascending_series(column.data, nelems):
                unsorted_cols.append(column.name)
    if unsorted_cols:
        return InspectorMessage(
            message=(
                f"{unsorted_cols} are time columns but the values are not in ascending order. "
                "All times should be in seconds with respect to the session start time."
            )
        )
    return None
```
To avoid reading them twice.
I think either way is fine.
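A single-pass combination of the two checks could look roughly like this. Again a sketch: it walks a plain list of floats rather than a real `TimeIntervals` column, and the function name and return shape are illustrative, not the inspector's API:

```python
def scan_time_column(values: list[float], max_stop: float) -> dict[str, bool]:
    """Walk a time column once, recording both whether it is ascending and
    whether every value stays at or below max_stop.

    Sketch only: real checks operate on TimeIntervals columns and return
    InspectorMessage objects rather than a dict of flags.
    """
    ascending = True
    within_bound = True
    previous = float("-inf")
    for value in values:
        if value < previous:
            ascending = False
        if value > max_stop:
            within_bound = False
        previous = value
    return {"ascending": ascending, "within_bound": within_bound}
```

Reading each column once and deriving both messages from the result would avoid loading the same data twice.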
a more modular PR for #628