Fix some `FieldTimeSeries` indexing bugs #4505

loisbaker · 2025-05-13T14:30:48Z

This PR fixes some small indexing bugs for FieldTimeSeries described in #4100

Changes the way that times are compared in

Oceananigans.jl/src/OutputReaders/field_time_series.jl

Line 252 in d1e9b46

if all(time_range .≈ times) # good enough for most

and

Oceananigans.jl/src/OutputReaders/set_field_time_series.jl

Line 10 in d1e9b46

find_time_index(time::Number, file_times) = findfirst(t -> t ≈ time, file_times)

so that `≈` is replaced with a comparison using a vector norm in the first case, and with a comparison using an absolute tolerance based on a characteristic timescale in the second case. This avoids `≈` returning `false` for values close to zero, where the default tolerance is purely relative (the default absolute tolerance of `isapprox` is zero).

Moves the time array that is indexed in

Oceananigans.jl/src/OutputReaders/set_field_time_series.jl

Line 25 in d1e9b46

t = fts.times[n]

to the CPU so errors aren't thrown when fts.times is a CuArray.
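
To illustrate the first comparison change, here is a minimal sketch of why an element-wise `≈` can fail near zero while a norm-based comparison succeeds. The values in `times` are made up for illustration, and calling `isapprox` on the whole arrays is an assumption about what "a comparison using a vector norm" looks like, not necessarily the exact code merged in this PR.

```julia
using LinearAlgebra  # array `isapprox` compares the norm of the difference

# Illustrative saved times: the first entry is a roundoff error away from zero
times = [1e-17, 1.0, 2.0, 3.0]
time_range = range(0.0, 3.0, length=4)

# Element-wise: `0.0 ≈ 1e-17` is false, because the default `atol` is zero
# and the relative tolerance scales with the (tiny) values being compared
elementwise_match = all(time_range .≈ times)

# Norm-based: the difference is measured against the norm of the whole vector
normwise_match = isapprox(time_range, times)

@show elementwise_match normwise_match
```

With the element-wise check, a single near-zero entry is enough to prevent the conversion of `times` to a range, even though the two sets of times agree to roundoff.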

Improves element-wise comparison, which returned False for values near zero due to relative error default.

Modifies `find_time_index` to take an extra timescale parameter `dt`, so that an absolute error can be computed to avoid `isapprox` returning `false` for values near zero
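
A minimal sketch of the idea, assuming a hypothetical helper `find_time_index_sketch` and a tolerance factor of `100 * eps` borrowed from the review suggestion later in this thread; the merged implementation may differ in both respects.

```julia
using Statistics  # for `mean`

# Hypothetical stand-in for `find_time_index`: `dt` is a characteristic
# time increment that sets the scale of the absolute tolerance.
function find_time_index_sketch(time::Number, file_times, dt)
    ϵ = 100 * eps(eltype(file_times))  # assumed tolerance factor
    return findfirst(t -> isapprox(t, time, atol = ϵ * dt), file_times)
end

# Illustrative file times with roundoff in the entry that should be zero
file_times = [1e-17, 0.1, 0.2, 0.3]
dt = mean(diff(file_times))

old_index = findfirst(t -> t ≈ 0.0, file_times)            # relative tolerance only
new_index = find_time_index_sketch(0.0, file_times, dt)    # absolute tolerance

@show old_index new_index
```

In this sketch the purely relative comparison finds no match for `0.0`, while the absolute tolerance scaled by `dt` picks out the first entry.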

navidcy · 2025-05-13T15:24:20Z

src/OutputReaders/set_field_time_series.jl

+    # Compute a timescale for comparisons
+    dt = mean(file_times[2:end] - file_times[1:end-1])

Suggested change

-    dt = mean(file_times[2:end] - file_times[1:end-1])
+    timescale = mean(diff(file_times))

glwagner · 2025-05-14

Hmm, it's really the time increment between time points, so I think Δt is more appropriate. I'd just suggest using Δ rather than dt for readability.

navidcy · 2025-05-14

Or mean_Δt

navidcy · 2025-05-13T15:30:35Z

src/OutputReaders/set_field_time_series.jl

+    # Compute a timescale for comparisons
+    dt = mean(file_times[2:end] - file_times[1:end-1])


why don't we compute this within find_time_index? just a thought...
or something like:

    function find_time_index(time::Number, file_times; time_scale = mean(diff(file_times)))
        ϵ = 100 * eps(eltype(file_times))
        return findfirst(t -> isapprox(t, time, atol=ϵ*time_scale), file_times)
    end

loisbaker · 2025-05-14

Good point. I guess precomputing it avoids recalculation at every iteration of the for loop it's used in? But if that's not an issue, it would be simpler to compute it within the function.

glwagner · 2025-05-14

Precomputing serves two purposes: 1) we don't recompute, which could become important for very long time series, and 2) as a point of design, this allows us to change the estimate of the characteristic time increment later. Right now we use the mean of all time increments, but this is not the only choice one might make.
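
The two purposes of precomputing can be sketched as follows. The helper name `find_time_index_sketch`, the keyword `Δt`, the sample times, and the alternative estimate `minimum(diff(file_times))` are all assumptions made for illustration, not code from this PR.

```julia
using Statistics  # for `mean`

# Hypothetical helper: the caller may pass a precomputed time increment `Δt`
function find_time_index_sketch(time::Number, file_times; Δt = mean(diff(file_times)))
    ϵ = 100 * eps(eltype(file_times))  # assumed tolerance factor
    return findfirst(t -> isapprox(t, time, atol = ϵ * Δt), file_times)
end

file_times = collect(0.0:0.5:100.0)
requested_times = [0.0, 50.0, 100.0]

# 1) Compute the increment once, outside the loop over requested times
Δt = mean(diff(file_times))
indices = [find_time_index_sketch(t, file_times; Δt = Δt) for t in requested_times]

# 2) Swap in a different estimate without touching the lookup function
Δt_min = minimum(diff(file_times))
indices_min = [find_time_index_sketch(t, file_times; Δt = Δt_min) for t in requested_times]

@show indices indices_min
```

Relying on the keyword default instead would recompute `mean(diff(file_times))` on every call, which is the cost the precomputation avoids for long time series.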

glwagner · 2025-05-14T21:58:52Z

@loisbaker should we add tests at all? Ok if that's a todo for the future, but tests will ensure these changes are lasting

loisbaker · 2025-05-15T17:29:52Z

Sure, I can work on that - a test to check that find_time_index will pick the right index when the error between the two compared values is sufficiently small, and doesn't otherwise? Would you also make a test to check that the range conversion in field_time_series.jl only happens when the time values are sufficiently close to evenly spaced?

glwagner · 2025-05-15T17:36:10Z

Perhaps just a simple test that resembles the case you presented that illustrated the bug would work (the more fine-grained tests can be nice too, but I don't think necessary in this case and they are more work).

We can also do that in a future PR, since I think this is otherwise ready to merge -- your call!

loisbaker · 2025-05-15T17:39:38Z

Great, I'll put this on the to-do list for a future PR!

loisbaker added 4 commits May 13, 2025 11:20

Moves indexing of fts.times to CPU

1dd0a04

Compares times range and array using vector norm

44f59f3

Improves element-wise comparison, which returned False for values near zero due to relative error default.

Computes a timescale dt for absolute time comparisons

bc56592

Modifies `find_time_index` to take an extra timescale parameter `dt`, so that an absolute error can be computed to avoid `isapprox` returning `false` for values near zero

Add a space

a2fb18e

navidcy reviewed May 13, 2025

navidcy reviewed May 13, 2025

navidcy added the output 💾 label May 13, 2025

navidcy reviewed May 13, 2025

navidcy requested a review from glwagner May 13, 2025 15:31

navidcy reviewed May 14, 2025

glwagner reviewed May 14, 2025

glwagner reviewed May 14, 2025

loisbaker and others added 3 commits May 15, 2025 17:56

Add comment and change time increment label dt to Δt

47c8d32

Co-authored-by: Gregory L. Wagner <[email protected]>

Comment to explain why times array is converted to a range

792c523

Change dt to Δt

5ed7b52

glwagner approved these changes May 15, 2025

glwagner and others added 3 commits May 15, 2025 11:41

Merge branch 'main' into fix-fts-indexing

2b246a2

Merge branch 'main' into fix-fts-indexing

89a439a

Merge branch 'main' into fix-fts-indexing

8c99bc0

navidcy reviewed May 22, 2025

Apply suggestions from code review

a4ebe04

navidcy merged commit 412776f into CliMA:main May 22, 2025
53 checks passed

simone-silvestri mentioned this pull request May 27, 2025

Fix indexing issues for FieldTimeSeries #4550

Merged
