-
Notifications
You must be signed in to change notification settings - Fork 42
Description
Follow-up from: #188 (comment)
The FSS implementation in #266 provides the user with flexibility to calculate skill scores from single 2-D fields. However, accumulating these scores temporally or spatially across other dimensions is not a trivial problem, and may have implications on skill interpretation. See: 1
The current implementation (#266) aggregates the decomposed scores. Which is one approach shown in the reference; and also the default/implied approach in Roberts and Lean (2008):
Although not explicitly stated in Roberts and Lean (2008), Eq. (3) was always the intended way of using the score for multiple forecasts
While this achieves consistency in some situations may have trade-offs compared to simply averaging the scores in other situations. Furthermore, 0 value cases were discussed here: #266 (comment) similar questions/points are also considered in the reference.
The implementation in #266 already offers a single field variant, and in specialised methods the ability to return decomposed scores. Hence, the user can already choose to address any follow up research using those features. However, for convenience it may be useful to provide the choice between aggregation (current implementation) and averaging over other dimensions when accumulating fields.
Further, to address cases where F = O = 0 there are a couple of propositions given in the referenced paper.
Proposed fine grained controls:
- option to choose
averagingoveraggregation. - option to choose edge case methods when computing the FSS score for certain cases e.g. forecast & obs are effectively 0.
- FSS curve (see replies)
- % based thresholding (see replies)
Reference