Skip to content

Diagnostic for comparing distributions #3

@jgonzalezab

Description

@jgonzalezab

For precipitation, it is important to compare distributions, particularly to assess the tails (extreme precipitation). We could include several diagnostics designed for this purpose (e.g., skill score [1] or Wasserstein distance [2]). However, one of the main concerns is how sensitive these diagnostics are to the parts of the distribution we are most interested in (extremes).

An alternative approach is to directly plot the frequency distributions using a log vertical axis (e.g., Fig. 3a [3] or Fig. 5 [4]). The main drawback is that, given the number of models being compared, such a plot could become cluttered. This raises the question: how can we display all these models in a single plot? Or, alternatively, how could we split them into multiple plots? The latter would require some classification of the models being intercompared, which is not straightforward.

Another option would be to implement one or more of the aforementioned diagnostics and examine how well their results correspond to what we observe in the plots. If the diagnostics are consistent with what we see in the plots, we could rely on them, otherwise, we may need to show the findings directly in the plots.

Refs

[1] https://journals.ametsoc.org/view/journals/clim/20/17/jcli4253.1.xml
[2] https://docs.scipy.org/doc/scipy-1.11.4/reference/generated/scipy.stats.wasserstein_distance.html
[3] https://arxiv.org/abs/2407.14158
[4] https://journals.ametsoc.org/view/journals/aies/4/4/AIES-D-24-0121.1.xml

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions