Some questions about "AsymptoticLimits.cc"

Dear Combine experts,

I really appreciate the scripts that you provide that make limits for so many CMS analysis. But while I read through the codes, I have some questions that I hope can be clarified so that we might reduce misunderstanding from users:

1. May I ask why we set ``rMin`` directly to 0 when we use $\widetilde{q}$? I wonder if MIGRAD will get the gradient calculation wrong if $r$ isn't allowed to get slightly toward negative values, causing biased minimisation. Could we follow the definition in eq. (16) of the [Asymptotic formulae paper](https://arxiv.org/pdf/1007.1727), re-define the NLL if $\widehat{r}$ is either less than 0 or larger than $r$?
https://github.com/cms-analysis/HiggsAnalysis-CombinedLimit/blob/62005b0b00ef5d284e3a4d4732c82d443f9123d2/src/AsymptoticLimits.cc#L161

2. I think it is good that ``rMax`` is set to be 3 times the uncertainty of r (``rErr``). So the ``while`` loop has much smaller chance to become an infinite loop (than the one mentioned in point 8). But I think we should put a warning whenever the ``rMax`` is doubled to let the user know the uncertainty of r increases fast with r, which could be due to incorrect model configuration or large systematics applying to signals.
https://github.com/cms-analysis/HiggsAnalysis-CombinedLimit/blob/62005b0b00ef5d284e3a4d4732c82d443f9123d2/src/AsymptoticLimits.cc#L225

3. I wonder why we do a logarithmic interpolation for the observed limit but bissection method for the expected limit as default? When a user creates an Asimov dataset using "-t -1", the user would expect the observed limit to match the 50% quantile expected limit. But one wouldn't because the limit-searching methods are not same.
https://github.com/cms-analysis/HiggsAnalysis-CombinedLimit/blob/62005b0b00ef5d284e3a4d4732c82d443f9123d2/src/AsymptoticLimits.cc#L247
I also wonder what is the mathematical basis that the r should be lienar to the natural log of CLs. According to equations like (69) in the [Asymptotic formulae paper](https://arxiv.org/pdf/1007.1727), the relation is an inverse CDF of the normal distribution, not natural log.

4. May I ask the mathematical basis fo this 80-20 interpolation ?
https://github.com/cms-analysis/HiggsAnalysis-CombinedLimit/blob/62005b0b00ef5d284e3a4d4732c82d443f9123d2/src/AsymptoticLimits.cc#L249

5. Is it intentional to use the function "minimize" for unconstrained fit:
https://github.com/cms-analysis/HiggsAnalysis-CombinedLimit/blob/62005b0b00ef5d284e3a4d4732c82d443f9123d2/src/AsymptoticLimits.cc#L184
But then we use the function "improve" for the constrained fit"
https://github.com/cms-analysis/HiggsAnalysis-CombinedLimit/blob/62005b0b00ef5d284e3a4d4732c82d443f9123d2/src/AsymptoticLimits.cc#L294

6. I thought qmu can never become negative according to the definition of qmu in the [Asymptotic formulae paper](https://arxiv.org/pdf/1007.1727)'s eq (16).
https://github.com/cms-analysis/HiggsAnalysis-CombinedLimit/blob/62005b0b00ef5d284e3a4d4732c82d443f9123d2/src/AsymptoticLimits.cc#L299
Is this to prevent numerical errors? The setting of qmu to 0 when $\widehat{r}>r$ is done here already:
https://github.com/cms-analysis/HiggsAnalysis-CombinedLimit/blob/62005b0b00ef5d284e3a4d4732c82d443f9123d2/src/AsymptoticLimits.cc#L304

7. According to page 20 of [Minuit documentation](https://root.cern.ch/download/minuit.pdf), the error level (the value to 1 $\sigma$) affects the precision of the minimiser: estimated distance to minimum (EDM) requirement is 0.001*[tolerance = 0.1]*[error-level]. The [RooFit default](https://root.cern.ch/doc/master/RooAbsReal_8h_source.html#l00250) value is 1. But in the following snippet, it changed it to 1.92
https://github.com/cms-analysis/HiggsAnalysis-CombinedLimit/blob/62005b0b00ef5d284e3a4d4732c82d443f9123d2/src/AsymptoticLimits.cc#L403
So the precision becomes worse by this change. But in the following lines, it used this less precise fit to determine whether the Asimov dataset is well-behaved.
https://github.com/cms-analysis/HiggsAnalysis-CombinedLimit/blob/62005b0b00ef5d284e3a4d4732c82d443f9123d2/src/AsymptoticLimits.cc#L415
What's stranger is that the reference is 0.001 times rMax, which is rather arbitrary. I suggested we turn off this warning as it might mislead careful users that turn up the verbose level, or we don't change the error-level for higher precision and could use the ratio $\widehat{r}$ over $\sigma_{\text{fit}}(r)$ as a indicator of the fit healthiness.

8. The current default way of calculating expected limit is via the "bisection", which contains a while loop:
https://github.com/cms-analysis/HiggsAnalysis-CombinedLimit/blob/62005b0b00ef5d284e3a4d4732c82d443f9123d2/src/AsymptoticLimits.cc#L517
This implied that user might be stuck in an infinite loop if the rMax and rMin were set too narrow. In contrast, the method "stepping" has a warning when it couldn't find the required NLL shift and won't get stuck in an infinite loop:
https://github.com/cms-analysis/HiggsAnalysis-CombinedLimit/blob/62005b0b00ef5d284e3a4d4732c82d443f9123d2/src/AsymptoticLimits.cc#L557

Best regards,

Kuan
B2G Combine contact

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Some questions about "AsymptoticLimits.cc" #1037

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Some questions about "AsymptoticLimits.cc" #1037

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions