feat: add sum of weights for BTA_ttbar workflow, fix typo in suball script and make basepath in BTA producers configurable #111

philippgadow · 2025-01-20T10:34:20Z

Description

This pull request introduces several improvements and fixes to the BTA workflow and associated scripts and configurations:

Feature Addition:
- Implemented support for calculating the sum of weights in the BTA_ttbar workflow, including variations such as LHE scale weights and PS weights (LHE PDF weights are not considered for the moment).
- Enabled configuration of basepath and output_dir in BTA_producer and BTA_ttbar_producer workflows for greater flexibility in file management.
Bug Fixes:
- Typo Correction: Fixed a typo in the suball script where the variable paser was incorrectly referenced as parser, restoring command-line parsing.
- Use events.PuppiMET.pt everywhere in the BTA_ttbar_producer workflow if the analysis is Run 3.
Configuration Update:
- Modified test_env.yml to replace defaults with nodefaults in the conda channel configuration, removing dependencies on Anaconda, which is not free to use for organisations with more than 200 collaborators.


Ming-Yan

Hi @philippgadow, this all looks good to me! Thanks for the PR.

Are there any further changes you want to include in this PR? If not, we are good to merge :)

Ming-Yan

Sorry, I noticed one change after approving. @philippgadow, could you please have a look at this change?

src/BTVNanoCommissioning/workflows/BTA_ttbar_producer.py


Ming-Yan

Hi @philippgadow, thank you for incorporating the suggestions. I have some minor suggestions for moving things to common selections/functions.

Could you please have a look?

Ming-Yan · 2025-03-12T09:17:28Z

src/BTVNanoCommissioning/workflows/BTA_ttbar_producer.py

            (muons.pt > 20)
            & (abs(muons.eta) < 2.4)
            & muons.tightId  # pass cut-based tight ID
-            & (muons.pfRelIso04_all < 0.12)  # muon isolation cut
+            & (
+                muons.pfRelIso04_all < 0.15
+            )  # muon isolation cut (tight: https://twiki.cern.ch/twiki/bin/viewauth/CMS/SWGuideMuonIdRun2#Particle_Flow_isolation and https://github.com/cms-sw/cmssw/blob/75451d59a7acc30aec874be9a6b9a8835f2f7b3e/PhysicsTools/NanoAOD/python/muons_cff.py#L249)
        ]


Indeed, this seems to match the mu_idiso selection in our common selections. Could you please switch to that mask? The purpose is to keep the selections in sync :)

Ming-Yan · 2025-03-12T09:20:58Z

test_env.yml

@@ -1,7 +1,7 @@
 name: btv_coffea
 channels:
  - conda-forge
-  - defaults
+  - nodefaults


I think we can remove this :)

Ming-Yan · 2025-03-12T09:23:25Z

src/BTVNanoCommissioning/workflows/BTA_ttbar_producer.py

+
+    def transfer_file(self, local_outfile_path, outfile_path):
+        transfer_command = f"xrdcp -p --silent {local_outfile_path} {outfile_path}"
+        result = os.system(transfer_command)
+        # Check if xrdcp failed
+        if result != 0:
+            print("xrdcp failed, attempting to transfer with gfal-copy")
+            transfer_command = (
+                f"gfal-copy -p -f -t 4200 {local_outfile_path} {outfile_path}"
+            )
+            result = os.system(transfer_command)
+            if result == 0:
+                print("File transferred successfully with gfal-copy")
+            else:
+                print("gfal-copy also failed")
+        else:
+            print("File transferred successfully with xrdcp")
+        if result == 0:
+            os.system(f"rm {local_outfile_path}")
+        else:
+            print("File transfer failed, need to transfer manually")
+            # append file path to a list for manual transfer which is stored in output_dir
+            with open(f"{self.output_dir}/manual_transfer.txt", "a") as f:
+                f.write(f"{transfer_command}\n")


Can we move this to a common function so that other workflows can use it if needed?
https://github.com/cms-btv-pog/BTVNanoCommissioning/blob/master/src/BTVNanoCommissioning/helpers/func.py

I would also suggest adding some documentation to the optional changes section of the docs :)
https://btvnanocommissioning.readthedocs.io/en/latest/developer.html#optional-changes
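For illustration, the transfer logic from the diff could be factored into a standalone helper roughly as follows. This is only a sketch: the function name `transfer_file`, the `fallback_log` argument, and the injectable `run` callable are assumptions made for this example, not existing code in `helpers/func.py`. The command-line flags are copied from the diff above.

```python
import shlex
import subprocess
from pathlib import Path


def _run(command):
    """Run a command and return its exit code (127 if the tool is missing)."""
    try:
        return subprocess.call(command)
    except FileNotFoundError:
        return 127


def transfer_file(local_path, remote_path, fallback_log, run=_run):
    """Copy local_path to remote_path, trying xrdcp first, then gfal-copy.

    Returns True on success. `run` is injectable (an assumption of this
    sketch) so the logic can be exercised without the grid tools installed.
    """
    commands = [
        ["xrdcp", "-p", "--silent", str(local_path), str(remote_path)],
        ["gfal-copy", "-p", "-f", "-t", "4200", str(local_path), str(remote_path)],
    ]
    for command in commands:
        if run(command) == 0:
            # Remove the local copy only after a successful transfer.
            Path(local_path).unlink(missing_ok=True)
            return True
    # Both tools failed: record the last command for a manual retry.
    with open(fallback_log, "a") as handle:
        handle.write(shlex.join(commands[-1]) + "\n")
    return False
```

A workflow could then call `transfer_file(local_outfile_path, outfile_path, f"{self.output_dir}/manual_transfer.txt")`. Passing argument lists instead of a shell string avoids quoting problems with unusual file names.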

Ming-Yan · 2025-03-12T10:04:42Z

src/BTVNanoCommissioning/workflows/BTA_ttbar_producer.py

+                        "total_lhe_scaleweights_1": (
+                            ak.Array(
+                                [ak.sum(lhe_scale_w_arrays[:, 1] * events.genWeight)]
+                            )
+                            if lhe_pdf_w_arrays is not None
+                            and number_lhe_scaleweights > 1
+                            else ak.Array([0.0])
+                        ),
+                        "total_lhe_scaleweights_2": (
+                            ak.Array(
+                                [ak.sum(lhe_scale_w_arrays[:, 2] * events.genWeight)]
+                            )
+                            if lhe_pdf_w_arrays is not None
+                            and number_lhe_scaleweights > 2
+                            else ak.Array([0.0])
+                        ),
+                        "total_lhe_scaleweights_3": (
+                            ak.Array(
+                                [ak.sum(lhe_scale_w_arrays[:, 3] * events.genWeight)]
+                            )
+                            if lhe_pdf_w_arrays is not None
+                            and number_lhe_scaleweights > 3
+                            else ak.Array([0.0])
+                        ),
+                        "total_lhe_scaleweights_4": (
+                            ak.Array(
+                                [ak.sum(lhe_scale_w_arrays[:, 4] * events.genWeight)]
+                            )
+                            if lhe_pdf_w_arrays is not None
+                            and number_lhe_scaleweights > 4
+                            else ak.Array([0.0])
+                        ),
+                        "total_lhe_scaleweights_5": (
+                            ak.Array(
+                                [ak.sum(lhe_scale_w_arrays[:, 5] * events.genWeight)]
+                            )
+                            if lhe_pdf_w_arrays is not None
+                            and number_lhe_scaleweights > 5
+                            else ak.Array([0.0])
+                        ),
+                        "total_lhe_scaleweights_6": (
+                            ak.Array(
+                                [ak.sum(lhe_scale_w_arrays[:, 6] * events.genWeight)]
+                            )
+                            if lhe_pdf_w_arrays is not None
+                            and number_lhe_scaleweights > 6
+                            else ak.Array([0.0])
+                        ),
+                        "total_lhe_scaleweights_7": (
+                            ak.Array(
+                                [ak.sum(lhe_scale_w_arrays[:, 7] * events.genWeight)]
+                            )
+                            if lhe_pdf_w_arrays is not None
+                            and number_lhe_scaleweights > 7
+                            else ak.Array([0.0])
+                        ),
+                        "total_lhe_scaleweights_8": (
+                            ak.Array(
+                                [ak.sum(lhe_scale_w_arrays[:, 8] * events.genWeight)]
+                            )
+                            if lhe_pdf_w_arrays is not None
+                            and number_lhe_scaleweights > 8
+                            else ak.Array([0.0])
+                        ),
+                        "total_psweights_0": (
+                            ak.Array([ak.sum(ps_w_arrays[:, 0] * events.genWeight)])
+                            if ps_w_arrays is not None and number_of_psweights > 0
+                            else ak.Array([0.0])
+                        ),
+                        "total_psweights_1": (
+                            ak.Array([ak.sum(ps_w_arrays[:, 1] * events.genWeight)])
+                            if ps_w_arrays is not None and number_of_psweights > 1
+                            else ak.Array([0.0])
+                        ),
+                        "total_psweights_2": (
+                            ak.Array([ak.sum(ps_w_arrays[:, 2] * events.genWeight)])
+                            if ps_w_arrays is not None and number_of_psweights > 2
+                            else ak.Array([0.0])
+                        ),
+                        "total_psweights_3": (
+                            ak.Array([ak.sum(ps_w_arrays[:, 3] * events.genWeight)])
+                            if ps_w_arrays is not None and number_of_psweights > 3
+                            else ak.Array([0.0])


I was wondering whether we could have a unified function that does the work for all kinds of weights, something like

    import awkward as ak

    def sumw(events, mcweights="LHEScaleWeight"):
        # largest number of weight variations stored for any event
        weight_size = ak.max(ak.count(events[mcweights], axis=-1))
        weights = {}
        for i in range(weight_size):
            weights[f"total_{mcweights}_{i}"] = ak.sum(
                events[mcweights][:, i] * events.genWeight
            )
        return weights

Then you can include the returned weight sums:

    LHEweight = sumw(events, mcweights="LHEScaleWeight")
    f['sumw'] = {**LHEweight, ....}

Then I think this functionality can later be used in other workflows :)
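To make the intended behaviour concrete, here is a dependency-free sketch of the same idea using plain Python lists in place of awkward arrays. The name `sum_of_weights`, its arguments, and the `"PSWeight"` label are assumptions for this example. Unlike the diff above, an absent weight branch yields no entries here rather than `0.0` placeholders. That is a design choice of the sketch, not something decided in this PR.

```python
def sum_of_weights(gen_weights, variation_weights, name):
    """Return {f"total_{name}_{i}": sum over events of w[i] * genWeight}.

    variation_weights holds one list of weights per event, or None when
    the branch is absent, in which case an empty dict is returned.
    """
    if variation_weights is None:
        return {}
    # Largest number of variations stored for any event.
    n_variations = max((len(w) for w in variation_weights), default=0)
    totals = {}
    for i in range(n_variations):
        # Events with fewer than i + 1 variations are skipped for index i.
        totals[f"total_{name}_{i}"] = sum(
            w[i] * g for w, g in zip(variation_weights, gen_weights) if i < len(w)
        )
    return totals


# Toy input: three events, two LHE scale variations, no PS weights.
gen = [1.0, -1.0, 2.0]
lhe = [[1.0, 0.5], [1.0, 0.5], [1.0, 0.5]]
sumw = {
    **sum_of_weights(gen, lhe, "LHEScaleWeight"),
    **sum_of_weights(gen, None, "PSWeight"),
}
```

One loop of this kind would replace the twelve hand-written `total_lhe_scaleweights_*` and `total_psweights_*` entries, and the presence check is applied to the same array that is being summed.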

mondalspandan and others added 16 commits January 16, 2025 10:00

Fix small bug in --isArray mode

8a671bf

doc: add the documentation of CommFW

246a041

- doc: add instructions of the CommFW, remove contents from readme
- doc: add api information
- env: update missing module

Linting

c8b03bd

Add latest 2022 veto maps (summer22 instead of winter22)

cb10ee0

feat: add JEC/jetveto with files in jsonpog-intergration

21c8694

fix typo in suball script

c5884ec

change to nodefaults channels to avoid dependency on anacond for enterprises

8c87a24

add sum of weights for variations in BTA_ttbar, make basepath configurable

936fbb3

add protection if lhe and ps scale weights are not present in BTA_ttbar workflow

167a689

Merge branch 'spandan_dev_2501' of github.com:mondalspandan/BTVNanoCommissioning into btv_dilep_wf

d5dc253

also transfer files without events for correctly keeping track of sum of weights for BTA_ttbar workflow

af54ec9

doc: improve plot description

a2ad076

- add plots for plot description
- more info in docstring

doc: add correction description

8ee0ee6

add default gen unc.

90bd9b4

fix: JEC unc

35550d4

fix: uncertainties

70684b8

Ming-Yan approved these changes Jan 23, 2025


Ming-Yan requested changes Jan 23, 2025

src/BTVNanoCommissioning/workflows/BTA_ttbar_producer.py (outdated)

philippgadow added 12 commits January 23, 2025 15:17

add white/black list to suball

e28a1e2

merge

d24987f

pre merge master

f3b4234

merge with master

7dcf7e3

update BTA_ttbar producer

85d65b3

Merge branch 'master' of github.com:cms-btv-pog/BTVNanoCommissioning into btv_dilep_wf

af9395d

Merge branch 'master' of github.com:cms-btv-pog/BTVNanoCommissioning into btv_dilep_wf

546bcc2

update BTA ttbar producer

0213f28

Merge branch 'master' of github.com:cms-btv-pog/BTVNanoCommissioning into btv_dilep_wf

0a1f7c1

implement distinction for run 2 and run 3 MET

a254d08

format with black

777a5a9

remove DUST reference

119eb42

fix run 2 vs run 3 MET selection

39ca4c2

philippgadow requested a review from Ming-Yan March 10, 2025 16:35

Ming-Yan requested changes Mar 13, 2025


philippgadow closed this May 21, 2025

philippgadow deleted the btv_dilep_wf branch May 21, 2025 09:58

philippgadow restored the btv_dilep_wf branch May 21, 2025 09:59
