
Conversation

@ecole41 ecole41 commented Jun 20, 2025

This pull request is for the implementation of the ATLAS_WCHARM_13TEV dataset

@ecole41 (Collaborator, Author) commented:

@enocera Do we want to have all of the channels ((W−+D+), (W++D−), (W−+D∗+), (W++D∗−)) put together into one dataset like this?

@enocera (Contributor) commented:

@ecole41 Yes we want a single data set with all the channels. Two remarks.

  1. One must be cautious with respect to the treatment of correlations: there may be uncertainties that are correlated across all the channels and uncertainties that are not. But this is something that you surely know better than me, because you've gone through the paper. Also, when defining the theory part of the metadata, one should have four entries, with four different names, one for each channel.
  2. I realise that this measurement is only particle level, which may complicate the way in which we will have to make theoretical predictions, but this is something that we can discuss later.

@ecole41 ecole41 self-assigned this Jun 24, 2025
Code excerpt under review:

    """

    ndat = 5
    # Produce covmat of form [[W-/W+],[0],
@ecole41 ecole41 commented Jun 25, 2025

@enocera We are given the covariance matrices for W−/W+ and for W−(*)/W+(*). I am assuming that means there are no correlations between the excited (*) and non-excited channels.
I wanted to check how the systematics should be constructed from this.

We are given the systematics for each point, but I am not sure whether these include the correlations from this covmat. The covariance matrices are also combined statistical-and-systematic covariance matrices, so does decomposing this covmat give the final systematics with the correlations accounted for?

@enocera enocera commented Jun 27, 2025

Dear @ecole41, after looking into this data set a little, I suggest implementing two variants insofar as uncertainties are concerned. This means that you have to generate two uncertainties .yaml files, with different names of your choice (e.g. uncertainties.yaml and uncertainties_covariances.yaml).
1 - The first variant, associated with the uncertainties.yaml file, is the one in which you implement the breakdown of 280 uncertainties provided, for each bin, in the last column of Tables 19-22. These uncertainties, which have the same names across all channels, can be correlated all over the place.
2 - The second variant, associated with the uncertainties_covariances.yaml file, is the one in which you implement the information that you've got from Tables 15-18. The way I'd do this is as follows: first construct a block-diagonal covariance matrix; then generate N_dat artificial systematics from this covariance matrix. The function that generates artificial systematics from a given covariance matrix is here.
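The block-diagonal construction and decomposition described above can be sketched as follows. This is a minimal illustration, not the actual nnpdf_data function: the helper name and the 2x2 matrices are invented for the example, and the real code may use a different decomposition convention.

```python
import numpy as np

def artificial_systematics(covmat):
    # Decompose a covariance matrix C into columns A such that A @ A.T == C,
    # via the eigendecomposition of the symmetric matrix C.
    eigvals, eigvecs = np.linalg.eigh(covmat)
    # Clip tiny negative eigenvalues that can arise from numerical noise
    eigvals = np.clip(eigvals, 0.0, None)
    return eigvecs * np.sqrt(eigvals)

# Block-diagonal covmat: D channels assumed uncorrelated with D* channels
# (illustrative numbers, not from the paper's tables)
cov_d = np.array([[4.0, 1.0], [1.0, 3.0]])
cov_dstar = np.array([[2.0, 0.5], [0.5, 1.5]])
cov = np.block([
    [cov_d, np.zeros((2, 2))],
    [np.zeros((2, 2)), cov_dstar],
])

art_sys = artificial_systematics(cov)
# Each column of art_sys is one artificial systematic per data point,
# and together they reproduce the input covariance matrix.
assert np.allclose(art_sys @ art_sys.T, cov)
```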

@ecole41 ecole41 commented Jul 9, 2025

Thank you, that makes a lot of sense.

I just wanted to check the function used to symmetrise the errors. The function in nnpdf_data/nnpdf_data/filter_utils/uncertainties.py shows that se_delta is set to the average of the two errors and se_sigma is related to their difference. Should this be the other way around?

@ecole41 ecole41 commented Jul 11, 2025

I also wanted to check how we should treat the se_delta and se_sigma values. Do we need to add the se_delta onto the central data values and use these in the data.yaml file?

@enocera enocera commented Jul 29, 2025

> I just wanted to check the function used to symmetrise the errors. The function in nnpdf_data/nnpdf_data/filter_utils/uncertainties.py shows that se_delta is set to the average of the two errors and se_sigma is related to their difference. Should this be the other way around?

You are totally right: that function looks wrong, and it should be the other way around to be consistent with Eqs. (23)-(24) and (27) of https://arxiv.org/pdf/physics/0403086. We should check how many data sets have been affected by that typo. Thanks.
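The corrected convention from Eqs. (23)-(24) of arXiv:physics/0403086 can be written as a one-liner: for an asymmetric error +sigma_plus / -sigma_minus, the shift of the central value is half the difference and the symmetric error is half the sum. The function name here is illustrative, not the actual nnpdf_data API.

```python
def symmetrise(sigma_plus, sigma_minus):
    # Eqs. (23)-(24) of arXiv:physics/0403086 (leading-order symmetrisation):
    # se_delta shifts the central value, se_sigma is the symmetrised error.
    se_delta = (sigma_plus - sigma_minus) / 2.0
    se_sigma = (sigma_plus + sigma_minus) / 2.0
    return se_delta, se_sigma
```

For example, an error of +3/-1 gives a shift of +1 and a symmetric error of 2, which is what the averaged/differenced roles in the existing function should have been.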

@enocera (Contributor) commented:

> I also wanted to check how we should treat the se_delta and se_sigma values. Do we need to add the se_delta onto the central data values and use these in the data.yaml file?

You have to compute se_delta for each data point, sum the contributions, shift the experimental central value, and dump that into the data.yaml file. And likewise for the uncertainty. We are essentially using Eqs. (27)-(28) of https://arxiv.org/pdf/physics/0403086. The function above computes se_delta and se_sigma for a single data point i, but then you have to loop over all points, manipulate the central values and uncertainties, and finally write the manipulated central values and uncertainties into the corresponding .yaml files. I hope that this clarifies the issue.
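A minimal sketch of that loop, assuming each data point carries several asymmetric sources whose shifts are summed before moving the central value (all numbers and the data layout are illustrative, not from the dataset):

```python
# Eqs. (27)-(28) of arXiv:physics/0403086: per point, symmetrise every
# asymmetric source, sum the resulting shifts, move the central value.
central = [10.0, 20.0]
# per point: list of (sigma_plus, sigma_minus) for each uncertainty source
sources = [
    [(1.2, 0.8), (0.3, 0.5)],
    [(0.5, 0.9)],
]

shifted, sym_errs = [], []
for c, srcs in zip(central, sources):
    deltas = [(sp - sm) / 2.0 for sp, sm in srcs]  # shift per source
    sigmas = [(sp + sm) / 2.0 for sp, sm in srcs]  # symmetrised error per source
    shifted.append(c + sum(deltas))   # shifted central values -> data.yaml
    sym_errs.append(sigmas)           # symmetrised errors -> uncertainties.yaml
```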

label: ATLAS $W^-+c$ 13 TeV
units: '[pb]'
process_type:
tables: [19,20,21,22] # 5/19 (W−+D+), 6/20(W++D−), 9/21(W−+D∗+), 10/22(W++D∗−)
@ecole41 (Collaborator, Author) commented:

@enocera Should these four channels ((W−+D+), (W++D−), (W−+D∗+), (W++D∗−)) be separated into distinct observables or be kept as one?

@enocera (Contributor) commented:

These should be four different observables of the same data set, as, e.g., different differential distributions for top pair production.

@ecole41 (Collaborator, Author) commented:

Ok, and should these all have separate data, uncertainties, and kinematics .yaml files? So four of each?

ndata: 5
plotting:
dataset_label: ATLAS $W^-+c$ 13 TeV
y_label: 'Differential fiducial cross-section times the single-lepton-flavor W boson branching ratio' # In LaTeX terms?
@ecole41 (Collaborator, Author) commented:

I'm not sure how to put this in LaTeX terms.
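One hedged possibility for the label, assuming the observable is the fiducial cross-section differential in the lepton pseudorapidity (the symbols below are a guess at a compact notation, not taken from the paper):

```latex
% Differential fiducial cross-section times the single-lepton-flavour
% W branching ratio; eta^ell and the fiducial superscript are assumed notation.
d\sigma^{\mathrm{fid}}/d|\eta^{\ell}| \times \mathcal{B}(W \to \ell\nu)\ \mathrm{[pb]}
```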

@enocera enocera force-pushed the implement_ATLAS_WCHARM_13TEV branch from 202b0e2 to 9665cd0 Compare August 27, 2025 20:48
@enocera enocera changed the title [WIP] Implementation of ATLAS_WCHARM_13TEV ATLAS_WCHARM_13TEV Aug 28, 2025
enocera commented Aug 28, 2025

Dear @ecole41, in view of the upcoming Morimondo meeting, I have revised the implementation of this data set. After a more careful look at the paper, I came to the conclusion that the best way of proceeding is to implement only two observables: one for the D-meson channel and one for the D*-meson channel. In both cases the observable incorporates a W− and a W+, and it is the absolute distribution differential in the pseudorapidity of the lepton.

This choice greatly simplifies the implementation, because we only need to take into account the full covariance matrix (Table 12 of the paper) and decompose it into the artificial systematic uncertainties. Because the full covariance matrix is not available for the normalised distributions, we simply dismiss them. We also dismiss the distributions differential in pT, because they are theoretically trickier than the distributions in eta. Because W− and W+ are correlated, we have to incorporate the two in the same observable, for each decay mode.

The two decay modes, as per the paper, are instead largely uncorrelated, except for the luminosity uncertainty. For this reason, I first compute the luminosity covariance matrix and subtract it from the full covariance matrix provided by the experimentalists before generating the artificial systematic uncertainties. I then attach the luminosity uncertainty to the list of uncertainties as usual. This way the luminosity uncertainty can be correlated across data sets and experiments, as it should be, and the implementation is significantly more compact. The tests now pass. We can discuss further in Morimondo. Thanks.
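The luminosity-subtraction step described above can be sketched as follows. The relative luminosity uncertainty, the central values, and the 3x3 covariance matrix are all invented for the illustration; the real implementation works on the matrices from the paper.

```python
import numpy as np

# Assumed relative luminosity uncertainty (illustrative, not from the paper)
lumi_frac = 0.017
central = np.array([12.0, 11.0, 9.0])
# Illustrative full covariance matrix (stat + syst + lumi)
cov_full = np.array([
    [0.30, 0.10, 0.08],
    [0.10, 0.28, 0.09],
    [0.08, 0.09, 0.25],
])

# Fully correlated luminosity block: outer product of the absolute lumi errors
lumi = lumi_frac * central
cov_lumi = np.outer(lumi, lumi)

# Subtract the luminosity block, then decompose only the remainder
cov_rest = cov_full - cov_lumi
eigvals, eigvecs = np.linalg.eigh(cov_rest)
art_sys = eigvecs * np.sqrt(np.clip(eigvals, 0.0, None))

# art_sys holds the artificial systematics; the luminosity uncertainty is
# then appended as a separate column so it can be correlated across data sets.
assert np.allclose(art_sys @ art_sys.T + cov_lumi, cov_full, atol=1e-10)
```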

@scarlehoff (Member) commented:

@ecole41 could you generate also the (NLO) grids for this dataset?


ecole41 commented Oct 28, 2025

> @ecole41 could you generate also the (NLO) grids for this dataset?

I am happy to give it a go. I am new to making grids without the pine cards. Do we have a similar dataset already to use as a base?


enocera commented Oct 29, 2025

Dear @ecole41, unfortunately we don't have an example for W+charm. We used MCFM to compute APPLgrids back in the day, but I don't think that is of any help now. As @scarlehoff suggests, we may want to use madgraph to generate the NLO PineAPPL grids. These will then be supplemented with K-factors to incorporate the NNLO corrections. In this respect, I point you to the paper https://arxiv.org/pdf/2510.24525, in which a comparison is performed between NNLO predictions and the very same data set as in this PR (see in particular Sect. 3). Maybe you've got a chance to talk to René Poncelet, who sits at the Cavendish, and ask whether they can provide you with any useful input to compute the theoretical predictions?
