New implementation of CMS_WCHARM_13TEV_WPWM-TOT-UNNORM #2244
base: master
Hi @RoyStegeman , I think I'm done here. Please, see the following comments:
Honestly, I can't judge whether these differences are relevant or not. The difference in chi2 is not negligible if one accounts for the shifts. On the other hand, the difference in the t0 matrices does not really worry me, as I was able to reproduce the chi2 of the legacy implementation provided the shifts were removed. @RoyStegeman, what do you think? Maybe it is worth asking @enocera.
Do you know why the fktables of this dataset only exist in theories 704 (0.5,0.5) and 705 (0.5,1)?
No, maybe @enocera does.
I'm not so familiar with dataset implementations so this is going to take me some time to figure out...
For now I just have a question regarding the Extractor class. There are also a lot of unused imports; are you using an LSP?
import numpy as np | ||
import yaml |
import numpy as np | |
import yaml |
import os | ||
|
||
import numpy as np | ||
import pandas as pd |
import pandas as pd |
import os | ||
|
||
import numpy as np | ||
import pandas as pd |
import pandas as pd |
SYS_UNC_by_bin = [{}] | ||
|
||
|
||
class Extractor: |
Some of this stuff seems pretty universal. Would it be worth defining a base class that is shared between datasets?
I've been thinking about making this class more universal. However, I gave up because there are many differences between datasets, even within the same experiment. If we want to use a more universal extractor, then we should all agree on a common standard for the dataset specifics. For now, the extractor class is rather specialised to the datasets that I implemented.
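To make the base-class suggestion concrete, here is a minimal sketch of what a shared extractor could look like. Every name in it (`BaseExtractor`, `read_central_values`, `read_kinematics`, `build`, `ToyExtractor`) is hypothetical and does not come from this PR or from the existing code; it only illustrates the idea of keeping the dataset-independent steps in one place and leaving the dataset-specific parsing to subclasses.

```python
from abc import ABC, abstractmethod


class BaseExtractor(ABC):
    """Hypothetical shared base class; not part of the actual code base."""

    def __init__(self, observable):
        self.observable = observable

    @abstractmethod
    def read_central_values(self):
        """Return one central value per bin (dataset specific)."""

    @abstractmethod
    def read_kinematics(self):
        """Return one dict of kinematic variables per bin (dataset specific)."""

    def build(self):
        """Dataset-independent step: collect and sanity-check the pieces."""
        central = self.read_central_values()
        kinematics = self.read_kinematics()
        if len(central) != len(kinematics):
            raise ValueError("Number of data points and kinematic bins differ")
        return {
            "observable": self.observable,
            "data_central": central,
            "kinematics": kinematics,
        }


class ToyExtractor(BaseExtractor):
    """Toy subclass with made-up numbers, used only to exercise the sketch."""

    def read_central_values(self):
        return [1.0, 2.0]

    def read_kinematics(self):
        return [{"abs_eta": 0.2}, {"abs_eta": 0.6}]


result = ToyExtractor("TOY-OBSERVABLE").build()
```

Whether this split is worth it depends on how much of the per-dataset logic really is common, which is exactly the concern raised above.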
@@ -0,0 +1,304 @@ | |||
import logging | |||
import os |
import os |
kinematic_coverage: | ||
- k1 | ||
- k2 | ||
- k3 | ||
plot_x: abs_eta | ||
kinematic_coverage: [abs_eta, m_W2] |
Why do the legacy kinematics have three variables and the new implementation just two? And does the legacy dataset not use this metadatafile and thus cause issues if the kinematics don't match?
From the value it seems the removed variable was just supposed to indicate the 13TeV beam energy, which I don't think would be used anywhere. But this question is more about how the code deals with it.
The third variable in the old implementation was the beam energy. However, this is optional in the new common data parser as the
The legacy dataset does use this metadata file. However, since the kinematics are the same between the two versions, the legacy file can be removed (and I will remove it).
@@ -0,0 +1,143 @@ | |||
import numpy as np |
import numpy as np |
Language server protocol. It's the software that highlights tokens based on their role in the Python syntax, including unused imports. I'm pretty sure you are using it, but just in case you're not
Oh, then I am. But I just forgot to delete the unused imports.
I think that the reason is as follows: W+c data were not included in NNPDF4.0 because, at that time, NNLO corrections to the matrix elements were not known. When the MHOU, QED, and aN3LO determinations were produced, the leitmotiv was to put them on the same grounds as NNPDF4.0. Therefore W+c did not go into them. At some point I raised the question of whether we should include it (in the same way as we do, e.g., for LHC data in the N3LO fit). Initially their answer was yes, but then they retracted it. So I suspect that Andrea started to compute the FK tables, but then stopped.
f054206 to 342e1f5
According to what ERN said in the last code meeting, this one is also ready for review.
This PR implements CMS_WCHARM_13TEV_WPWM-TOT-UNNORM in the new format.
General comments
This dataset delivers the differential distribution as a function of the absolute rapidity of the lepton pair. Each data point is accompanied by a (symmetric) statistical uncertainty and an (asymmetric) systematic uncertainty. The latter is the sum in quadrature of the different sources of uncertainty. The breakdown of these systematic sources is not delivered in the HepData format, but it is given in Table 1 of the paper.
The legacy version has the variant sys_10, which should not be implemented because it was meant to account for the 3pt prescription.
Legacy: [default]
New: [default w/ shifts], [default w/o shifts]
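For readers unfamiliar with what "shifts" means here, the sketch below shows one common prescription for turning an asymmetric uncertainty into a symmetric one plus a shift of the central value. It is an assumption that the new implementation follows this exact prescription; the function names `symmetrize` and `apply_variant` and the numbers are purely illustrative, and how the "w/o shifts" variant treats the symmetrised width is not stated in this PR (the sketch simply leaves the central value untouched).

```python
import math


def symmetrize(delta_plus, delta_minus):
    """Symmetrise an asymmetric uncertainty.

    delta_plus is the upper variation (>= 0) and delta_minus the signed
    lower variation (<= 0). Returns (shift, sigma), where shift is added
    to the central value and sigma is the symmetric uncertainty.
    """
    shift = (delta_plus + delta_minus) / 2.0
    average = (delta_plus - delta_minus) / 2.0
    sigma = math.sqrt(average**2 + 2.0 * shift**2)
    return shift, sigma


def apply_variant(central, delta_plus, delta_minus, with_shift=True):
    """Return (central value, symmetric uncertainty) for the chosen variant."""
    shift, sigma = symmetrize(delta_plus, delta_minus)
    new_central = central + shift if with_shift else central
    return new_central, sigma


# Illustrative numbers only, not taken from the dataset.
shifted = apply_variant(10.0, 3.0, -1.0, with_shift=True)
unshifted = apply_variant(10.0, 3.0, -1.0, with_shift=False)
```

Under this prescription a symmetric input (for example +2 and -2) gives a zero shift, so the two variants only differ for bins whose systematic uncertainty is genuinely asymmetric. That is consistent with the chi2 difference discussed above appearing only when the shifts are applied.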