The python scripts in this repository should help you get started analysing the HGCAL L1 TP ntuples.
This step is lxplus specific, giving access to a more recent python and root version.
Edit/skip it accordingly for your specific system.
source setup_lxplus.sh
This step needs to be done only once for your account and can be done with whatever python version is in use on the system.
For some reason the current CMSSW scripts seem to deliver an inconsistent setup of virtualenv and virtualenvwrapper; for this reason we force a new installation in ~/.local using:
pip install --ignore-installed --user virtualenv==15.1.0 virtualenvwrapper
For a more complete overview of the procedure you can refer to the virtualenvwrapper installation instructions.
To start using virtualenvwrapper:
source setVirtualEnvWrapper.sh
The first time you will have to create the actual instance of the virtualenv:
mkvirtualenv --system-site-packages -p `which python3.8` -r requirements_py3.8.txt <venvname>
Use requirements_py3.8.txt and requirements_py3.10.txt for python 3.8 and 3.10 respectively.
You can also use the requirements file directly, for example:
pip install -r requirements_py3.8.txt
After the first installation, for each new session you need to repeat the lxplus-specific setup (edit/skip it accordingly for your specific system):
source setup_lxplus.sh
and set up virtualenvwrapper again:
source setVirtualEnvWrapper.sh
After this initial one-time setup is done you can just activate the virtualenv by calling:
workon <venvname>
(lsvirtualenv is your friend in case you forgot the name).
The main script is analyzeHgcalL1Tntuple.py:
python analyzeHgcalL1Tntuple.py --help
An example of how to run it:
python analyzeHgcalL1Tntuple.py -f cfg/hgctps.yaml -i cfg/datasets/ntp_v81.yaml -c tps -s doubleele_flat1to100_PU200 -n 1000 -d 0
The configuration is handled by 2 yaml files. One specifies:
- output directories
- versioning of the plots
- collections of samples, i.e. groups of samples to be processed homogeneously: for each collection the list of plotters (see below) to be run is provided.
The other provides:
- details of the input samples (location of the ntuple files)
Examples of configuration files can be found in the cfg/ directory, e.g. cfg/hgctps.yaml and cfg/datasets/ntp_v81.yaml used in the example above.
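For a first look at how these files are organised, a minimal sketch like the following (assuming PyYAML is available in the virtualenv; the file names are the ones from the example command above) just prints their top-level entries:

```python
# Minimal sketch: print the top-level entries of the two configuration files
# used in the example command above; the actual schema is whatever the
# configuration code of this repository expects.
import yaml

for cfg_file in ('cfg/hgctps.yaml', 'cfg/datasets/ntp_v81.yaml'):
    with open(cfg_file) as cfg_handle:
        cfg = yaml.safe_load(cfg_handle)
    # assuming the top level is a mapping, list(cfg) gives its keys
    print(cfg_file, '->', list(cfg))
```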
The list of branches to be read and converted to pandas DataFrame format is specified in the corresponding module, instantiating an object of class DFCollection. What is actually read event by event depends anyhow on which plotters are actually instantiated (collections are read on-demand).
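The on-demand logic can be pictured roughly as in the following sketch; this is a hypothetical simplification, the class and method names are illustrative and not the actual interface of DFCollection:

```python
import pandas as pd

class LazyCollection:
    """Hypothetical sketch: branches are converted to a pandas DataFrame only
    for collections that at least one instantiated plotter has asked for."""
    def __init__(self, name, fill_function):
        self.name = name
        self.fill_function = fill_function  # callable: event -> pd.DataFrame
        self.activated = False              # set to True by the plotters using this collection
        self.df = pd.DataFrame()

    def activate(self):
        self.activated = True

    def fill(self, event):
        # skip reading the branches entirely if no plotter needs this collection
        if self.activated:
            self.df = self.fill_function(event)
```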
Selections are defined as strings in a dedicated module. Different collections of selections are defined for different objects and/or different purposes. The selections have a name which is used for the histogram naming (see below). Selections are used by the plotters.
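Conceptually, a selection behaves like a named pandas query string. The sketch below is purely illustrative (the class name, attributes and variable names are assumptions, not the repository's actual interface):

```python
import pandas as pd

class Selection:
    """Hypothetical sketch of a named selection."""
    def __init__(self, name, label, sel_string):
        self.name = name              # enters the histogram name (see naming convention below)
        self.label = label            # human-readable label for plot legends
        self.sel_string = sel_string  # pandas query string; empty means "no cut"

    def apply(self, df):
        return df.query(self.sel_string) if self.sel_string else df

# illustrative usage on a toy DataFrame (the column name is an assumption)
sel_pt10 = Selection('Pt10', 'p_T > 10 GeV', 'pt > 10')
print(sel_pt10.apply(pd.DataFrame({'pt': [3., 15., 42.]})))
```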
The actual functionality of accessing the objects, filtering them according to the selections and filling histograms is provided by the plotter classes, defined in a dedicated module.
Basic plotters are already available; most likely you just need to instantiate one of them (or a collection of them) using the DFCollection instance you are interested in.
Which collection is run for which sample is steered by the configuration file.
The plotters access one or more collections, select them in several different ways, book and fill the histograms (see below).
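The following is a hypothetical, minimal sketch of what a plotter does: it books one histogram per named selection and fills it from the collection's DataFrame. The real plotter classes are more general (and fill ROOT histograms, since the outputs are merged with hadd); numpy is used here only to keep the example self-contained:

```python
import numpy as np
import pandas as pd

class SimplePlotter:
    """Hypothetical sketch of a plotter: one histogram per named selection."""
    def __init__(self, selections, variable='pt', bins=np.linspace(0., 100., 51)):
        self.selections = selections  # dict: selection name -> pandas query string
        self.variable = variable
        self.bins = bins
        # booking: one empty histogram per selection
        self.histos = {name: np.zeros(len(bins) - 1) for name in selections}

    def fill_histos(self, df):
        # df: the DataFrame of the collection for the current event
        for name, query in self.selections.items():
            selected = df.query(query) if query else df
            counts, _ = np.histogram(selected[self.variable], bins=self.bins)
            self.histos[name] += counts

# illustrative usage on a toy DataFrame
plotter = SimplePlotter({'all': '', 'Pt10': 'pt > 10'})
plotter.fill_histos(pd.DataFrame({'pt': [3., 15., 42.]}))
```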
Histograms are handled in a dedicated module.
There are different classes of histograms depending on the input object and on the purpose.
To add a new histogram to an existing class it is enough to add it in the corresponding constructor and in the fill method. The writing of the histos to files is handled transparently.
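As a purely illustrative sketch of that pattern (hypothetical class and histogram names, numpy instead of ROOT to keep it self-contained), adding a histogram means touching exactly two places:

```python
import numpy as np

class HistoSetExample:
    """Hypothetical sketch of a histogram class: adding a histogram means one
    line in the constructor (booking) and one in fill()."""
    def __init__(self, name):
        self.name = name
        self.h_pt = np.zeros(50)   # existing histogram
        self.h_eta = np.zeros(60)  # new histogram: book it here...

    def fill(self, df):
        self.h_pt += np.histogram(df['pt'], bins=50, range=(0., 100.))[0]
        self.h_eta += np.histogram(df['eta'], bins=60, range=(-3., 3.))[0]  # ...and fill it here
```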
The histogram naming follows the convention:
<ObjectName>_<SelectionName>_<GenSelectionName>_<HistoName>
This is assumed in all the plotters and in the code to actually draw the histograms.
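In other words (illustrative values, hypothetical helper function):

```python
# Illustrative only: how the four pieces compose into a histogram name
# (the example values are made up).
def build_histo_name(obj_name, sel_name, gen_sel_name, histo_name):
    return f'{obj_name}_{sel_name}_{gen_sel_name}_{histo_name}'

print(build_histo_name('TkEle', 'Pt10', 'GEN', 'h_pt'))  # -> TkEle_Pt10_GEN_h_pt
```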
Note that the script analyzeHgcalL1Tntuple.py can be used to submit the jobs to the HTCondor batch system invoking the -b option. A DAG configuration is created and you can actually submit it following the script output.
For each sample injected in the batch system a DAG is created. The DAG will submit an hadd command once all the jobs have succeeded.
However, if you don't want to wait (or you don't care), you can also submit a condor job that will run hadd periodically, thus dramatically reducing the latency.
For example:
condor_submit batch_single_empart_guns_tracks_v77/ele_flat2to100_PU0/batch_harvest.sub