CondDBESSource.cc added dump method to JSON file #43374

PonIlya · 2023-11-23T13:51:36Z

PR description:

The purpose of this PR is to improve the CondDBESSource.cc dump method.
Records and their consumption will be written to a JSON file rather than to a log file or the console, because they are usually too long for visual analysis.
This is more convenient for further use, for example for generating consumption tables; previously, a log-parsing script was used to fill them out.

Previously, this issue was raised in the following PR but was not approved

Taking the comment into account, I kept the old dump method but added the option to write a JSON dump, with the file name specified via the command:
process.GlobalTag.JsonDumpFileName = cms.untracked.string("CondDBESSource.json")
The JSON dump file is created only if a file name, or a path to it, is specified with the command above.

PR validation:

After running one of the commands below, the generated .json file should have the same contents as the previous version of the dumpstat output.

cmsDriver.py TTbar_13TeV_TuneCUETP8M1_cfi --conditions auto:phase1_2023_realistic_postBPix -n 5 --era Run3_2023 --geometry DB:Extended -s GEN --fileout output_step1_GEN.root --beamspot Realistic25ns13p6TeVEarly2023Collision --customise_commands='process.GlobalTag.DumpStat =cms.untracked.bool(True) \n process.GlobalTag.JsonDumpFileName =cms.untracked.string("CondDBESSource.json")'|& tee output_step1_GEN.log
OR (without dump into .log)
cmsDriver.py TTbar_13TeV_TuneCUETP8M1_cfi --conditions auto:phase1_2023_realistic_postBPix -n 5 --era Run3_2023 --geometry DB:Extended -s GEN --fileout output_step1_GEN.root --beamspot Realistic25ns13p6TeVEarly2023Collision --customise_commands='process.GlobalTag.JsonDumpFileName =cms.untracked.string("CondDBESSource.json")'|& tee output_step1_GEN.log
Not a backport

cmsbuild · 2023-11-23T13:59:10Z

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-43374/37852

This PR adds an extra 24KB to repository

cmsbuild · 2023-11-23T13:59:35Z

A new Pull Request was created by @PonIlya for master.

It involves the following packages:

CondCore/ESSources (db, alca)

@consuegs, @cmsbuild, @saumyaphor4252, @francescobrivio, @perrotta can you please review it and eventually sign? Thanks.
@mmusich, @PonIlya, @yuanchao, @tocheng, @rsreds this is something you requested to watch as well.
@sextonkennedy, @rappoccio, @antoniovilela you are the release manager for this.

cms-bot commands are listed here

perrotta · 2023-11-23T16:14:51Z

@PonIlya this is going to break the ability to produce the twiki format from the text logs. This is nothing that cannot be fixed later by the scripts in AlCaTools, but I would keep the possibility to dump either to the text log or to the JSON file, controlled by a configuration parameter.

Moreover, the JSON output files are numbered incrementally, and their names depend on what is already in the repository where you run them. I think those names should also be made configurable, so that the scripts in AlCaTools/ConditionsConsumed (for example runCMSDrivers_data2023D.sh) can give them the same name as the Python config that is run.

vlimant · 2023-11-24T08:17:25Z

this is an extremely useful feature! many thanks

PonIlya · 2023-11-27T17:12:50Z

@perrotta
I created a script for processing the JSON files, so this shouldn't be a blocker.
Still, it is a good idea to give the user a choice.

I propose it this way:
Leave the old output method as the default, and activate the new one in its place when the --j or --json argument is passed on the command line.
Example:
cmsDriver.py step1 --conditions auto:run3_hlt_relval -n 5 --era Run3_2023 -s L1REPACK:Full --data --scenario pp --datatier FEVTDEBUGHLT --eventcontent FEVTDEBUGHLT --filein /store/data/Run2023D/JetMET0/RAW/v1/000/369/978/00000/00b9eba7-c847-465b-a6de-98bceae93613.root --fileout output_step1_L1.root --customise_commands="process.load('Configuration.StandardSequences.Digi_cff') \n process.GlobalTag.DumpStat =cms.untracked.bool(True)" --json --outputCommands "keep *" |& tee output_step1_L1.log

As for the file names, I will change the naming so that the name is taken from the arguments and matches the name of the .log file (output_step1_L1.json, etc.).

malbouis · 2023-12-14T11:28:24Z

@perrotta , @PonIlya , is there anything preventing this PR from moving forward?

perrotta · 2023-12-14T12:06:37Z

> @perrotta , @PonIlya , is there anything preventing this PR from moving forward?

Yes, #43374 (comment) must be addressed before we can proceed (in particular the second part, which cannot be cured by an additional script in AlCaTools).

vlimant · 2023-12-14T14:18:10Z

How about systematically dumping the info both to the log file (the old way) and to a JSON file on the side (the new way) whenever process.GlobalTag.DumpStat = cms.untracked.bool(True) is set?

vlimant · 2023-12-14T14:24:55Z

IMO the name of the json file should be fixed and unambiguous like "CondDBESSource_stats.json" or "CondDBESSource.json"

PonIlya · 2023-12-14T15:18:23Z

> IMO the name of the json file should be fixed and unambiguous like "CondDBESSource_stats.json" or "CondDBESSource.json"

I am going to add a configuration parameter that takes the name of the JSON file, like --j [filename].json, with CondDBESSource.json as the default.

PonIlya · 2023-12-14T15:30:07Z

However, I have a problem finding a way to pass the JSON file name and the start flag from cmsDriver.py to CondDBESSource.cc.
We need some acceptable way.
Maybe with os.environ:
os.environ["JSON_FILENAME"] = json_filename
and
std::getenv("JSON_FILENAME")
Or add a config file like config.ini
instead of the "Auto generated configuration file", because this kind of config file is named by the user.

For now, unless I come up with something better, I chose this way, with customise_commands:
--customise_commands='process.GlobalTag.JsonDumpFileName = cms.untracked.string("Testfilename") \n process.GlobalTag.DumpStat = cms.untracked.bool(True)'
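For scripted use, the customise_commands value can be assembled programmatically. The following is only a minimal sketch: the helper name json_dump_customise is hypothetical and not part of cmsDriver.py or CMSSW; it only builds the string, using the two parameter names quoted in this thread, and does not run anything.

```python
def json_dump_customise(json_filename, dump_stat=True):
    """Build a value for cmsDriver.py --customise_commands.

    Hypothetical helper for illustration; only the parameter names
    JsonDumpFileName and DumpStat come from this PR thread.
    """
    commands = [
        'process.GlobalTag.JsonDumpFileName = cms.untracked.string("%s")'
        % json_filename
    ]
    if dump_stat:
        # Also keep the old text dump in the log.
        commands.append("process.GlobalTag.DumpStat = cms.untracked.bool(True)")
    # The commands in this thread separate statements with a literal
    # backslash followed by "n", so the same separator is used here.
    return " \\n ".join(commands)


if __name__ == "__main__":
    print(json_dump_customise("Testfilename"))
```

The returned string is meant to be placed inside single quotes after --customise_commands=, as in the commands shown in the PR description.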

malbouis · 2023-12-14T16:57:10Z

@PonIlya , I would go with #43374 (comment) and just give the file a static (non-configurable) name.

cmsbuild · 2023-12-15T11:44:18Z

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-43374/38218

This PR adds an extra 24KB to repository

cmsbuild · 2023-12-15T11:44:44Z

Pull request #43374 was updated. @francescobrivio, @consuegs, @saumyaphor4252, @perrotta, @cmsbuild can you please check and sign again.

PonIlya · 2023-12-18T09:14:27Z

@perrotta I changed the code so that you can select the dump method and file name through custom commands:
process.GlobalTag.JsonDumpFileName = cms.untracked.string("CondDBESSource.json")
I hope the current implementation will suit everyone.

PonIlya · 2023-12-18T09:20:45Z

@malbouis In my opinion a single fixed file name is not convenient, because the file will be overwritten if you run several dumps in a row. It would then not be possible to collect the JSON information for GEN, SIM, etc. in one go with a single .sh script.

I have already added a new approach, and I think it has become more convenient.
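To illustrate the overwriting concern, a driver script could derive one JSON file name per step from the corresponding log file name, as suggested earlier in this thread (output_step1_L1.log giving output_step1_L1.json). This is a sketch under assumptions: the helper json_name_for_log and the second log name are made up for the example.

```python
from pathlib import Path


def json_name_for_log(log_name):
    """Return a JSON dump name matching a log file name.

    Hypothetical helper: swaps the final suffix for ".json".
    """
    return str(Path(log_name).with_suffix(".json"))


if __name__ == "__main__":
    # The first name appears in this thread; the second is invented.
    logs = ["output_step1_GEN.log", "output_step2_SIM.log"]
    names = [json_name_for_log(log) for log in logs]
    # Distinct log names give distinct JSON names, so consecutive
    # dumps do not overwrite each other.
    assert len(set(names)) == len(names)
    for log, name in zip(logs, names):
        print(log, "->", name)
```

Each derived name can then be passed as the value of process.GlobalTag.JsonDumpFileName for the matching step.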

perrotta · 2023-12-18T11:52:43Z

please test

cmsbuild · 2023-12-18T14:49:26Z

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-c35264/36550/summary.html
COMMIT: 2114bdf
CMSSW: CMSSW_14_0_X_2023-12-18-1100/el8_amd64_gcc12
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmssw/43374/36550/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

You potentially removed 85 lines from the logs
Reco comparison results: 28 differences found in the comparisons
DQMHistoTests: Total files compared: 50
DQMHistoTests: Total histograms compared: 3429858
DQMHistoTests: Total failures: 1208
DQMHistoTests: Total nulls: 0
DQMHistoTests: Total successes: 3428628
DQMHistoTests: Total skipped: 22
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0.0 KiB( 49 files compared)
Checked 214 log files, 167 edm output root files, 50 DQM output files
TriggerResults: no differences found

vlimant · 2024-01-10T07:31:10Z

is this good to go ?

perrotta · 2024-01-11T13:21:15Z

please test
(just to refresh them, no surprise expected, though)

perrotta · 2024-01-11T13:25:56Z

+1

Thank you @PonIlya , and sorry for the long delay after the restart at the beginning of the year: the PR as such is good to go, the scripts in AlCaTools/ConditionsConsumed can now easily be adapted to output the JSON file as well, and there is no longer any risk of overwriting those outputs.
Since I do not expect surprises from the tests, I'm signing this now, so that the release managers can merge it as soon as the tests certify that it is still compatible with the release.

cmsbuild · 2024-01-11T13:26:23Z

This pull request is fully signed and it will be integrated in one of the next master IBs after it passes the integration tests. This pull request will now be reviewed by the release team before it's merged. @antoniovilela, @sextonkennedy, @rappoccio (and backports should be raised in the release meeting by the corresponding L2)

cmsbuild · 2024-01-11T16:03:18Z

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-c35264/36802/summary.html
COMMIT: 2114bdf
CMSSW: CMSSW_14_0_X_2024-01-10-2300/el8_amd64_gcc12
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmssw/43374/36802/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

You potentially removed 100 lines from the logs
Reco comparison results: 1 differences found in the comparisons
DQMHistoTests: Total files compared: 48
DQMHistoTests: Total histograms compared: 3247277
DQMHistoTests: Total failures: 0
DQMHistoTests: Total nulls: 0
DQMHistoTests: Total successes: 3247255
DQMHistoTests: Total skipped: 22
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0.0 KiB( 47 files compared)
Checked 200 log files, 161 edm output root files, 48 DQM output files
TriggerResults: no differences found

rappoccio · 2024-01-12T14:46:55Z

+1

mmusich · 2023-12-16T13:15:18Z

CondCore/ESSources/plugins/CondDBESSource.cc

+      recordData["timeLookupPayloadIds"].push_back(payloadIdData);
+    }
+
+    jsonData[recName].push_back(recordData);


Out of curiosity: does it make sense to limit the JSON output to the payloads that are actually consumed in the job (i.e. if !pids.empty()), or is there a particular reason to print all the tags that could be consumed but actually are not?
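The same restriction could also be applied after the fact, when reading the dump. The sketch below assumes the layout suggested by the diff excerpt above: a top-level object keyed by record name, each holding a list of entries with a "timeLookupPayloadIds" list. Everything else (the function name, the record names, the payload values) is invented for illustration, and treating an empty list as "not consumed" is an assumption based on this comment.

```python
import json


def consumed_only(dump):
    """Keep only entries whose timeLookupPayloadIds list is non-empty."""
    filtered = {}
    for rec_name, entries in dump.items():
        kept = [entry for entry in entries if entry.get("timeLookupPayloadIds")]
        if kept:
            filtered[rec_name] = kept
    return filtered


if __name__ == "__main__":
    # Made-up sample; a real dump would be read with json.load() from
    # the file named by JsonDumpFileName.
    sample = json.loads(
        """
        {
          "RecordA": [{"timeLookupPayloadIds": ["payload0"]}],
          "RecordB": [{"timeLookupPayloadIds": []}]
        }
        """
    )
    print(sorted(consumed_only(sample)))
```

Filtering on the reader side leaves the dump itself complete, so the list of tags that were available but unused remains accessible if it is needed.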

cmsbuild added this to the CMSSW_14_0_X milestone Nov 23, 2023

cmsbuild added alca-pending db-pending pending-signatures tests-pending orp-pending code-checks-pending labels Nov 23, 2023

PonIlya mentioned this pull request Nov 23, 2023

New script for converting JSON dumpstat to Twiki format cms-AlCaDB/AlCaTools#94

Open

cmsbuild added code-checks-approved and removed code-checks-pending labels Nov 23, 2023

cmsbuild added code-checks-pending and removed code-checks-approved labels Dec 15, 2023

cmsbuild added code-checks-approved and removed code-checks-pending labels Dec 15, 2023

PonIlya marked this pull request as draft December 15, 2023 11:46

PonIlya closed this Dec 15, 2023

cmsbuild added tests-started and removed tests-pending labels Dec 18, 2023

cmsbuild added tests-approved and removed tests-started labels Dec 18, 2023

cmsbuild added tests-started and removed tests-approved labels Jan 11, 2024

cmsbuild added alca-approved fully-signed db-approved and removed alca-pending db-pending pending-signatures labels Jan 11, 2024

cmsbuild added tests-approved and removed tests-started labels Jan 11, 2024

cmsbuild added orp-approved and removed orp-pending labels Jan 12, 2024

cmsbuild merged commit 2cae9b4 into cms-sw:master Jan 12, 2024

cmsbuild mentioned this pull request Jan 13, 2024

Implementation of the Muon Track Splitting (MTS) Validation in the new TkAl all-in-one tool #43708

Merged

vlimant mentioned this pull request Jan 26, 2024

NANO (and MINI) pulling conditions from outside GT #43797

Closed

mmusich reviewed Feb 25, 2025
