@gadorlhiac gadorlhiac commented Sep 30, 2025

Description

This PR creates SFX workflows using Cheetah. It also allows Cheetah to be configured to run compression.

The latter feature requires first merging omdevteam/om#17 in OM.

Checklist

  • Update Cheetah templates to support compression.
  • Add SFX workflows with Cheetah, including a variant that forces conversion to XTC2 to support compression, since libpressio requires the psana2 environment for some operation modes.
  • Update parameter models as needed.
  • Switch the default database specification to v2.
  • Prepare smalldata_tools integrations for compression and fix various bugs.
  • Update the setup script to account for the change to Maestro.

PR Type:

  • New feature/Enhancement

Address issues:

Testing

XPP Smalldata compression (workflow via Maestro)

Running it:

> launch_slurm -c config/xpp_compression.yaml -W workflows/common/xpp_compression.dag -e xppx1003621 -r 197 --account=lcls:data --partition=milano --ntasks=5 --nodes=1
[2025-11-24 10:18:09.242] [LWM:Manager] [info] Running workflows with SlurmLauncher.
[2025-11-24 10:18:09.242] [HTTP:Server] [info] Starting server on 0.0.0.0:41239 with 5 threads, using 64 shards, with a backlog size of 1000 and 10000 maximum events.
[2025-11-24 10:18:09.243] [LWM:Manager] [info] Beginning workflow.
[2025-11-24 10:18:09.243] [LWM:SlurmLauncher] [info] Will launch Xtc1to2Converter with: /sdf/scratch/users/d/dorlhiac/work/lute_smdconv/install/bin/submit_slurm.sh --taskname Xtc1to2Converter --config config/xpp_compression.yaml --account=lcls:data --partition=milano --ntasks=5 --nodes=1
# ...
INFO:lute.execution.executor:TaskStatus.COMPLETED
ERROR:lute.io.elog:eLog Update Failed! JID_UPDATE_COUNTERS is not defined!
INFO:lute.execution.executor:Exiting after Task completion.
TASK_LOG -- INFO:lute.tasks.xtc: Conversion completed.

Time Xtc1to2Converter spent: 
- Pending: 14 s
- Running: 14 s
# ...

Look at HDF5 files

> h5ls /sdf/data/lcls/ds/xpp/xppx1003621/hdf5/with_compression/xppx1003621_Run0197.h5/jungfrau1M_alcove
ROI_111peak_area         Dataset {10, 193, 256}
ROI_111peak_com          Dataset {10, 2}
ROI_111peak_max          Dataset {10}
ROI_111peak_mean         Dataset {10}
ROI_111peak_sum          Dataset {10}
ROI_211peak_area         Dataset {10, 193, 256}
ROI_211peak_com          Dataset {10, 2}
ROI_211peak_max          Dataset {10}
ROI_211peak_mean         Dataset {10}
ROI_211peak_sum          Dataset {10}
ROI_224peak_area         Dataset {10, 175, 256}
ROI_224peak_com          Dataset {10, 2}
ROI_224peak_max          Dataset {10}
ROI_224peak_mean         Dataset {10}
ROI_224peak_sum          Dataset {10}
ROI_232peak_area         Dataset {10, 295, 459}
ROI_232peak_com          Dataset {10, 2}
ROI_232peak_max          Dataset {10}
ROI_232peak_mean         Dataset {10}
ROI_232peak_sum          Dataset {10}
ROI_air_scatter_bottom_area Dataset {10, 463, 350}
ROI_air_scatter_bottom_com Dataset {10, 2}
ROI_air_scatter_bottom_max Dataset {10}
ROI_air_scatter_bottom_mean Dataset {10}
ROI_air_scatter_bottom_sum Dataset {10}
ROI_air_scatter_larger_area Dataset {10, 177, 567}
ROI_air_scatter_larger_com Dataset {10, 2}
ROI_air_scatter_larger_max Dataset {10}
ROI_air_scatter_larger_mean Dataset {10}
ROI_air_scatter_larger_sum Dataset {10}
ROI_large_scatter_area   Dataset {10, 220, 232}
ROI_large_scatter_com    Dataset {10, 2}
ROI_large_scatter_max    Dataset {10}
ROI_large_scatter_mean   Dataset {10}
ROI_large_scatter_sum    Dataset {10}
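The `h5ls` listing above can also be checked programmatically with h5py, including whether an HDF5 filter is attached to a dataset. The snippet below is a self-contained sketch: it writes a small file mimicking one of the datasets above and reads back its shape and filter. gzip stands in for the compressor here, since the PR's sz3 compression goes through libpressio rather than a built-in h5py filter.

```python
import h5py
import numpy as np

# Write a toy file shaped like the smalldata output above, with a filter.
with h5py.File("demo_compression.h5", "w") as f:
    grp = f.create_group("jungfrau1M_alcove")  # detector group, as in the listing
    grp.create_dataset(
        "ROI_111peak_area",
        data=np.zeros((10, 193, 256), dtype=np.float32),
        compression="gzip",  # stand-in; the PR uses sz3 via libpressio
    )

# Read back shape and filter, the programmatic analogue of `h5ls`.
with h5py.File("demo_compression.h5", "r") as f:
    dset = f["jungfrau1M_alcove/ROI_111peak_area"]
    print(dset.shape, dset.compression)  # (10, 193, 256) gzip
```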

Workflow definition

!LUTE_DAG
task_name: "ConvertXtc1to2"
slurm_params: ""
next:
- task_name: "SmallDataProducerSpack"
  slurm_params: ""
  next: []
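The `!LUTE_DAG` document above is a nested structure of tasks, each with an optional `next` list of successors. A minimal sketch of how such a definition can be flattened into a launch order (plain dicts stand in for the parsed YAML; the traversal is illustrative, not LUTE's actual scheduler):

```python
# Hypothetical parsed form of the !LUTE_DAG document above.
dag = {
    "task_name": "ConvertXtc1to2",
    "slurm_params": "",
    "next": [
        {"task_name": "SmallDataProducerSpack", "slurm_params": "", "next": []},
    ],
}

def launch_order(node):
    """Depth-first flattening: a task runs before everything in its `next` list."""
    order = [node["task_name"]]
    for child in node.get("next", []):
        order.extend(launch_order(child))
    return order

print(launch_order(dag))  # ['ConvertXtc1to2', 'SmallDataProducerSpack']
```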

YAML Configuration

%YAML 1.3
---
title: "Config to run smalldata_tools on converted XTC1 files."
experiment: "xppx1003621"
run: "{{ $RUN_NUM }}"
date: "2025/11/14"
lute_version: 0.1      # Do not change unless you need to force an older version
task_timeout: 6000
work_dir: "/sdf/data/lcls/ds/xpp/xppx1003621/results/lute_output"
...
---
# We will define some convenience keys for substitution in other parameters
EXPERIMENT_DIR: "/sdf/data/lcls/ds/xpp/{{ experiment }}"
FAKE_PSDM_SUBDIR: "xpp/{{ experiment }}/xtc"
XTC2_FILE_PATH: "{{ EXPERIMENT_DIR }}/scratch/conversion/{{ FAKE_PSDM_SUBDIR }}"
XTC2_FILE_NAME: "{{ experiment }}-r{{ run:04d }}-s000-c000.xtc2"
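The convenience keys above are expanded by string substitution, and `{{ run:04d }}` zero-pads the run number to four digits. A rough Python equivalent of that expansion (the substitution engine itself is LUTE's; this only illustrates the resulting paths and formatting):

```python
experiment = "xppx1003621"
run = 197

# {{ run:04d }} zero-pads the run number, like str.format's 04d spec.
experiment_dir = f"/sdf/data/lcls/ds/xpp/{experiment}"
fake_psdm_subdir = f"xpp/{experiment}/xtc"
xtc2_file_path = f"{experiment_dir}/scratch/conversion/{fake_psdm_subdir}"
xtc2_file_name = f"{experiment}-r{run:04d}-s000-c000.xtc2"

print(xtc2_file_name)  # xppx1003621-r0197-s000-c000.xtc2
```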

ConvertXtc1to2:               # All variables are given as strings
  node_id: "1"                # Node ID for the detector
  #eventfile: ""
  nevents: 10
  output_file: "{{ XTC2_FILE_PATH }}/{{ XTC2_FILE_NAME }}"
  xtc1_access_pattern:
    jungfrau1M_alcove: # Name of the detector in the converted XTC2
    # You can have a list of attributes you will convert that will be stored in
    # this detector
      - xtc2_attr_name: "calib"          # Name of this attribute in xtc2
        object_name: "jungfrau1M_alcove" # Name of the detector in psana1
        object_type: "psana.Detector"    # Name of the object type in psana1
        object_field_name: "calib"       # Name of the per-event method to use in psana1
    #EBeam:
    #  - xtc2_attr_name: "photon_energy"
    #    object_name: "EBeam"
    #    object_type: "psana.Detector"
    #    object_field_name: ["get","ebeamPhotonEnergy"]
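Each entry in `xtc1_access_pattern` names a psana1 object and the per-event field to call, and `object_field_name` can be either a single method name or a list of chained calls (as in the commented `EBeam` example). A toy resolver showing how such an entry could be applied; `FakeDetector` is a stand-in, since psana is not assumed here:

```python
# Toy stand-in for a psana1 Detector-like interface; psana itself is not imported.
class FakeDetector:
    def calib(self, evt):
        return [[1.0, 2.0], [3.0, 4.0]]  # pretend calibrated image

    def get(self, evt):
        class Beam:
            def ebeamPhotonEnergy(self):
                return 9400.0
        return Beam()

def resolve_field(obj, field_name, evt):
    """Apply an access pattern: first call takes the event, chained calls do not."""
    if isinstance(field_name, str):
        field_name = [field_name]
    value = obj
    for name in field_name:
        value = getattr(value, name)(evt) if value is obj else getattr(value, name)()
    return value

det = FakeDetector()
evt = object()
print(resolve_field(det, "calib", evt))                        # the image
print(resolve_field(det, ["get", "ebeamPhotonEnergy"], evt))   # 9400.0
```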

SubmitSMD:
  # Command line arguments
  #map_by: "core"   # MPI resource mapping - take care with changing unless familiar
  #bind_to: "core"  # MPI resource binding - take care with changing unless familiar
  #np: 5
  producer: "/sdf/data/lcls/ds/xpp/xppx1003621/scratch/smalldata_tools/lcls2_producers/smd_producer.py"
  run: "{{ run }}"
  experiment: "{{ experiment }}"
  #stn: 0
  #directory: "/sdf/data/lcls/ds/xpp/xppx1003621/hdf5/no_compression"
  directory: "/sdf/data/lcls/ds/xpp/xppx1003621/hdf5/with_compression"
  psdm_dir: "{{ XTC2_FILE_PATH }}"
  #config: "mfx_cctbx"
  #gather_interval: 25
  #norecorder: False
  #url: "https://pswww.slac.stanford.edu"
  #epicsAll: False
  #full: False
  #fullSum: False
  #default: true
  #image: False
  #tiff: False
  #centerpix: False
  #postRuntable: False
  #wait: False
  #xtcav: False
  #noarch: False
  # Producer variables. These are substituted into the producer to run specific
  # data reduction algorithms. Uncomment and modify as needed.
  # If you prefer to modify the producer file directly, leave commented.
  # Beginning with `getROIs`, you will need to modify the first entry to be a
  # detector. This detector MUST MATCH one of the detectors in `detnames`.
  # In the future this will be automated. If you have multiple detectors you can
  # add them with their own set of parameters.

  detnames: ["jungfrau1M_alcove"]
  # Detector sum images - per detector
  #detSumAlgos:
  #  jungfrau1M_alcove:
  #    - "calib"
  #    - "calib_max"
  # Setup the ROIs
  getROIs:
    jungfrau1M_alcove:   # Change to detector name
      - ROI: [[[1,2], [37, 230], [448, 704]]]
        name: "ROI_111peak"
        writeArea: True   # Whether to save ROI, if False, save sum but not img.
        thresADU: 8
        calcPars: True
      - ROI: [[[1,2], [37, 230], [448, 704]]]
        name: "ROI_211peak"
        writeArea: True   # Whether to save ROI, if False, save sum but not img.
        thresADU: 8
        calcPars: True
      - ROI: [[[1,2], [175,470], [26,  485]]]
        name: "ROI_232peak"
        writeArea: True   # Whether to save ROI, if False, save sum but not img.
        thresADU: 8
        calcPars: True
      - ROI: [[[0,1], [328,503], [370, 626]]]
        name: "ROI_224peak"
        writeArea: True   # Whether to save ROI, if False, save sum but not img.
        thresADU: 8
        calcPars: True
      - ROI: [[[0,1], [76, 296], [778,1010]]]
        name: "ROI_large_scatter"
        writeArea: True   # Whether to save ROI, if False, save sum but not img.
        thresADU: 8
        calcPars: True
      - ROI: [[[1,2], [23, 486], [639, 989]]]
        name: "ROI_air_scatter_bottom"
        writeArea: True   # Whether to save ROI, if False, save sum but not img.
        thresADU: 8
        calcPars: True
      - ROI: [[[1,2], [7,  184], [289, 856]]]
        name: "ROI_air_scatter_larger"
        writeArea: True   # Whether to save ROI, if False, save sum but not img.
        thresADU: 8
        calcPars: True
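Each `ROI` above is a list of `[start, stop]` bounds, one per axis of the detector array, which is where the dataset shapes in the `h5ls` listing come from (e.g. `ROI_111peak_area` is `{10, 193, 256}`: 10 events, 230-37=193 rows, 704-448=256 columns). A sketch of the selection, assuming a Jungfrau-like `(panels, rows, cols)` array; the bounds are the real config's, the frame is synthetic:

```python
import numpy as np

# Synthetic per-event frame shaped like a 1M Jungfrau: 2 panels of 512x1024.
frame = np.arange(2 * 512 * 1024, dtype=np.float32).reshape(2, 512, 1024)

roi_bounds = [[1, 2], [37, 230], [448, 704]]  # ROI_111peak from the config
sel = tuple(slice(lo, hi) for lo, hi in roi_bounds)
roi = frame[sel]

print(roi.shape)  # (1, 193, 256) -> stored per event as {nevents, 193, 256}
# The *_sum / *_max / *_mean datasets are these reductions over the ROI:
print(float(roi.sum()), float(roi.max()), float(roi.mean()))
```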
  getAzIntParams:
    jungfrau1M_alcove:
      eBeam: 9.4
      center: [26167.58, -30407.6] # um
      dis_to_sam: 45.0
      tx: 0
      ty: 0
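`getAzIntParams` supplies the azimuthal-integration geometry: `eBeam` is the photon energy in keV, `center` the beam center in µm, and `dis_to_sam` the sample-detector distance. As a back-of-envelope check (not smalldata_tools code), the wavelength implied by `eBeam: 9.4` follows from λ[Å] ≈ 12.3984 / E[keV]:

```python
# hc in keV·Angstrom; e_beam_kev is the photon energy from the config above.
HC_KEV_ANGSTROM = 12.3984
e_beam_kev = 9.4

wavelength_angstrom = HC_KEV_ANGSTROM / e_beam_kev
print(round(wavelength_angstrom, 3))  # ~1.319 Angstrom
```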
  # Compression arguments. Comment this entire block if no compression wanted
  getPressioCompression:
    jungfrau1M_alcove:
      compressor_id: "sz3"
      # Specific arguments vary depending on compressor_id
      compressor_args:
        abs_error_bound: 10
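`abs_error_bound: 10` asks sz3 to guarantee that every reconstructed value differs from the original by at most 10 (here, ADU). The libpressio/sz3 internals are out of scope, but the guarantee itself can be illustrated with a uniform scalar quantizer; this is purely didactic, not the PR's compressor:

```python
import numpy as np

def quantize(data, abs_error_bound):
    """Uniform quantization: bin width 2*bound guarantees |x - x'| <= bound."""
    step = 2.0 * abs_error_bound
    return np.round(data / step) * step

rng = np.random.default_rng(0)
data = rng.uniform(0, 1000, size=1000)
recon = quantize(data, abs_error_bound=10)

print(float(np.abs(data - recon).max()) <= 10.0)  # True: bound is respected
```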

Screenshots

gadorlhiac and others added 30 commits September 29, 2025 18:39
…x missing closing {% endif %} in cheetah template.
… cheetah output automatically. Bump CrystFEL to 0.12.0
…d to reliably find the installed Python version even when a different Python is currently active via the user environment....
…odel. Also fix update_env/shell_source conflict.
@gadorlhiac gadorlhiac mentioned this pull request Nov 15, 2025
@gadorlhiac gadorlhiac changed the title from "ENH Cheetah-based SFX workflows and compression in Cheetah" to "ENH Cheetah-based SFX workflows and compression in Cheetah and XPP preparation" Nov 15, 2025
@gadorlhiac gadorlhiac marked this pull request as ready for review November 24, 2025 18:22
@gadorlhiac gadorlhiac merged commit a7bea20 into slac-lcls:dev Nov 24, 2025
@gadorlhiac gadorlhiac deleted the ENH/new_sfx branch November 24, 2025 19:24


Development

Successfully merging this pull request may close these issues.

  • DOC Task related documentation for smalldata_tools and the XTC1-XTC2 conversion
  • DOC Provide documentation on Maestro
