Chunked splitting#97

Open
jurjen93 wants to merge 16 commits into master from chunked-splitting

@jurjen93
Collaborator

@jurjen93 jurjen93 commented Feb 19, 2026

This MR:

  • Splits a CSV or astropy Table into chunks of 10 for better parallelisation and more efficient use of the explode step in DP3.
  • Renames and rewrites split_parset.cwl as split_direction_single_ms.cwl, which better fits its purpose now that it also performs concatenation after splitting a subband.
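
The chunking described in the first bullet can be sketched roughly as follows. This is only an illustration, not the code in this MR: the function names `chunk_rows` and `read_csv_rows` and the column names `source_id`, `ra`, `dec` are hypothetical, and the standard-library `csv` module is used instead of astropy to keep the example self-contained.

```python
import csv
import io
from typing import Iterator, List, Sequence, TypeVar

T = TypeVar("T")


def chunk_rows(rows: Sequence[T], chunk_size: int = 10) -> Iterator[List[T]]:
    """Yield consecutive chunks of at most `chunk_size` rows (hypothetical helper)."""
    if chunk_size < 1:
        raise ValueError("chunk_size must be at least 1")
    for start in range(0, len(rows), chunk_size):
        yield list(rows[start:start + chunk_size])


def read_csv_rows(text: str) -> List[dict]:
    """Parse CSV text into a list of row dictionaries keyed by the header."""
    return list(csv.DictReader(io.StringIO(text)))


# Made-up catalogue with 23 rows; the column names are assumptions.
example_csv = "source_id,ra,dec\n" + "\n".join(
    f"dir_{i},{i},{i}" for i in range(23)
)

rows = read_csv_rows(example_csv)
chunks = list(chunk_rows(rows))
chunk_sizes = [len(chunk) for chunk in chunks]
print(chunk_sizes)  # [10, 10, 3]
```

Each chunk could then be handed to a separate workflow job; the last chunk is simply shorter when the row count is not a multiple of 10.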

@jurjen93 jurjen93 marked this pull request as draft February 19, 2026 11:15
jurjen93 and others added 4 commits February 19, 2026 15:48
This MR:
- Fixes an issue that comes up when running get_phasediff.cwl and
concat_csv.cwl with Toil-CWL-runner without a singularity container, where
double quotation marks are not interpreted the same as single quotation
marks.
- Makes the phasediff_ms writable, because the step updates the input MS
when converting the dataset from linear to circular polarisation, which
made it impossible to restart a failed job.
- Adds back the pre-averaging before phasediff selection to speed up
processing.
This PR adds resource requirements based on running the subtraction on a
Lockman pointing on cosma.

**DIS2 h5parm merging**
```
87.61user 63.33system 2:34.87elapsed 97%CPU (0avgtext+0avgdata 86228096maxresident)k
8226448inputs+28627944outputs (1725major+1981817minor)pagefaults 0swaps
```

**Subtracting one chunk**
Looking at the output of `time` it seems the prediction only uses ~45 GB
```
3589.76user 403.85system 12:43.18elapsed 523%CPU (0avgtext+0avgdata 45454272maxresident)k
2143544inputs+100504136outputs (2932major+52721951minor)pagefaults 0swaps
```
but the OOM killer kept kicking in and DDFacet itself reported ~125 GB, so
perhaps there is a short spike somewhere or something like that. That is
where the 128 GB ramMin comes from.
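
For reference, the `maxresident` figure in the `time` output above can be converted to GiB with a small helper. This is a sketch that assumes GNU `time` reports the value in units of 1024 bytes; the function name `max_resident_gib` is hypothetical.

```python
import re


def max_resident_gib(time_output: str) -> float:
    """Extract the 'maxresident' value from GNU time output and convert it to GiB.

    Assumes the value is given in KiB (units of 1024 bytes).
    """
    match = re.search(r"(\d+)maxresident\)k", time_output)
    if match is None:
        raise ValueError("no maxresident field found")
    kib = int(match.group(1))
    return kib / 1024**2


subtract_line = (
    "3589.76user 403.85system 12:43.18elapsed 523%CPU "
    "(0avgtext+0avgdata 45454272maxresident)k"
)
subtract_gib = max_resident_gib(subtract_line)
print(f"{subtract_gib:.2f} GiB")  # 43.35 GiB
```

Note that this is the peak resident set size of the timed process only, so it need not match the total that DDFacet reports or what the OOM killer sees.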
@jurjen93 jurjen93 marked this pull request as ready for review February 27, 2026 07:36
@jurjen93 jurjen93 requested a review from tikk3r February 27, 2026 07:39
@jurjen93
Collaborator Author

The diff again shows changes from other MRs that were added after rebasing.
