
Modernize multiprocessing#181

Merged
gmloose merged 13 commits into master from issue-180_modernize-multiprocessing
Sep 5, 2025
Conversation

@gmloose
Collaborator

@gmloose gmloose commented Aug 12, 2025

This pull request implements the changes suggested in #180.

gmloose added 8 commits August 7, 2025 13:47
Sorted import statements, minor change in log message.
The wrong version of the updated PLOT operation was accidentally committed. This one should be OK.
Removed the now obsolete `multiprocManager` class. It has been replaced by `multiprocessing.Pool`.
@gmloose gmloose marked this pull request as draft August 12, 2025 13:26
@gmloose gmloose self-assigned this Aug 12, 2025
@gmloose gmloose requested a review from darafferty August 12, 2025 13:26
@github-actions

github-actions bot commented Aug 12, 2025

☂️ Python Coverage

current status: ✅

Overall Coverage

| Lines | Covered | Coverage | Threshold | Status |
|------:|--------:|---------:|----------:|:------:|
| 4361 | 765 | 18% | 0% | 🟢 |

New Files

No new covered files...

Modified Files

| File | Coverage | Status |
|------|---------:|:------:|
| losoto/lib_operations.py | 27% | 🟢 |
| losoto/operations/flag.py | 4% | 🟢 |
| losoto/operations/flagextend.py | 13% | 🟢 |
| losoto/operations/flagstation.py | 2% | 🟢 |
| losoto/operations/plot.py | 2% | 🟢 |
| losoto/operations/reweight.py | 13% | 🟢 |
| TOTAL | 10% | 🟢 |

updated for commit: e763aac by action🐍

Two scripts that help in debugging/comparing the output of recently changed operations against a reference set.
Both sets need to be generated. First run `benchmark_all.sh` on the reference (e.g. the current `master`). Next run it on the changed operations in a different directory. Finally run `compare.sh` to compare the contents of the two directories.
The parset files contain inputs for each of the LoSoTo operations under test.

NOTE: This directory can be removed once everything works as expected.
TODO: Investigate how these tests can somehow be converted into regression tests (run with `pytest`).
@gmloose
Collaborator Author

gmloose commented Aug 12, 2025

Note for the reviewer(s): the debug directory contains stuff that I used to verify that the new implementation produces the same result as the old one. It doesn't need to be reviewed, nor does it need to be merged, though I welcome suggestions about how these tests could be turned into CI tests.

The file `compare.out` contains the difference between two runs.
Note that the run times may vary by a few seconds between runs.
The differences are nothing to worry about, though for very short processes (e.g. in the PLOT operation) you'll notice that the new implementation has a bit more overhead.
@tmillenaar

> Note for the reviewer(s): the debug directory contains stuff that I used to verify that the new implementation produces the same result as the old one. It doesn't need to be reviewed, nor does it need to be merged, though I welcome suggestions about how these tests could be turned into CI tests.

For fun I tried to run debug/benchmark_all.sh for myself but I don't have the required bandpass.input.h5. Is this a file I can download from somewhere?

```python
    return multiprocessing.cpu_count()


class multiprocManager(object):
```


Quite the nice cleanup :)

```python
ncpu = ncpu if ncpu > 0 else nproc()  # use all available CPUs if ncpu is not set
logging.debug("Using %s CPU(s) for operation FLAG.", ncpu)
with multiprocessing.Pool(ncpu) as pool:
    results = pool.starmap(_flag, args)
```


You mentioned on Slack that the new pool implementation was a bit slower and wondered whether it was related to the overhead of running the `Pool`. While that can be the case, I wonder if the difference lies in the original queue returning results as soon as they were ready, whereas `pool.starmap` returns them in order once all are finished. We can test this by using `imap_unordered` instead of `starmap`.
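The ordering difference can be sketched as follows. This is a standalone toy, not the LoSoTo code: `work` and its delays are hypothetical stand-ins for the real operations, and `work_star` is a helper needed because `imap_unordered` takes a single-argument function, so the argument tuples have to be unpacked by a picklable top-level callable.

```python
import multiprocessing
import time


def work(delay, label):
    # hypothetical task; the delay stands in for a unit of real work
    time.sleep(delay)
    return label


def work_star(args):
    # top-level helper so the callable stays picklable for the pool
    return work(*args)


if __name__ == "__main__":
    tasks = [(0.2, "slow"), (0.0, "fast")]
    with multiprocessing.Pool(2) as pool:
        # starmap blocks until every task is done and preserves input order
        print(pool.starmap(work, tasks))  # ['slow', 'fast']
        # imap_unordered yields each result as soon as its worker finishes,
        # so 'fast' will typically come out before 'slow'
        for result in pool.imap_unordered(work_star, tasks):
            print(result)
```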

How big were the performance differences you observed?

Collaborator Author

@gmloose gmloose Aug 13, 2025


> You mentioned on Slack that the new pool implementation was a bit slower and wondered whether it was related to the overhead of running the `Pool`. While that can be the case, I wonder if the difference lies in the original queue returning results as soon as they were ready, whereas `pool.starmap` returns them in order once all are finished. We can test this by using `imap_unordered` instead of `starmap`.
>
> How big were the performance differences you observed?

Not that large; have a look at `debug/compare.out`. The timings can also differ by a few seconds between runs. But overall the `Pool` seems to be slightly slower. I like your suggestion to use `imap_unordered`; I will try that.

Collaborator Author


Just realized I cannot use `imap_unordered` as a drop-in replacement, because I need the `starmap` functionality. I would need something like `starmap_unordered`, which unfortunately doesn't exist. A work-around would be to write that wrapper myself.
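For what it's worth, such a wrapper is small. A minimal sketch (the names `_star` and `starmap_unordered` are made up here, not part of any library; `functools.partial` is used because a lambda would not be picklable for the worker processes):

```python
import multiprocessing
from functools import partial


def _star(func, args):
    # unpack an argument tuple, mimicking what starmap does internally
    return func(*args)


def starmap_unordered(pool, func, iterable):
    # yield results in completion order while still unpacking argument
    # tuples; partial(_star, func) is picklable as long as func is a
    # top-level function
    return pool.imap_unordered(partial(_star, func), iterable)


def add(a, b):
    return a + b


if __name__ == "__main__":
    with multiprocessing.Pool(2) as pool:
        print(set(starmap_unordered(pool, add, [(1, 2), (3, 4)])))  # {3, 7}
```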


Fair, let's stick with `starmap`. The performance loss is small, and the change both simplifies the code and removes the potential for hanging queues, so it's still pretty good all things considered.

@gmloose
Collaborator Author

gmloose commented Aug 13, 2025

>> Note for the reviewer(s): the debug directory contains stuff that I used to verify that the new implementation produces the same result as the old one. It doesn't need to be reviewed, nor does it need to be merged, though I welcome suggestions about how these tests could be turned into CI tests.
>
> For fun I tried to run debug/benchmark_all.sh for myself but I don't have the required bandpass.input.h5. Is this a file I can download from somewhere?

The input file is almost 1 GB in size. I can put it somewhere where you can download it, if you want. But this is indeed the problematic part of reproducing the results.

@gmloose gmloose marked this pull request as ready for review August 13, 2025 15:49
@gmloose gmloose requested review from tammojan and removed request for darafferty August 21, 2025 08:22
@gmloose gmloose requested review from darafferty and removed request for tammojan September 4, 2025 13:14
@darafferty
Collaborator

Looks good to me! Out of curiosity, does the new implementation indeed solve the deadlock issues mentioned in #180? (I'm assuming it does, just wondering if you had a chance to check it.)

@gmloose
Collaborator Author

gmloose commented Sep 5, 2025

> Looks good to me! Out of curiosity, does the new implementation indeed solve the deadlock issues mentioned in #180? (I'm assuming it does, just wondering if you had a chance to check it.)

Yes, it solves the deadlock issue. You now get a clear error message from the child process if it dies due to an exception.
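This is behaviour that `multiprocessing.Pool` provides out of the box: an exception raised in a worker is pickled and re-raised in the parent when the result is collected, instead of the parent blocking forever on a queue fed by a dead child. A minimal standalone sketch (the `fragile` function is hypothetical, not part of LoSoTo):

```python
import multiprocessing


def fragile(x):
    # hypothetical worker that dies on one particular input
    if x == 2:
        raise ValueError("bad input: %d" % x)
    return x * x


if __name__ == "__main__":
    with multiprocessing.Pool(2) as pool:
        try:
            pool.starmap(fragile, [(1,), (2,), (3,)])
        except ValueError as exc:
            # the worker's exception surfaces here in the parent process
            print("caught:", exc)  # caught: bad input: 2
```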

Removed the debug directory. It is no longer necessary and litters the project.
@gmloose gmloose merged commit 24b18b1 into master Sep 5, 2025
4 checks passed
