Skip to content

Comments

Integrate Raphaels refactor, package structure, and documentation#4

Open
andreas-prein wants to merge 73 commits intoAndreasPrein:mainfrom
andreas-prein:integrate-student
Open

Integrate Raphaels refactor, package structure, and documentation#4
andreas-prein wants to merge 73 commits intoAndreasPrein:mainfrom
andreas-prein:integrate-student

Conversation

@andreas-prein
Copy link

This PR integrates the major refactoring and extensions developed by Raphael Graf
into the main MOAAP repository.

Key changes:

  • Introduces structured Python package layout (moaap/)
  • Integrates Sphinx-based documentation (docs/)
  • Adds examples and tutorial material
  • Improves maintainability and usability

The repository license remains GPL-3.0. Legacy entry points are preserved.

grafrap and others added 30 commits September 25, 2025 10:53
…ference.

2d is only faster for jetstreams, because of large extent.
…d, approx. half of the time is in the watershedding methods, which takes even more time for larger datasets.
…not measured with prof, but with original data, 2d was 40s, 3d was 20s and parallel is 15s total, where the constant time to save to .nc file is included)
…y footprint and see if parallel execution could be good.
…distribution and when to use single core and when the parallel version
…e ratio of the picture is x:y, if x>y the chunking should also be such that x>y
add of more comments and function description,
add of check for sequential and parallel code depending on memory usage and also add default chunking strategy, if none is given.
grafrap and others added 30 commits December 10, 2025 11:43
…roducing shared memory, with the exact same (+-0.05%) result as in the sequential algorithm.

Add Documentation for new functions
…acy_psl_tracking function, the labeling was not removed, since this changes the endresult of the function.
…an be simulated, by just giving 1 as input for both chunksizes. Additionally, an mp_method was defined based on the findings on Euler, of when the data generates a timeout. If that is the case, spawn should be used. Generally, fork is better for small problems, whereas spawn has nearly no overhead for large problems (i.e. >400M cells in all dimensions)
… min_js_anomaly from upper level call and use definition from the config file. Update tutorial and correction of a second bug, when trying to call watershed with just one total chunk
…omatic choosing of multiprocessing method and default of number of processes set to 'None'
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants