Feature/update modkit version by OberonDixon · Pull Request #22 · streetslab/dimelo_v2

OberonDixon · 2025-02-21T20:46:54Z

No description provided.

…l need to apply to export functions, and need to check that performance is unimpacted.

…ause it will complicate parallelization and is not clearly useful.

…cess_pileup_row and renamed/configured a few variables to support this.

…hunk generation, currently only for load_processed.pileup_vectors_from_bedmethyl. pileup_counts_from_bedmethyl is a simpler case. Passes pre-existing tests including when chunk_size<region size, but further infrastructure is required to assess whether regions_5to3prime still works right (logic gets a bit complicated). Speed with a single core and a small number of small regions is slightly slower due to overhead necessary to support parallelization. Need to test that we get a speedup with multi-core long regions or many regions.

…ead_vectors_from_hdf5 test targets to pass tests; other test targets are the same for now.

…d generate_targets.py to properly handle copying over new kwargs from cases.py when re-running some but not all targets: if all targets are re-run, then the old test_matrix.pkl is discarded. If only some are re-run, we need the old test_matrix.pkl, but still need to copy over all the kwargs for all cases that won't be re-run and still need to copy over old results, which will be overridden for re-run targets.

…rocessed-tests-baseline Feature/dm 191 refactor load processed tests baseline

…ss_pileup_row. Passing all tests.

…ns_5to3prime logic within the pileup_vectors_from_bedmethyl function, which is the only one that actually needs it. Runs faster, still passes all tests.

…y, re-ran targets for pileup_counts_from_bedmethyl and pileup_vectors_from_bedmethyl.

…rocessed-tests-baseline DM-192 Added chunk_size for load_processes parallelization to cases.p…

…. Added progress bars and quiet toggles for pileup loaders. Set default chunk size to 1 MB after whole-chromosome speed testing. Added option to parallelize within rather than between regions with regions_to_list.

…e to avoid intermittent crashes. Adjust tabix fetch to avoid negative start values. Tweak wording on progress bars.

…d some comments.

… to all pileup plotters. A few other small tweaks.

… and Macos ostensibly have 4 cores. This should not change any outputs because for extract the test is hardcoded to use 1 core, and that is the only one with any stochasticity in parallelization.

…to avoid ever sending the cores arg twice, and added a comment to explain the reason why extract is tested unparallelized, unlike other functions.

…ment for extract. Updated test_matrix.pickle by re-running generate_targets. No changes to pileup or extract files, but should pass all tests.

…rocessed-tests-baseline New test_matrix and cases to cover 1, 2, 3, 4, and None for cores

…r versions but must meet minimum requirements.

… changes. Fixed import order

…f 0.4.0 and needs to be examined. For now, pin it to a tight range.

Oberon Dixon-Luinenburg and others added 25 commits January 8, 2025 09:18

DM-229 Consolidating parsing logic for load_processed functions. Stil…

b2b7689

…l need to apply to export functions, and need to check that performance is unimpacted.

DM-231 Removed regions=None case for pileup_counts_from_bedmethyl bec…

3f3ec85

…ause it will complicate parallelization and is not clearly useful.

DM-229 Adjusted export.pileup_to_bigwig to use new load_processed.pro…

91fb5e4

…cess_pileup_row and renamed/configured a few variables to support this.

DM-232 Added strand information to regions .bed files. Re-generated r…

caf9dcd

…ead_vectors_from_hdf5 test targets to pass tests; other test targets are the same for now.

Merge pull request #16 from streetslab/feature/DM-191-refactor-load-p…

bb526bf

…rocessed-tests-baseline Feature/dm 191 refactor load processed tests baseline

DM-192 Fixed logic to properly handle single_strand case within proce…

caa3e59

…ss_pileup_row. Passing all tests.

DM-192 Further refactor to simplify process_pileup_row and keep regio…

ac3657e

…ns_5to3prime logic within the pileup_vectors_from_bedmethyl function, which is the only one that actually needs it. Runs faster, still passes all tests.

DM-192 Added chunk_size for load_processes parallelization to cases.p…

a9648df

…y, re-ran targets for pileup_counts_from_bedmethyl and pileup_vectors_from_bedmethyl.

Merge pull request #17 from streetslab/feature/DM-191-refactor-load-p…

c830abf

…rocessed-tests-baseline DM-192 Added chunk_size for load_processes parallelization to cases.p…

DM-192,DM-236 Properly close and delink shared memory when appropriat…

56e6333

…e to avoid intermittent crashes. Adjust tabix fetch to avoid negative start values. Tweak wording on progress bars.

DM-191 Re-arrange contents of load_processed for better clarity. Adde…

97efec1

…d some comments.

DM-238 Add quiet and cores parameters and corresponding documentation…

a8ed1ae

… to all pileup plotters. A few other small tweaks.

DM-239 Added test case coverage for 1-4 cores. GitHub jobs for Ubuntu…

f1b0758

… and Macos ostensibly have 4 cores. This should not change any outputs because for extract the test is hardcoded to use 1 core, and that is the only one with any stochasticity in parallelization.

DM-239 Added cases.py support for cores=None. Tweaked dimelo_test.py …

0b68892

…to avoid ever sending the cores arg twice, and added a comment to explain the reason why extract is tested unparallelized, unlike other functions.

DM-239 Adjusted generate_targets module to properly handle cores argu…

1da3d3c

…ment for extract. Updated test_matrix.pickle by re-running generate_targets. No changes to pileup or extract files, but should pass all tests.

Ruff format fixes

fcf116f

Merge pull request #18 from streetslab/feature/DM-191-refactor-load-p…

5dad63b

…rocessed-tests-baseline New test_matrix and cases to cover 1, 2, 3, 4, and None for cores

Merge branch 'main' into feature/DM-191-refactor-load-processed

f0f027b

Fixed small mistakes from merge

bd94e78

DM-242 environment spec loosening. Python and modkit can both be newe…

060c4bf

…r versions but must meet minimum requirements.

DM-243 Version checking changed to min rather than exact; minor logic…

f0d6387

… changes. Fixed import order

Adjusted version range. Something seems to have changed too much as o…

ec1c3f6

…f 0.4.0 and needs to be examined. For now, pin it to a tight range.

OberonDixon closed this Jun 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/update modkit version#22

Feature/update modkit version#22
OberonDixon wants to merge 25 commits intomainfrom
feature/update-modkit-version

OberonDixon commented Feb 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

OberonDixon commented Feb 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant