deep copy in make_browser_figure by thekugelmeister · Pull Request #119 · streetslab/dimelo-toolkit

thekugelmeister · 2025-11-26T00:43:03Z

Avoid modifying the input dataframe in make_browser_figure, to avoid a bug where it was impossible to rerun the browser figure with collapsing reads until the input dataframe y_index column was reset to be non-unique.

The current (very simple) implementation of this fix relies on a deep copy operation, meaning that for very large datasets this may cause memory issues. I do not foresee this being the bottleneck for read browsing; by the time this becomes an issue the browser itself may be completely inoperable. However, I am leaving the space for a different fix to be implemented if this is a problem.

Addressing this issue in some other way will require a rewrite of the plotting logic, as the column in question is referenced inside of a different groupby operation.

OberonDixon

This seems like a fine approach to me. The change I am requesting is to get rid of or make non-redundant the .copy() operation, which I believe currently is creating a copy that does not then get used.

I assume that the normal level of copying depth that gets applied by .assign is ok, because we aren't making in-place modifications to python objects (such as numpy arrays) downstream in the pipeline. If I am misremembering though then it could be that we need a deeper copying operation.

dimelo/plot_read_browser.py

deep copy in make_browser_figure

378618f

OberonDixon requested changes Nov 26, 2025

View reviewed changes

dimelo/plot_read_browser.py Outdated Show resolved Hide resolved

thekugelmeister added 2 commits November 26, 2025 12:27

manually triggered pre-commit

b415cba

remove extraneous copy

eb5b999

thekugelmeister merged commit 8d31ec0 into main Nov 26, 2025
3 of 4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

deep copy in make_browser_figure#119

deep copy in make_browser_figure#119
thekugelmeister merged 3 commits intomainfrom
read_browser_y_index_bug

thekugelmeister commented Nov 26, 2025

Uh oh!

OberonDixon left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

thekugelmeister commented Nov 26, 2025

Uh oh!

OberonDixon left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants