Reduce memory usage by twiggler · Pull Request #51 · fox-it/dissect.ntfs

twiggler · 2026-02-11T19:54:29Z

Remove cache from indices.

Warning: Might reduce performance

Closes #50

codecov · 2026-02-11T19:56:31Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 87.40%. Comparing base (4bda3bd) to head (c93babd).

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #51      +/-   ##
==========================================
- Coverage   87.42%   87.40%   -0.03%     
==========================================
  Files          11       11              
  Lines        1249     1247       -2     
==========================================
- Hits         1092     1090       -2     
  Misses        157      157

Flag	Coverage Δ
unittests	`87.40% <100.00%> (-0.03%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Schamper · 2026-02-11T23:28:16Z

Can you provide any measurements for memory reduction?

JSCU-CNI · 2026-02-12T09:02:27Z

Perhaps a benchmark test for NTFS could be added before this PR is considered to prevent performance regression.

twiggler · 2026-02-12T17:30:25Z

Can you provide any measurements for memory reduction?

On CPython 3.12 and a 150G NTFS disk, when navigating the file system tree, I see 3.5GB Memory usage without the patch and 3.0GB with the patch, with no discernible performance loss. I will run some more tests tomorrow.

So it is not as spectacular as the ExtFs or XFS fixes. Probably because there don't appear to be any circular references.

If people have any other ideas to reduce memory consumption, feel free to chime in.

One additional thing I saw myself is that we seem to eagerly parse all entries when searching:

dissect.ntfs/dissect/ntfs/index.py

Line 103 in c93babd

entries = list(self.root.entries())

which is probably not great. Alternatively, we could do a two-pass here where we first scan the offsets and lazily parse the rest.

Update: Actually, there was a circular reference between Index and IndexBuffer, but that should not be a problem anymore with the deletion of the IndexBuffer cache

twiggler · 2026-02-13T09:46:47Z

Perhaps a benchmark test for NTFS could be added before this PR is considered to prevent performance regression.

I agree that benchmarks are a good idea in general. However, since this fix involves reducing the cache footprint to resolve the leak, some performance regression is likely unavoidable.

Since we don't currently have any benchmark tests for NTFS, could you clarify:

Which specific NTFS use cases/patterns you would like to see tested?
What is the acceptable threshold for regression (e.g., is a 5-10% slowdown acceptable to fix the leak)?"

twiggler · 2026-03-11T14:35:23Z

@JSCU-CNI do you have some guidelines about use cases / patterns for the regression tests?

Reduce memory usage

c93babd

twiggler requested a review from Miauwkeru February 11, 2026 19:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce memory usage#51

Reduce memory usage#51
twiggler wants to merge 1 commit intomainfrom
kill-cache

twiggler commented Feb 11, 2026 •

edited

Loading

Uh oh!

codecov bot commented Feb 11, 2026 •

edited

Loading

Uh oh!

Schamper commented Feb 11, 2026

Uh oh!

JSCU-CNI commented Feb 12, 2026

Uh oh!

twiggler commented Feb 12, 2026 •

edited

Loading

Uh oh!

twiggler commented Feb 13, 2026

Uh oh!

twiggler commented Mar 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

twiggler commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Schamper commented Feb 11, 2026

Uh oh!

JSCU-CNI commented Feb 12, 2026

Uh oh!

twiggler commented Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

twiggler commented Feb 13, 2026

Uh oh!

twiggler commented Mar 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

twiggler commented Feb 11, 2026 •

edited

Loading

codecov bot commented Feb 11, 2026 •

edited

Loading

twiggler commented Feb 12, 2026 •

edited

Loading