Fix to bootstrap of free energy surfaces, affecting timing and quantitative results by mrshirts · Pull Request #535 · choderalab/pymbar

mrshirts · 2024-08-26T04:20:07Z

Free energy surface code was calling MBAR after each call to randomizing bootstraps. It does not appear to affect the results, but slows things down by a factor of a little less than 2x (146 vs 87 seconds for one sample run).

…changed indices, taking too long.

codecov · 2024-08-26T04:23:10Z

Codecov Report

❌ Patch coverage is 52.94118% with 8 lines in your changes missing coverage. Please review.
✅ Project coverage is 69.27%. Comparing base (ed40ec3) to head (c0dc85d).

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

mrshirts · 2024-08-26T04:28:23Z

Couple more changes I will put in to fix uncertainties, don't do anything yet to it . . .

mrshirts · 2024-08-26T15:58:55Z

OK, I think I got the changes in I needed to.

mikemhenry · 2024-08-26T16:35:49Z

@mrshirts when you are ready for review go ahead and add whoever you want to review this PR 😄

mrshirts · 2024-08-26T17:04:07Z

pymbar/fes.py

-                        self.u_kn[:, bootstrap_indices], self.N_k, initial_f_k=self.mbar.f_k
-                    )
-                    x_nb = x_n[bootstrap_indices]
+                # recompute MBAR.


This was unnecessary - it was running MBAR too many times. This saves approximately 2X time.

mrshirts

My comments on this for other people.

mrshirts · 2024-08-26T17:04:29Z

pymbar/fes.py

-                    fall[:, b] = h["f"] - h["f"][j]
-                df_i = np.std(fall, axis=1)
+                    fall[:, b] = h["f"] - h["f"][j]  # subtract out the reference bin
+                df_i = np.std(fall, ddof=1, axis=1)


Fixing the std definition.

mrshirts · 2024-08-26T17:04:38Z

pymbar/fes.py

                            histogram_datas[b]["f"] - histogram_datas[b]["f"].transpose()
                        )
-                dfxij_vals = np.std(fall, axis=2)
+                dfxij_vals = np.std(fall, ddof=1, axis=2)


Fixing std definition

mrshirts · 2024-08-26T17:04:58Z

pymbar/fes.py

-            kde = self.kde
-        kde.fit(x_n, sample_weight=self.w_n)
+            kde = self.kde  # use these new weights for the KDE
+            w_n = self.w_n


I actually can't remember if this was 100% necessary to get updated weights . . .

mrshirts · 2024-08-26T17:05:15Z

pymbar/fes.py

-                    fall[:, b] = h["f"] - h["f"][j]
-                df_i = np.std(fall, axis=1)
+                    fall[:, b] = h["f"] - h["f"][j]  # subtract out the reference bin
+                df_i = np.std(fall, ddof=1, axis=1)


Fix bootstrap std definition

mrshirts · 2024-08-26T17:05:41Z

pymbar/fes.py

        if reference_point == "from-lowest":
            fmin = np.min(f_i)
            f_i = f_i - fmin
+            wheremin = np.argmin(


Need to find the location that this is zeroed at for the actual computation of the std.

mrshirts · 2024-08-26T17:05:49Z

pymbar/fes.py

        elif reference_point == "from-specified":
            fmin = -self.kde.score_samples(np.array(fes_reference).reshape(1, -1))
            f_i = f_i - fmin
+            wheremin = np.argmin(


Need to find the location that this is zeroed at for the actual computation of the std.

mrshirts · 2024-08-26T17:06:16Z

pymbar/fes.py

-                fall[:, b] = -self.kdes[b].score_samples(x) - fmin
-            df_i = np.std(fall, axis=1)
+                fall[:, b] = -self.kdes[b].score_samples(x)
+                fall[:, b] -= fall[wheremin, b]


Zero out at the correct location.

mrshirts · 2024-08-28T14:40:23Z

Suggestions for anyone else who should review - or if there's anyone who could take a look? We are looking at some free energy surface problems for OpenFF, so we want to get this through.

ijpulidos · 2025-08-05T17:58:06Z

I guess another way to make sure we are doing this correctly is to know what was the issue that the OpenFF folks were having and try to reproduce it here and make it a unit test? I don't know if that's possible, but that would be great to have.

Fixing the bootstrap indices - it was calling MBAR after each set of …

2bc1a00

…changed indices, taking too long.

mrshirts added 3 commits August 26, 2024 09:18

Fixed for standard deviations and bootstrap errors for KDE.

c1d4cf0

Fix the resets.

e9f299d

removing debugging.

d409818

mrshirts added 4 commits August 26, 2024 10:05

fix formatting for black.

42e04ab

some fixes for lint.

bc7f5b8

lint checks.

185736d

more linting.

4ed45db

mrshirts self-assigned this Aug 26, 2024

mrshirts marked this pull request as draft August 26, 2024 17:02

mrshirts requested review from Lnaden, maxentile and mikemhenry August 26, 2024 17:02

mrshirts commented Aug 26, 2024

View reviewed changes

mrshirts changed the title ~~Fix to bootstrap of free energy surfaces, affecting timing~~ Fix to bootstrap of free energy surfaces, affecting timing and quantitative results Aug 26, 2024

maxentile removed their request for review August 26, 2024 17:38

mrshirts marked this pull request as ready for review August 29, 2024 19:30

mikemhenry added 3 commits March 21, 2025 15:48

Merge branch 'master' into fix_kde_boot_Aug2024

9a919a7

Merge branch 'master' into fix_kde_boot_Aug2024

2146beb

Merge branch 'master' into fix_kde_boot_Aug2024

ab33f6d

Merge branch 'main' into fix_kde_boot_Aug2024

c0dc85d

Conversation

mrshirts commented Aug 26, 2024

Uh oh!

codecov bot commented Aug 26, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

mrshirts commented Aug 26, 2024

Uh oh!

mrshirts commented Aug 26, 2024

Uh oh!

mikemhenry commented Aug 26, 2024

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mrshirts left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mrshirts commented Aug 28, 2024

Uh oh!

ijpulidos commented Aug 5, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov bot commented Aug 26, 2024 •

edited

Loading