fix: stabilize spearman distance correlation sampling #7

Draft
GeorgePearse wants to merge 1 commit into main from
fix/spearman-subsample

@GeorgePearse
Owner

Summary

Changes sample_size so that it subsamples points (rows) first, then computes the Spearman correlation over the pairwise distances among the sampled points.

Why

Sampling arbitrary pairs can be noisy and can over/under-emphasize regions depending on the distribution of pairs; subsampling points tends to give a more stable estimate for large datasets.

Testing

  • Not run (pure Python change).

@openhands-ai

openhands-ai bot commented Dec 24, 2025

Looks like there are a few issues preventing this PR from being merged!

  • GitHub Actions are failing:
    • Performance Benchmarks

If you'd like me to help, just leave a comment, like

@OpenHands please fix the failing actions on PR #7 at branch `fix/spearman-subsample`

Feel free to include any additional details that might help me get this PR into a better state.
