Implement a model to determine when it's better to use a linear merge sum rather than the standard quadratic merge sum #8

mfdeakin · 2024-01-30T23:27:18Z

Use the model to reduce adaptive evalution costs, deciding at compile time whether the linear merge sums + required lower level merge sums is better than just using the top level quadratic merge sum
Correct the estimated quadratic merge latency model to account for the quadratic merge assuming neither sums have been built as non-overlapping and non-adjacent
Add eval_type to the latency estimates, needed for two_sum which may has different characteristics for vector_type outputs

sum rather than the standard quadratic merge sum Use the model to reduce adaptive evalution costs, deciding at compile time whether the linear merge sums + required lower level merge sums is better than just using the top level quadratic merge sum Correct the estimated quadratic merge latency model to account for the quadratic merge assuming neither sums have been built as non-overlapping and non-adjacent Add eval_type to the latency estimates, needed for two_sum which may has different characteristics for `vector_type` outputs TODO: Fix the failing adaptive test cases

Detect if subtrees are merged or not when using the linear merge, if not, then merge them individually before performing the linear merge Always perform a linear merge higher levels of the expression tree are expecting it Implement and use sparse_mult_merge, the algorithm used to multiply two partial sums of strongly non-adjacent values with the result being strongly non-adjacent Apparently slightly bugged, need to diagnose the issue

codecov-commenter · 2024-02-03T00:22:02Z

Codecov Report

Attention: Patch coverage is 97.03947% with 9 lines in your changes are missing coverage. Please review.

Project coverage is 98.14%. Comparing base (894bde9) to head (562261c).

Files	Patch %	Lines
src/ae_fp_eval_impl.hpp	93.33%	7 Missing ⚠️
src/ae_adaptive_predicate_eval.hpp	96.49%	2 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main       #8      +/-   ##
==========================================
+ Coverage   98.06%   98.14%   +0.08%     
==========================================
  Files          13       14       +1     
  Lines        1034     1241     +207     
==========================================
+ Hits         1014     1218     +204     
- Misses         20       23       +3

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Also add zero-pruning to the (slow) linear merge, gives substantial performance improvements (~150 us to ~15 us)

Probably incurs a performance penalty to expressions using it Also move common helful functors like is_nonzero and zero_prune_store to utils

More cleanup, move copy_nonzero to utils, use zero_prune_store_inc some more Split out the binary recursive merge from the sparse_mult_merge method, implement it as the standard recursive merge + zero pruning Use somewhat better names in sparse_mult_merge_term

Add multiplication implementation tests Remove zero_prune_store_dec, rename zero_prune_store_inc to zero_prune_store More uses of zero_prune_store instead of copy-pasta

Now do only necessary work, don't add, negate, or multiply zeros Remove non-zero filtering

mfdeakin added 3 commits January 30, 2024 15:26

Fix sparse_mult_merge

a2601be

mfdeakin added 3 commits February 22, 2024 10:17

Return iterators to the last non-zero element from merge sum

3cc2d76

Also add zero-pruning to the (slow) linear merge, gives substantial performance improvements (~150 us to ~15 us)

Add zero pruning to sparse mult

12acf42

Probably incurs a performance penalty to expressions using it Also move common helful functors like is_nonzero and zero_prune_store to utils

mfdeakin force-pushed the adaptive_linear_merge branch from bb7f7c2 to 7d86934 Compare February 25, 2024 15:53

mfdeakin added 2 commits February 26, 2024 11:48

Change memory layout for multiplication for better zero pruning

1b6b7cc

Add multiplication implementation tests Remove zero_prune_store_dec, rename zero_prune_store_inc to zero_prune_store More uses of zero_prune_store instead of copy-pasta

Take advantage of using zero pruning for adaptive evaluation

562261c

Now do only necessary work, don't add, negate, or multiply zeros Remove non-zero filtering

mfdeakin force-pushed the adaptive_linear_merge branch from 6a36e71 to 562261c Compare February 28, 2024 14:41

Add Mozilla Public License 2.0

cb5b24c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement a model to determine when it's better to use a linear merge sum rather than the standard quadratic merge sum #8

Implement a model to determine when it's better to use a linear merge sum rather than the standard quadratic merge sum #8

Uh oh!

mfdeakin commented Jan 30, 2024

Uh oh!

codecov-commenter commented Feb 3, 2024 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Implement a model to determine when it's better to use a linear merge sum rather than the standard quadratic merge sum #8

Are you sure you want to change the base?

Implement a model to determine when it's better to use a linear merge sum rather than the standard quadratic merge sum #8

Uh oh!

Conversation

mfdeakin commented Jan 30, 2024

Uh oh!

codecov-commenter commented Feb 3, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov-commenter commented Feb 3, 2024 •

edited

Loading